July 6-7, 2017
9:30 am - 5:00 pm
Instructors: Jason Williams, John Fonner
Helpers:
Data Carpentry workshops are for any researcher who has data they want to analyze, and no prior computational experience is required. This hands-on workshop teaches basic concepts, skills and tools for working more effectively with data.
We will cover data analysis and visualization in R, as well as cloud computing and the command line for genomics. Participants should bring their laptops and plan to participate actively. By the end of the workshop, learners should be able to manage and analyze data more effectively and to apply the tools and approaches directly to their ongoing research.
Who: The course is aimed at graduate students and other researchers.
Where: 1 University of Arkansas, Fayetteville, AR 72701. Get directions with OpenStreetMap or Google Maps.
Requirements: Participants must bring a laptop with a few specific software packages installed (listed below). They are also required to abide by Data Carpentry's Code of Conduct.
Contact: Please email williams@cshl.edu for more information. Register directly through CyVerse at http://www.cyverse.org/blog/events/cyverse-tools-and-services-data-carpentry-workshop-langebio-cinvestav-irapuato-mx-may-30
Morning: Intro, Data Processing, and Organization | |
Intro to Data Carpentry | Jason |
Intro to the Data Set | Jason |
Genomics Data Tidiness | John |
Connecting to the Cloud in 5 Minutes or Less | John |
R and RStudio Orientation | Jason |
Intro to R and RStudio | Jason |
Dataframes and Metadata | John |
Afternoon: Data Cleaning and Visualization in R - Intro to Linux | |
Dataframes Continued | Jason |
Data Cleaning and Manipulation with dplyr | Jason |
Data Cleaning and Manipulation with dplyr (cont'd) | Jason |
Plotting and Visualizing in R | Jason |
Data Importing and Uploading | John |
Intro to the Linux Shell - Filesystem and Navigation | John |
Morning: Using Linux to organize and process Genomics Data | |
Intro to the Linux Shell - Searching and Metadata | Jason |
Project Organization and Documentation | John |
'For' loops - QC of Sequencing Data | Jason |
Afternoon: Using Linux to Automate | |
Automating Analyses - Shell Scripting | John |
Creating Workflows - Variant Calling Workflow | Jason |
Workshop Conclusion | Please take the post-survey |
How to Make This Work on Your Own | |
Launching Your Own Cloud Instances | On Your Own |
Etherpad: http://pad.software-carpentry.org/2017-07-06-uniarkansas.
We will use this Etherpad for chatting, taking notes, and sharing URLs and bits of code.
To participate in a Data Carpentry workshop, you will need working copies of the described software. Please make sure to install everything (or at least to download the installers) before the start of your workshop. Bring and use your own laptop to ensure that your tools are set up properly for an efficient workflow once you leave the workshop.
Please follow these Setup Instructions.
We maintain a list of common installation issues on the Configuration Problems and Solutions wiki page. It is written as a reference for instructors, but participants may find it useful as well.