Install software and use FileZilla to transfer files¶
Before moving forward to mapping reads to the genome, it is necessary to QC the reads and remove low quality reads and adapter sequences. Since most of you are familiar with Trimmomatic and FastQC on the HPCC from MMG 433, we will do all the analysis there instead of on your own local machine. Further we need the ability to perform differential gene expression between conditions. We have chosen to do this through several software packages. Below are some instructions on how to install them.
Using FileZilla to transfer files:¶
R¶
In order to do differential gene expression to compare treatments and identify what is changing at the transcript level, we need to install the statistical programming language R and a really neat interface to work in R called RStudio.
- R can be downloaded from here.
- Click on the appropriate link for your operating system (Linux, Mac OS X, or Windows).
- Then, click on the latest version of the software. This will initiate the download. Windows users unless you know you already have R installed, click on the install R for the first time link (see below). Mac users you have two options based on what flavor of OS X you have. If you are not sure what version you have, look at the screen shots below.
Windows users:
Mac users (to determine OS X version):
Mac users (version of R to download):
- After the download finishes, double-click on the file and follow the instructions to install the software.
- Congratulations! You’ve installed yet another piece of software for this module.
RStudio¶
RStudio is a fantastic interface to work in R. R does have a graphical user interface (GUI) that you can download and use, however I find RStudio much more intuitive/easier to use.
- RStudio can be downloaded here.
- We want to download and install the open-source version of RStudio for the desktop.
- Double click the file after the software has finished downloading and follow the instructions to install the software.
- That’s it!
Qualimap¶
The last piece of software that is nice to have is a software suite that works with Java and R to generate PDF documents summarizing the data post-alignment to the genome.
- Qualimap can be downloaded here.
- Click on the .zip file (as seen below)
- The file will end up in your downloads folder and will likely need to be unzipped as it is compressed. This can typically be accomplished just by double clicking on the file itself in your downloads folder.
- Once this is done, move the folder to your desktop.
- Before we can run this software suite, we will need to install a few packages in R: optparse (from CRAN), NOISeq, Repitools, Rsamtools, GenomicFeatures, rtracklayer (all available from Bioconductor).
- To do this, open RStudio, and click on the packages tab and then the Install button.
Type optparse into the Packages (separate multiple with space or comma): field. Then click Install. RStudio should do the rest.
To install the packages from Bioconductor, click next to the > cursor. Copy and paste this command next to the > and hit Enter/Return:
source("http://bioconductor.org/biocLite.R")
- Then click next to the > cursor and type biocLite(“NOISeq”) and hit Enter/Return. Repeat for Repitools, Rsamtools, GenomicFeatures, and rtracklayer.
- That’s all the software we need to install for now!
Transferring files from your machine to the HPCC using FileZilla¶
1. If you haven’t already downloaded and installed FileZilla, please do so. We want to download the FileZilla Client and not the server version.
2. Open the application and then we will need to input a few things to get connected to the MSU HPCC.
- Host: hpcc.msu.edu
- Username: Your MSU NetID
- Password: Your MSU NetID password
- Port: 22
- Click Quickconnect
3. Now that you are connected, you can move files from your computer (red circle) to the MSU HPCC (green circle) and vice versa, simply by double clicking the file. You can navigate through the folders on your machine and the HPCC just like you would when browsing for documents on your own computer. The connection closes if you exit out of the application.
Let’s get to work!