Posted in announcements on July 12th, 2010 by admin – Be the first to comment
Many organisms genomes contain a high percentage of so called “repetitive elements”, and the study of these elements is a very active area of research. After a research group has created a draft assembly of some organisms genome, usually the next step is to start annotating various genomic features such as genes and repetitive elements. One tool, RepeatMasker, by the Institute for System Biology has emerged as a defacto standard in de novo, and database repeat identification and classification.
I’ve added some more code to the NextGenScripts page, one little script helps split fasta formatted files into smaller pieces. The other is a script to run RepeatMasker on a cluster, this speeds up the programs execution time greatly. None of my code mentions how to install or work with RepeatMasker, so please follow the above link to install the software first.
Posted in announcements on July 1st, 2010 by admin – Be the first to comment
I added a new page titled Preferred Tools which is nothing more than a list of tasks and the tools I use to accomplish them. I hope it is useful to people out there. Secondly I started adding some code to my repositories. You can check out both pages by clicking links in the right hand menu.
Posted in linux, tips and tricks on June 30th, 2010 by admin – 1 Comment
Here is a quick command line script that should work on any linux system with wget. This command will download chromosomes 1- 8 for the possum. You would need to modify the list of numbers to be “X Y M Un” to get the non-numeric chromosomes.
for i in 01 02 03 04 05 06 07 08; do wget "ftp://ftp.ncbi.nih.gov/genomes/Monodelphis_domestica/CHR_${i}/mdm_ref_chr${i}.fa.gz"; done;
You can do this for any species or really any organized FTP site. Here is the link to the NCBI genome ftp site.
Posted in announcements on June 29th, 2010 by admin – Be the first to comment
Welcome to my updated portfolio. Please stay tuned as I add information about my current work, publications and even some code.