Skip to content

dangliu/Assembly-breaking

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Assembly-breaking

Scripts related to assembly breaking

-Genome_cut.py

Input[1]: reference.genome.len.txt, as "Scaffold_ID[tab]Scaffold_length"

Input[2]: n (kb)

usage = """ Generate fragmented genome gff with scaffolds randomly broken into fixed size of n kb from genome length file with n kb specified, usage:

python Genome_cut.py genome.len.txt n

Written by Dang, Academia sinica, Dec 28, 2016.

"""

-GeneIntoFrag.py

Input[1]: fragment.gff (output of Genome_cut.py)

Input[2]: reference gff

usage = """ Generate annotated gff for fragmented genome segments, usage:

python GeneIntoFrag.py fragment.gff ref.gff

Written by Dang, Academia sinica, Jan 05, 2017.

"""

Fasta file of fragmented assembly can be made using bedtools http://bedtools.readthedocs.io/ with original assembly fasta file

"""

-Sample Dataset

c_elegans.ref.fa.len.txt and c_elegans.coding_gene.gff are processed from c_elegans.genome.fasta and c_elegans.gff downloaded from WS255 dataset in WormBase: http://www.wormbase.org/

About

Scripts related to assembly breaking

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages