Skip to content

pachterlab/isolate_transcripts

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

isolate_transcripts.py

Given a GTF annotation and a corresponding fasta file, prints out a file containing the processed transcripts, and another file containing a window of size 2L-2 around each splice junction of each transcript.

usage

./isolate_transcripts.py annotation.gtf scaffolds.fasta output_directory [-L LENGTH]

  • annotation.gtf Contains annotations for exons, transcripts, and genes
  • scaffolds.fasta Fasta file that corresponds to annotation.gtf
  • output_directory The directory in which the results are to be written.
  • [-L LENGTH], where LENGTH is the read length (default 98)

transcripts to capture

In order to generate a list of transcripts to capture using bustools capture, you can use the following sed command on the fasta files generated by isolate_transcripts.py:

sed -n 's/^>//p' foo.fasta > foo.to_capture.fasta

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages