Package org.biojava.nbio.core.sequence
Class TranscriptSequence
java.lang.Object
org.biojava.nbio.core.sequence.template.AbstractSequence<NucleotideCompound>
org.biojava.nbio.core.sequence.DNASequence
org.biojava.nbio.core.sequence.TranscriptSequence
- All Implemented Interfaces:
Iterable<NucleotideCompound>
,Accessioned
,Sequence<NucleotideCompound>
This is the sequence if you want to go from a gene sequence to a protein sequence. Need to start with a
ChromosomeSequence then getting a GeneSequence and then a TranscriptSequence
- Author:
- Scooter Willis
-
Nested Class Summary
Nested classes/interfaces inherited from class org.biojava.nbio.core.sequence.DNASequence
DNASequence.DNAType
Nested classes/interfaces inherited from class org.biojava.nbio.core.sequence.template.AbstractSequence
AbstractSequence.AnnotationType
-
Constructor Summary
-
Method Summary
Modifier and TypeMethodDescriptionaddCDS
(AccessionID accession, int begin, int end, int phase) Add a Coding Sequence region with phase to the transcript sequencevoid
addStartCodonSequence
(AccessionID accession, int begin, int end) void
addStopCodonSequence
(AccessionID accession, int begin, int end) Get the CDS sequences that have been added to the TranscriptSequencesGet the stitched together CDS sequences then maps to the cDNAint
Returns the length of the SequenceReturn a list of protein sequences based on each CDS sequence where the phase shift between two CDS sequences is assigned to the CDS sequence that starts the triplet.Get the protein sequenceGet the protein sequence with user defined TranscriptEngineRemove a CDS or coding sequence from the transcript sequenceMethods inherited from class org.biojava.nbio.core.sequence.DNASequence
getComplement, getDNAType, getGCCount, getReverse, getReverseComplement, getRNASequence, getRNASequence, getRNASequence, getRNASequence, main, setDNAType
Methods inherited from class org.biojava.nbio.core.sequence.template.AbstractSequence
addFeature, addFeature, addNote, countCompounds, getAccession, getAnnotationType, getAsList, getBioBegin, getBioEnd, getCompoundAt, getCompoundSet, getDatabaseReferences, getDescription, getFeatureRetriever, getFeatures, getFeatures, getFeatures, getFeaturesByType, getFeaturesKeyWord, getIndexOf, getInverse, getLastIndexOf, getNotesList, getOriginalHeader, getParentSequence, getProxySequenceReader, getSequenceAsString, getSequenceAsString, getSequenceScore, getSource, getSubSequence, getTaxonomy, getUserCollection, iterator, removeFeature, removeNote, setAccession, setAnnotationType, setBioBegin, setBioEnd, setCompoundSet, setDatabaseReferences, setDescription, setFeatureRetriever, setFeaturesKeyWord, setNotesList, setOriginalHeader, setParentSequence, setProxySequenceReader, setSequenceScore, setSource, setTaxonomy, setUserCollection, toString
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
Methods inherited from interface java.lang.Iterable
forEach, spliterator
-
Constructor Details
-
TranscriptSequence
- Parameters:
parentDNASequence
-begin
-end
- inclusive of end
-
-
Method Details
-
getLength
public int getLength()Description copied from interface:Sequence
Returns the length of the Sequence- Specified by:
getLength
in interfaceSequence<NucleotideCompound>
- Overrides:
getLength
in classAbstractSequence<NucleotideCompound>
-
getStrand
- Returns:
- the strand
-
removeCDS
Remove a CDS or coding sequence from the transcript sequence- Parameters:
accession
-- Returns:
-
getCDSSequences
Get the CDS sequences that have been added to the TranscriptSequences- Returns:
-
addCDS
Add a Coding Sequence region with phase to the transcript sequence- Parameters:
accession
-begin
-end
-phase
- 0,1,2- Returns:
- Throws:
Exception
-
getProteinCDSSequences
Return a list of protein sequences based on each CDS sequence where the phase shift between two CDS sequences is assigned to the CDS sequence that starts the triplet. This can be used to map a CDS/exon region of a protein sequence back to the DNA sequence If you have a protein sequence and a predicted gene you can take the predict CDS protein sequences and align back to the protein sequence. If you have errors in mapping the predicted protein CDS regions to an the known protein sequence then you can identify possible errors in the prediction- Returns:
-
getDNACodingSequence
Get the stitched together CDS sequences then maps to the cDNA- Returns:
-
getProteinSequence
Get the protein sequence- Returns:
-
getProteinSequence
Get the protein sequence with user defined TranscriptEngine- Parameters:
engine
-- Returns:
-
getStartCodonSequence
- Returns:
- the startCodonSequence
-
addStartCodonSequence
- Parameters:
startCodonSequence
- the startCodonSequence to set
-
getStopCodonSequence
- Returns:
- the stopCodonSequence
-
addStopCodonSequence
- Parameters:
stopCodonSequence
- the stopCodonSequence to set
-