Assignment
- Given a protein sequence, find out which protein it is and from which organism it belongs using BLAST Program.
Protein Sequence
MERGVRRGAALVAAWRSLWERGGLALFRPQCRTGCGACRVQGTRPFSLSAAASAVLGLGSWGGDSGKQKLTLQDVAELIRKKECRRVVVMAGAGIS
TPSGIPDFRSPGSGLYSNLEQYNIPYPEAIFELAYFFINPKPFFTLAKELYPGNYRPNYAHYFLRLLHDKGLLLRLYTQNIDGLERVAGIPPDRLVEAHGT
FATATCTVCRRKFPGEDFRGDVMADKVPHCRVCTGIVKPDIVFFGEELPQRFFLHMTDFPMADLLFVIGTSLEVEPFASLAGAVRNSVPRVLINRDLV
GPFAWQQRYNDIAQLGDVVTGVEKMVELLDWNEEMQTLIQKEKEKLDAKDK
From the given details, find
- What is the length of the query sequence ?
- What is the number of sequences in the database searched ?
- What is the E-value ?
- Given a nucleotide sequence, find the details regarding the sequence like which organism, name, accession id etc. using BLAST
Nucleotide Sequence :
AGTGCCGCGCGTCGAGCGGAGCAGAGGAGGCGAGGGCGGAGGGCCAGAGAGGCAGTTGGAAGATGGCGGACGAGGTGGCGCTCGCCCTTC
AGGCCGCCGGCTCCCCTTCCGCGGCGGCCGCCATGGAGGCCGCGTCGCAGCCGGCGGACGAGCCGCTCCGCAAGAGGCCCCGCCGAGA
CGGGCCTGGCCTCGGGCGCAGCCCGGGCGAGCCGAGCGCAGCAGTGGCGCCGGCGGCCGCGGGGTGTGAGGCGGCGAGCGCCGCGGC
CCCGGCGGCGCTGTGGCGGGAGGCGGCAGGGGCGGCGGCGAGCGCGGAGCGGGAGGCCCCGGCGACGGCCGTGGCCGGGGACGGAGA
CAATGGGTCCGGCCTGCGGCGGGAGCCGAGGGCGGCTGACGACTTCGACGACGACGAGGGCGAGGAGGAGGACGAGGCGGCGGCGGCAG
CGGCGGCGGCAGCGATCGGCTACCGAGACAACCTCCTGTTGACCGATGGACTCCTCACTAATGGCTTTCATTCCTGTGAAAGTGATGACGATG
ACAGAACGTCACACGCCAGCTCTAGTGACTGGACTCCGCGGCCGCGGATAGGTCCATATACTTTTGTTCAGCAACATCTCATGATTGGCACCG
ATCCTCGAACAATTCTTAAAGATTTATTACCAGAAACAATTCCTCCACCTGAGCTGGATGATATGACGCTGTGGCAGATTGTTATTAATATCCTTTC
AGAACCACCAAAGCGGAAAAAAAGAAAAGATATCAATACAATTGAAGATGCTGTGAAGTTACTGCAGGAGTGTAAAAAGATAATAGTTCTGACTGGA
GCTGGGGTTTCTGTCTCCTGTGGGATTCCTGACTTCAGATCAAGAGACGGTATCTATGCTCGCCTTGCGGTGGACTTCCCAGACCTCCCAGA
CCCTCAAGCCATGTTTGATATTGAGTATTTTAGAAAAGACCCAAGACCATTCTTCAAGTTTGCAAAGGAAATATATCCCGGACAGTTCCAGCCGT
CTCTGTGTCACAAATTCATAGCTTTGTCAGATAAGGAAGGAAAACTACTTCGAAATTATACTCAAAATATAGATACCTTGGAGCAGGTTGCAGGAAT
CCAAAGGATCCTTCAGTGTCATGGTTCCTTTGCAACAGCATCTTGCCTGATTTGTAAATACAAAGTTGATTGTGAAGCTGTTCGTGGAGACATTTTT
AATCAGGTAGTTCCTCGGTGCCCTAGGTGCCCAGCTGATGAGCCACTTGCCATCATGAAGCCAGAGATTGTCTTCTTTGGTGAAAACTTACCAG
AACAGTTTCATAGAGCCATGAAGTATGACAAAGATGAAGTTGACCTCCTCATTGTTATTGGATCTTCTCTGAAAGTGAGACCAGTAGCACTAATTCC
AAGTTCTATACCCCATGAAGTGCCTCAAATATTAATAAATAGGGAACCTTTGCCTCATCTACATTTTGATGTAGAGCTCCTTGGAGACTGCGATGTT
ATAATTAATGAGTTGTGTCATAGGCTAGGTGGTGAATATGCCAAACTTTGTTGTAACCCTGTAAAGCTTTCAGAAATTACTGAAAAACCTCCACGCC
CACAAAAGGAATTGGTTCATTTATCAGAGTTGCCACCAACACCTCTTCATATTTCGGAAGACTCAAGTTCACCTGAAAGAACTGTACCACAAGACT
CTTCTGTGATTGCTACACTTGTAGACCAAGCAACAAACAACAATGTTAATGATTTAGAAGTATCTGAATCAAGTTGTGTGGAAGAAAAACCACAAGAA
GTACAGACTAGTAGGAATGTTGAGAACATTAATGTGGAAAATCCAGATTTTAAGGCTGTTGGTTCCAGTACTGCAGACAAAAATGAAAGAACTTCAGT
TGCAGAAACAGTGAGAAAATGCTGGCCTAATAGACTTGCAAAGGAGCAGATTAGTAAGCGGCTTGAGGGTAATCAATACCTGTTTGTACCACCAAA
TCGTTACATATTCCACGGTGCTGAGGTATACTCAGACTCTGAAGATGACGTCTTGTCCTCTAGTTCCTGTGGCAGTAACAGTGACAGTGGCACAT
GCCAGAGTCCAAGTTTAGAAGAACCCTTGGAAGATGAAAGTGAAATTGAAGAATTCTACAATGGCTTGGAAGATGATACGGAGAGGCCCGAATGTG
CTGGAGGATCTGGATTTGGAGCTGATGGAGGGGATCAAGAGGTTGTTAATGAAGCTATAGCTACAAGACAGGAATTGACAGATGTAAACTATCCAT
CAGACAAATCATAACACTATTGAAGCTGTCCGGATTCAGGAATTGCTCCACCAGCATTGGGAACTTTAGCATGTCAAAAAATGAATGTTTACTTGTG
AACTTGAACAAGGAAATCTGAAAGATGTATTATTTATAGACTGGAAAATAGATTGTCTTCTTGGATAATTTCTAAAGTTCCATCATTTCTGTTTGTACTT
GTACATTCAACACTGTTGGTTGACTTCATCTTCCTTTCAAGGTTCATTTGTATGATACATTCGTATGTATGTATAATTTTGTTTTTTGCCTAATGAGTT
TCAACCTTTTAAAGTTTTCAAAAGCCATTGGAATGTTAATGTAAAGGGAACAGCTTATCTAGACCAAAGAATGGTATTTCACACTTTTTTGTTTGTAAC
ATTGAATAGTTTAAAGCCCTCAATTTCTGTTCTGCTGAACTTTTATTTTTAGGACAGTTAACTTTTTAAACACTGGCATTTTCCAAAACTTGTGGCAGC
TAACTTTTTAAAATCACAGATGACTTGTAATGTGAGGAGTCAGCACCGTGTCTGGAGCACTCAAAACTTGGTGCTCAGTGTGTGAAGCGTACTTAC
TGCATCGTTTTTGTACTTGCTGCAGACGTGGTAATGTCCAAACAGGCCCCTGAGACTAATCTGATAAATGATTTGGAAATGTGTTTCAGTTGTTCTA
GAAACAATAGTGCCTGTCTATATAGGTCCCCTTAGTTTGAATATTTGCCATTGTTTAATTAAATACCTATCACTGTGGTAGAGCCTGCATAGATCTTC
ACCACAAATACTGCCAAGATGTGAATATGCAAAGCCTTTCTGAATCTAATAATGGTACTTCTACTGGGGAGAGTGTAATATTTTGGACTGCTGTTTTT
CCATTAATGAGGAAAGCAATAGGCCTCTTAATTAAAGTCCCAAAGTCATAAGATAAATTGTAGCTCAACCAGAAAGTACACTGTTGCCTGTTGAGGAT
TTGGTGTAATGTATCCCAAGGTGTTAGCCTTGTATTATGGAGATGAATACAGATCCAATAGTCAAATGAAACTAGTTCTTAGTTATTTAAAAGCTTAGC
TTGCCTTAAAACTAGGGATCAATTTTCTCAACTGCAGAAACTTTTAGCCTTTCAAACAGTTCACACCTCAGAAAGTCAGTATTTATTTTACAGACTTC
TTTGGAACATTGCCCCCAAATTTAAATATTCATGTGGGTTTAGTATTTATTACAAAAAAATGATTTGAAATATAGCTGTTCTTTATGCATAAAATACCCA
GTTAGGACCATTACTGCCAGAGGAGAAAAGTATTAAGTAGCTCATTTCCCTACCTAAAAGATAACTGAATTTATTTGGCTACACTAAAGAATGCAGTA
TATTTAGTTTTCCATTTGCATGATGTGTTTGTGCTATAGACAATATTTTAAATTGAAAAATTTGTTTTAAATTATTTTTACAGTGAAGACTGTTTTCAGCT
CTTTTTATATTGTACATAGACTTTTATGTAATCTGGCATATGTTTTGTAGACCGTTTAATGACTGGATTATCTTCCTCCAACTTTTGAAATACAAAAACA
GTGTTTTATACTTGTATCTTGTTTTAAAGTCTTATATTAAAATTGTCATTTGACTTTTTTCCCGTTAAAAAAAAAAAAAAA