BLASTN 2.2.26+


Reference: Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb
Miller (2000), "A greedy algorithm for aligning DNA sequences", J
Comput Biol 2000; 7(1-2):203-14.


Database: TAIR10_cdna_20110103_representative_gene_model_updated
           33,602 sequences; 51,074,197 total letters


Query= Ahg950175

Length=765
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

AT5G10010.1 | Symbols: | unknown protein; FUNCTIONS IN: molecu...      268    4e-71


> AT5G10010.1 | Symbols: | unknown protein; FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN: nucleolus;
EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages;
BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G64910.1);
Has 33260 Blast hits to 16857 proteins in 1270 species: Archae - 88;
Bacteria - 3040; Metazoa - 11915; Fungi - 3137; Plants - 1371; Viruses - 424;
Other Eukaryotes - 13285 (source: NCBI BLink). | chr5:3127906-3131727 FORWARD
LENGTH=1772
Length=1772

 Score =  268 bits (145),  Expect = 4e-71
 Identities = 248/295 (84%), Gaps = 18/295 (6%)
 Strand=Plus/Plus

Query  463   GGAATTTGTCGATGGTCTGATTGAAGATGAAGCATTACCTGTTGAACAAAAGGATGAATT  522
             |||||||||||||   ||| ||||||| ||||||||||||| |||||||  |||||||||
Sbjct  1209  GGAATTTGTCGATAAACTGGTTGAAGAGGAAGCATTACCTGCTGAACAAGCGGATGAATT  1268

Query  523   CAACGAAT-TCGTCAAAGAGCAAGTTCGAGCAGCGAAGAAAGC-AAGCAAAGAGGCCAAA  580
             ||| |||| |  ||||||||||||||||||||||| |||||||| || |  ||||||||||
Sbjct  1269  CAAAGAATAT-GTCAAAGAGCAAGTTCGAGCAGCAAAGAAAGCAAATC-GAGAGGCCAAA  1326

Query  581   GTTGCTCGAGAGAAAGCAATAGAAGAAATGAGCGAAGATACTAAGGAAGCCTTTGAAAAC  640
             | ||||||| ||||||||||||||||||||||||||||||||||| |||||||| ||||
Sbjct  1327  GATGCTCGAAAGAAAGCAATAGAAGAAATGAGCGAAGATACTAAGCAAGCCTTTCAAAAG  1386

Query  641   ATGAAGATCTACAAATTCTACCCTCGGCATTCACCAGACGTC-CCTAG--GT-TCAAGAC  696
             |||||| |||||||||||||||||| || |||||||||   | || ||  || ||  |
Sbjct  1387  ATGAAGTTCTACAAATTCTACCCTCAGCCTTCACCAGATA-CACC-AGACGTCTCTGGT-  1443

Query  697   GTC--GTC---ATACATTAACCAATACTACGGGAAAG-CTCATCAAGTCCTTTGA  745
             |||  |||   || |||||||| |||||| || |||| ||||| |||||||||||
Sbjct  1444  GTCCAGTCCCCATTCATTAACCGATACTATGG-AAAGGCTCATGAAGTCCTTTGA  1497


 Score =  176 bits (95),  Expect = 3e-43
 Identities = 130/146 (89%), Gaps = 6/146 (4%)
 Strand=Plus/Plus

Query  322   GAGGAGAGCTGCTCTCAGACATATGAAGGAAGATC--GTGTAAAAATGTTTGAGTATTGT  379
             ||| ||| |||||||||||||||||||||||||||   | ||| || ||||||||||||
Sbjct  1029  GAGAAGATCTGCTCTCAGACATATGAAGGAAGATCAACT-TAAGAA-GTTTGAGTATTGC  1086

Query  380   CTTCCTTATTTCTAT-AACCCATTGAAGGAAGATGAACTTGAACAGAGTACTGAGGTCGA  438
             ||||||||||||||| ||||| || ||||||||||||||||||||||||||||||||| |
Sbjct  1087  CTTCCTTATTTCTATCAACCC-TTTAAGGAAGATGAACTTGAACAGAGTACTGAGGTCCA  1145

Query  439   CATATTGTTCCCCTCTGAACCGCCGG  464
              ||| |||||||||||||||| ||||
Sbjct  1146  AATAATGTTCCCCTCTGAACCCCCGG  1171


Lambda      K        H
    1.33    0.621     1.12

Gapped
Lambda      K        H
    1.28    0.460    0.850

Effective search space used: 37173268780


  Database: TAIR10_cdna_20110103_representative_gene_model_updated
    Posted date:  Sep 25, 2014  6:13 PM
  Number of letters in database: 51,074,197
  Number of sequences in database:  33,602


Matrix: blastn matrix 1 -2
Gap Penalties: Existence: 0, Extension: 2.5