BLASTN 2.2.26+ Reference: Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000), "A greedy algorithm for aligning DNA sequences", J Comput Biol 2000; 7(1-2):203-14. Database: TAIR10_cdna_20110103_representative_gene_model_updated 33,602 sequences; 51,074,197 total letters Query= Ahg923458 Length=422 Score E Sequences producing significant alignments: (Bits) Value AT1G50220.1 | Symbols: | unknown protein; BEST Arabidopsis tha... 599 7e-171 AT1G43171.1 | Symbols: | unknown protein; FUNCTIONS IN: molecu... 597 3e-170 AT5G54067.1 | Symbols: | unknown protein; BEST Arabidopsis tha... 303 6e-82 AT1G50190.1 | Symbols: | Cysteine/Histidine-rich C1 domain fam... 279 1e-74 > AT1G50220.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G43171.1); Has 31 Blast hits to 31 proteins in 2 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 31; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | chr1:18602958-18604575 REVERSE LENGTH=612 Length=612 Score = 599 bits (324), Expect = 7e-171 Identities = 367/388 (95%), Gaps = 2/388 (1%) Strand=Plus/Plus Query 15 AGCAAGATGAATTATGGTAGAGATGAACTAGACGGGGCTATGATCATATCAAAGACTCTA 74 |||||||||||||||| ||||||||||||||||||||||||||||||||||||||||||| Sbjct 226 AGCAAGATGAATTATGCTAGAGATGAACTAGACGGGGCTATGATCATATCAAAGACTCTA 285 Query 75 TCAAAGAGTGACATTGTTGGTAATGTGGTATTACCAAAAACACAAGTGATGTCTGTCCTC 134 ||||||||||||||||||||||||||| |||||||||| | |||||||||||||||||| Sbjct 286 ACAAAGAGTGACATTGTTGGTAATGTGGCATTACCAAAAGCGCAAGTGATGTCTGTCCTC 345 Query 135 ACGAGGATGAATGGTGTTACAGATGAGGGTTTGGACAACGGTTTTGAAGTGCAAGTCCAC 194 ||||||||||||||||| |||||||||||||||||||||||||||||||||||||||||| Sbjct 346 ACGAGGATGAATGGTGTCACAGATGAGGGTTTGGACAACGGTTTTGAAGTGCAAGTCCAC 405 Query 195 GACATAATAGAAGACGATTTATGCACAGTTACCCTGAAAAGAATTGACGACACT-AAGTA 253 |||||||| ||||||||| | | ||||||||||||||||||||||||||||| | ||||| Sbjct 406 GACATAATGGAAGACGATCTGTACACAGTTACCCTGAAAAGAATTGACGACA-TGAAGTA 464 Query 254 TTATTTCGGGACTGGTTGGAGTATTATGAAGCATTCGTTAGATCTCGTAGAAGGCGATGT 313 ||||||||||||||||||||||| |||||||||||| ||||||||||||||||||||||| Sbjct 465 TTATTTCGGGACTGGTTGGAGTACTATGAAGCATTCATTAGATCTCGTAGAAGGCGATGT 524 Query 314 TCTGAAGCTTTACTGGGATCAGTTTGAAAACAAATTCATTGTTCTTAATTTTCAGTATAA 373 || |||||| || |||||||||||||||||||||||||||||||| ||||||||| |||| Sbjct 525 TCAGAAGCTCTATTGGGATCAGTTTGAAAACAAATTCATTGTTCTCAATTTTCAGCATAA 584 Query 374 GACTATGGGAATAATGATTAATGTGTAG 401 ||||||||||||||||||| ||||||| Sbjct 585 GACTATGGGAATAATGATTCCTGTGTAG 612 > AT1G43171.1 | Symbols: | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G50220.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). | chr1:16269075-16270513 FORWARD LENGTH=1439 Length=1439 Score = 597 bits (323), Expect = 3e-170 Identities = 388/418 (93%), Gaps = 9/418 (2%) Strand=Plus/Plus Query 3 GTTAAATTTTTTAGCAAGATGAATTATGGTAGAGATGAACTAGACGGGGCTATGATCATA 62 |||||| |||||||||||||||||||| | |||||||||||||||||||||||||||| Sbjct 891 GTTAAA-ATTTTAGCAAGATGAATTATGCT--AGATGAACTAGACGGGGCTATGATCATA 947 Query 63 TCAAAGACTCTATCAAAGAGTGACATTGTTGGTAATGTGGTATTACCAAAAACACAAGTG 122 |||||||||||| |||||| |||||||||||||||||||| |||||||||| |||||||| Sbjct 948 TCAAAGACTCTAACAAAGACTGACATTGTTGGTAATGTGGCATTACCAAAAGCACAAGTG 1007 Query 123 ATGTCTGTCCTCACGAGGATGAATGGTGTTACAGATGAGGGTTTGGACAACGGTTTTGAA 182 |||||||||||||| |||||||||||||| |||||||||||||||||||||||||||||| Sbjct 1008 ATGTCTGTCCTCACAAGGATGAATGGTGTCACAGATGAGGGTTTGGACAACGGTTTTGAA 1067 Query 183 GTGCAAGTCCACGACATAATAGAAGACGATTTATGCACAGTTACCCTGAAAAGAATTGAC 242 |||||||||||||||||||| ||||||||| ||| ||||||||||||||||||||||||| Sbjct 1068 GTGCAAGTCCACGACATAATGGAAGACGATCTATACACAGTTACCCTGAAAAGAATTGAC 1127 Query 243 GACACT-AAGTATTATTTCGGGACTGGTTGGAGTATTATGAAGCATTCGTTAGATCTCGT 301 |||| | |||||||||||||||||||||||||||| |||||||||||| ||||||||||| Sbjct 1128 GACA-TGAAGTATTATTTCGGGACTGGTTGGAGTACTATGAAGCATTCATTAGATCTCGT 1186 Query 302 AGAAGGCGATGTTCTGAAGCTTTACTGGGATCAGTTTGAAAACAAATTCATTGTTCTTAA 361 ||||||||||||||||||||| || |||||||||||||||||||||||||||||||| || Sbjct 1187 AGAAGGCGATGTTCTGAAGCTCTATTGGGATCAGTTTGAAAACAAATTCATTGTTCTCAA 1246 Query 362 TTTTCAGTATAAGACTATGGGAATAATGATTAATGTGTAGCTCGCTCGCTTACATCTA 419 |||| || ||||||||||||||| ||||||| ||||||||| | ||||||||||| Sbjct 1247 TTTTTAGCATAAGACTATGGGAACAATGATTCCTGTGTAGCT---T-GCTTACATCTA 1300 > AT5G54067.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G50220.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). | chr5:21941602-21942146 REVERSE LENGTH=545 Length=545 Score = 303 bits (164), Expect = 6e-82 Identities = 308/373 (83%), Gaps = 27/373 (7%) Strand=Plus/Plus Query 50 GGCTATGATCATATCAAAGACT-CTATCAAAGAGTGACATTGTTGGTAATGTGGTATTAC 108 ||||||||||||| |||| | | |||||||||||||| || ||||||||||||||||||| Sbjct 27 GGCTATGATCATAACAAA-AGTGCTATCAAAGAGTGATATCGTTGGTAATGTGGTATTAC 85 Query 109 CAAAAACACAAGTGATGTCTGTCCTCACGAGGATGAATGGTGTTACAGATGAGGGTTTGG 168 | ||| || |||||||||||||||||||||||||||||| | | || || | | |||| Sbjct 86 CGAAAGCAGAAGTGATGTCTGTCCTCACGAGGATGAATG-T-TAAC-GACCAAGATTTGC 142 Query 169 -ACAACGGTTTTGAAGTGCAAGTCCACGACATAATAGAAGACGATTTATGCACAGTTACC 227 | |||||| |||||||||||||| |||||||||| |||||||| |||| ||||||||| Sbjct 143 TA-AACGGTGTTGAAGTGCAAGTCGACGACATAATGGAAGACGACTTATACACAGTTACG 201 Query 228 CTGAAAAGAATT-G--A-CGACA--C-TAAGTATTATTTCGGGACTGGTTGGAGTATTAT 280 || ||| | || | | ||| | | ||| ||||||||||| ||||||||||||| ||| Sbjct 202 CTCAAA-GTATCAGGTATCGATAAACCTAAATATTATTTCGGTACTGGTTGGAGTACTAT 260 Query 281 GAAGCATTCGTTAGATCTCGT-AGAAGGCGATGTTCTGAAGCTTTACTGG-GATCAGTTT 338 ||||||||||||||||||| | |||||||||||||||||| || |||||| | ||| || Sbjct 261 GAAGCATTCGTTAGATCTC-TCAGAAGGCGATGTTCTGAAACTCTACTGGAG-TCACTTG 318 Query 339 GAAAACAAATTCATTGTTCTT-AATTTTCAGTATA-AG-ACTATGGGAATAATGATTA-A 394 || ||||| ||| ||||| || |||||||||||| || ||| | | |||||||| | Sbjct 319 GACAACAAGTTCGTTGTT-TTGAATTTTCAGTATTCAGTACT-TCC-ATTAATGATTCCA 375 Query 395 TGTGTAGCTCGCT 407 |||||||||||| Sbjct 376 -GTGTAGCTCGCT 387 > AT1G50190.1 | Symbols: | Cysteine/Histidine-rich C1 domain family protein | chr1:18588229-18590799 REVERSE LENGTH=1860 Length=1860 Score = 279 bits (151), Expect = 1e-74 Identities = 163/169 (96%), Gaps = 0/169 (0%) Strand=Plus/Plus Query 17 CAAGATGAATTATGGTAGAGATGAACTAGACGGGGCTATGATCATATCAAAGACTCTATC 76 |||||||||||||| ||||||||||||||||||||||||||||||||||||||||||| | Sbjct 1689 CAAGATGAATTATGCTAGAGATGAACTAGACGGGGCTATGATCATATCAAAGACTCTAAC 1748 Query 77 AAAGAGTGACATTGTTGGTAATGTGGTATTACCAAAAACACAAGTGATGTCTGTCCTCAC 136 |||||||||||||||||||||||||| |||||||||| | |||||||||||||||||||| Sbjct 1749 AAAGAGTGACATTGTTGGTAATGTGGCATTACCAAAAGCGCAAGTGATGTCTGTCCTCAC 1808 Query 137 GAGGATGAATGGTGTTACAGATGAGGGTTTGGACAACGGTTTTGAAGTG 185 ||||||||||||||| ||||||||||||||||||||||||||||||||| Sbjct 1809 GAGGATGAATGGTGTCACAGATGAGGGTTTGGACAACGGTTTTGAAGTG 1857 Lambda K H 1.33 0.621 1.12 Gapped Lambda K H 1.28 0.460 0.850 Effective search space used: 20006564102 Database: TAIR10_cdna_20110103_representative_gene_model_updated Posted date: Sep 25, 2014 6:13 PM Number of letters in database: 51,074,197 Number of sequences in database: 33,602 Matrix: blastn matrix 1 -2 Gap Penalties: Existence: 0, Extension: 2.5