BLASTN 2.2.26+
Reference:
Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000),
"A greedy algorithm for aligning DNA sequences", J Comput Biol 2000;
7(1-2):203-14.
Database: TAIR10_cdna_20110103_representative_gene_model_updated
33,602 sequences; 51,074,197 total letters
Query= Ahg923458
Length=422
Score E
Sequences producing significant alignments: (Bits) Value
AT1G50220.1 | Symbols: | unknown protein; BEST Arabidopsis tha... 599 7e-171
AT1G43171.1 | Symbols: | unknown protein; FUNCTIONS IN: molecu... 597 3e-170
AT5G54067.1 | Symbols: | unknown protein; BEST Arabidopsis tha... 303 6e-82
AT1G50190.1 | Symbols: | Cysteine/Histidine-rich C1 domain fam... 279 1e-74
> AT1G50220.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT1G43171.1);
Has 31 Blast hits to 31 proteins in 2 species: Archae - 0;
Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 31; Viruses
- 0; Other Eukaryotes - 0 (source: NCBI BLink). | chr1:18602958-18604575
REVERSE LENGTH=612
Length=612
Score = 599 bits (324), Expect = 7e-171
Identities = 367/388 (95%), Gaps = 2/388 (1%)
Strand=Plus/Plus
Query 15 AGCAAGATGAATTATGGTAGAGATGAACTAGACGGGGCTATGATCATATCAAAGACTCTA 74
|||||||||||||||| |||||||||||||||||||||||||||||||||||||||||||
Sbjct 226 AGCAAGATGAATTATGCTAGAGATGAACTAGACGGGGCTATGATCATATCAAAGACTCTA 285
Query 75 TCAAAGAGTGACATTGTTGGTAATGTGGTATTACCAAAAACACAAGTGATGTCTGTCCTC 134
||||||||||||||||||||||||||| |||||||||| | ||||||||||||||||||
Sbjct 286 ACAAAGAGTGACATTGTTGGTAATGTGGCATTACCAAAAGCGCAAGTGATGTCTGTCCTC 345
Query 135 ACGAGGATGAATGGTGTTACAGATGAGGGTTTGGACAACGGTTTTGAAGTGCAAGTCCAC 194
||||||||||||||||| ||||||||||||||||||||||||||||||||||||||||||
Sbjct 346 ACGAGGATGAATGGTGTCACAGATGAGGGTTTGGACAACGGTTTTGAAGTGCAAGTCCAC 405
Query 195 GACATAATAGAAGACGATTTATGCACAGTTACCCTGAAAAGAATTGACGACACT-AAGTA 253
|||||||| ||||||||| | | ||||||||||||||||||||||||||||| | |||||
Sbjct 406 GACATAATGGAAGACGATCTGTACACAGTTACCCTGAAAAGAATTGACGACA-TGAAGTA 464
Query 254 TTATTTCGGGACTGGTTGGAGTATTATGAAGCATTCGTTAGATCTCGTAGAAGGCGATGT 313
||||||||||||||||||||||| |||||||||||| |||||||||||||||||||||||
Sbjct 465 TTATTTCGGGACTGGTTGGAGTACTATGAAGCATTCATTAGATCTCGTAGAAGGCGATGT 524
Query 314 TCTGAAGCTTTACTGGGATCAGTTTGAAAACAAATTCATTGTTCTTAATTTTCAGTATAA 373
|| |||||| || |||||||||||||||||||||||||||||||| ||||||||| ||||
Sbjct 525 TCAGAAGCTCTATTGGGATCAGTTTGAAAACAAATTCATTGTTCTCAATTTTCAGCATAA 584
Query 374 GACTATGGGAATAATGATTAATGTGTAG 401
||||||||||||||||||| |||||||
Sbjct 585 GACTATGGGAATAATGATTCCTGTGTAG 612
> AT1G43171.1 | Symbols: | unknown protein; FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown;
LOCATED IN: cellular_component unknown; BEST Arabidopsis
thaliana protein match is: unknown protein (TAIR:AT1G50220.1);
Has 30201 Blast hits to 17322 proteins in 780 species:
Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422;
Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source:
NCBI BLink). | chr1:16269075-16270513 FORWARD LENGTH=1439
Length=1439
Score = 597 bits (323), Expect = 3e-170
Identities = 388/418 (93%), Gaps = 9/418 (2%)
Strand=Plus/Plus
Query 3 GTTAAATTTTTTAGCAAGATGAATTATGGTAGAGATGAACTAGACGGGGCTATGATCATA 62
|||||| |||||||||||||||||||| | ||||||||||||||||||||||||||||
Sbjct 891 GTTAAA-ATTTTAGCAAGATGAATTATGCT--AGATGAACTAGACGGGGCTATGATCATA 947
Query 63 TCAAAGACTCTATCAAAGAGTGACATTGTTGGTAATGTGGTATTACCAAAAACACAAGTG 122
|||||||||||| |||||| |||||||||||||||||||| |||||||||| ||||||||
Sbjct 948 TCAAAGACTCTAACAAAGACTGACATTGTTGGTAATGTGGCATTACCAAAAGCACAAGTG 1007
Query 123 ATGTCTGTCCTCACGAGGATGAATGGTGTTACAGATGAGGGTTTGGACAACGGTTTTGAA 182
|||||||||||||| |||||||||||||| ||||||||||||||||||||||||||||||
Sbjct 1008 ATGTCTGTCCTCACAAGGATGAATGGTGTCACAGATGAGGGTTTGGACAACGGTTTTGAA 1067
Query 183 GTGCAAGTCCACGACATAATAGAAGACGATTTATGCACAGTTACCCTGAAAAGAATTGAC 242
|||||||||||||||||||| ||||||||| ||| |||||||||||||||||||||||||
Sbjct 1068 GTGCAAGTCCACGACATAATGGAAGACGATCTATACACAGTTACCCTGAAAAGAATTGAC 1127
Query 243 GACACT-AAGTATTATTTCGGGACTGGTTGGAGTATTATGAAGCATTCGTTAGATCTCGT 301
|||| | |||||||||||||||||||||||||||| |||||||||||| |||||||||||
Sbjct 1128 GACA-TGAAGTATTATTTCGGGACTGGTTGGAGTACTATGAAGCATTCATTAGATCTCGT 1186
Query 302 AGAAGGCGATGTTCTGAAGCTTTACTGGGATCAGTTTGAAAACAAATTCATTGTTCTTAA 361
||||||||||||||||||||| || |||||||||||||||||||||||||||||||| ||
Sbjct 1187 AGAAGGCGATGTTCTGAAGCTCTATTGGGATCAGTTTGAAAACAAATTCATTGTTCTCAA 1246
Query 362 TTTTCAGTATAAGACTATGGGAATAATGATTAATGTGTAGCTCGCTCGCTTACATCTA 419
|||| || ||||||||||||||| ||||||| ||||||||| | |||||||||||
Sbjct 1247 TTTTTAGCATAAGACTATGGGAACAATGATTCCTGTGTAGCT---T-GCTTACATCTA 1300
> AT5G54067.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT1G50220.1);
Has 30201 Blast hits to 17322 proteins in 780 species: Archae
- 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants
- 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink). | chr5:21941602-21942146 REVERSE LENGTH=545
Length=545
Score = 303 bits (164), Expect = 6e-82
Identities = 308/373 (83%), Gaps = 27/373 (7%)
Strand=Plus/Plus
Query 50 GGCTATGATCATATCAAAGACT-CTATCAAAGAGTGACATTGTTGGTAATGTGGTATTAC 108
||||||||||||| |||| | | |||||||||||||| || |||||||||||||||||||
Sbjct 27 GGCTATGATCATAACAAA-AGTGCTATCAAAGAGTGATATCGTTGGTAATGTGGTATTAC 85
Query 109 CAAAAACACAAGTGATGTCTGTCCTCACGAGGATGAATGGTGTTACAGATGAGGGTTTGG 168
| ||| || |||||||||||||||||||||||||||||| | | || || | | ||||
Sbjct 86 CGAAAGCAGAAGTGATGTCTGTCCTCACGAGGATGAATG-T-TAAC-GACCAAGATTTGC 142
Query 169 -ACAACGGTTTTGAAGTGCAAGTCCACGACATAATAGAAGACGATTTATGCACAGTTACC 227
| |||||| |||||||||||||| |||||||||| |||||||| |||| |||||||||
Sbjct 143 TA-AACGGTGTTGAAGTGCAAGTCGACGACATAATGGAAGACGACTTATACACAGTTACG 201
Query 228 CTGAAAAGAATT-G--A-CGACA--C-TAAGTATTATTTCGGGACTGGTTGGAGTATTAT 280
|| ||| | || | | ||| | | ||| ||||||||||| ||||||||||||| |||
Sbjct 202 CTCAAA-GTATCAGGTATCGATAAACCTAAATATTATTTCGGTACTGGTTGGAGTACTAT 260
Query 281 GAAGCATTCGTTAGATCTCGT-AGAAGGCGATGTTCTGAAGCTTTACTGG-GATCAGTTT 338
||||||||||||||||||| | |||||||||||||||||| || |||||| | ||| ||
Sbjct 261 GAAGCATTCGTTAGATCTC-TCAGAAGGCGATGTTCTGAAACTCTACTGGAG-TCACTTG 318
Query 339 GAAAACAAATTCATTGTTCTT-AATTTTCAGTATA-AG-ACTATGGGAATAATGATTA-A 394
|| ||||| ||| ||||| || |||||||||||| || ||| | | |||||||| |
Sbjct 319 GACAACAAGTTCGTTGTT-TTGAATTTTCAGTATTCAGTACT-TCC-ATTAATGATTCCA 375
Query 395 TGTGTAGCTCGCT 407
||||||||||||
Sbjct 376 -GTGTAGCTCGCT 387
> AT1G50190.1 | Symbols: | Cysteine/Histidine-rich C1 domain family
protein | chr1:18588229-18590799 REVERSE LENGTH=1860
Length=1860
Score = 279 bits (151), Expect = 1e-74
Identities = 163/169 (96%), Gaps = 0/169 (0%)
Strand=Plus/Plus
Query 17 CAAGATGAATTATGGTAGAGATGAACTAGACGGGGCTATGATCATATCAAAGACTCTATC 76
|||||||||||||| ||||||||||||||||||||||||||||||||||||||||||| |
Sbjct 1689 CAAGATGAATTATGCTAGAGATGAACTAGACGGGGCTATGATCATATCAAAGACTCTAAC 1748
Query 77 AAAGAGTGACATTGTTGGTAATGTGGTATTACCAAAAACACAAGTGATGTCTGTCCTCAC 136
|||||||||||||||||||||||||| |||||||||| | ||||||||||||||||||||
Sbjct 1749 AAAGAGTGACATTGTTGGTAATGTGGCATTACCAAAAGCGCAAGTGATGTCTGTCCTCAC 1808
Query 137 GAGGATGAATGGTGTTACAGATGAGGGTTTGGACAACGGTTTTGAAGTG 185
||||||||||||||| |||||||||||||||||||||||||||||||||
Sbjct 1809 GAGGATGAATGGTGTCACAGATGAGGGTTTGGACAACGGTTTTGAAGTG 1857
Lambda K H
1.33 0.621 1.12
Gapped
Lambda K H
1.28 0.460 0.850
Effective search space used: 20006564102
Database: TAIR10_cdna_20110103_representative_gene_model_updated
Posted date: Sep 25, 2014 6:13 PM
Number of letters in database: 51,074,197
Number of sequences in database: 33,602
Matrix: blastn matrix 1 -2
Gap Penalties: Existence: 0, Extension: 2.5