BLASTN 2.2.26+
Reference:
Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000),
"A greedy algorithm for aligning DNA sequences", J Comput Biol 2000;
7(1-2):203-14.
Database: TAIR10_cdna_20110103_representative_gene_model_updated
33,602 sequences; 51,074,197 total letters
Query= Ahg906119
Length=552
Score E
Sequences producing significant alignments: (Bits) Value
AT1G43720.1 | Symbols: | unknown protein; BEST Arabidopsis tha... 272 2e-72
AT3G30520.1 | Symbols: | unknown protein; BEST Arabidopsis tha... 255 2e-67
AT3G44030.1 | Symbols: | pseudogene, similar to OSJNBb0043H09.... 250 1e-65
AT5G36080.1 | Symbols: | unknown protein; BEST Arabidopsis tha... 172 2e-42
AT1G35820.1 | Symbols: | unknown protein; BEST Arabidopsis tha... 122 2e-27
> AT1G43720.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT3G42870.1);
Has 51 Blast hits to 51 proteins in 3 species: Archae - 0;
Bacteria - 0; Metazoa - 2; Fungi - 0; Plants - 49; Viruses
- 0; Other Eukaryotes - 0 (source: NCBI BLink). | chr1:16495080-16496078
FORWARD LENGTH=945
Length=945
Score = 272 bits (147), Expect = 2e-72
Identities = 193/215 (90%), Gaps = 3/215 (1%)
Strand=Plus/Plus
Query 168 CGACAATCGTTTGAGACAACAATACAAGATACCATCGCTGGCTACAGAGAATTCCAACGA 227
|||||||| ||||| |||||||||||||| | |||| |||||| ||||||||||||||||
Sbjct 295 CGACAATCCTTTGAAACAACAATACAAGACAGCATCACTGGCTTCAGAGAATTCCAACGA 354
Query 228 CAAAGTTTTCAACAACTTCGTCCTAGTTGGTTTTGACCAAGATGATTATGATGAATTGAA 287
|||||||||||||||||||||||| || | |||||| ||||||||||| |||||||| ||
Sbjct 355 CAAAGTTTTCAACAACTTCGTCCTGGT-GCTTTTGATCAAGATGATTACGATGAATTTAA 413
Query 288 AAAGGCGGAAGCGATATTCATTGCGCTAGACCTTCCTAAGCACA-TAGGATTTTATTGGG 346
|||||||||||||||||| || |||||| | ||||||||||||| ||| |||| ||||||
Sbjct 414 AAAGGCGGAAGCGATATTTATCGCGCTAAATCTTCCTAAGCACACTAG-ATTTCATTGGG 472
Query 347 CATGCATTAATACACATAAGGAACTAGTATTTTGG 381
||||||||||| ||| |||||| ||||||||||||
Sbjct 473 CATGCATTAATGCACTTAAGGAGCTAGTATTTTGG 507
> AT3G30520.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT3G42870.1);
Has 70 Blast hits to 70 proteins in 9 species: Archae - 0;
Bacteria - 8; Metazoa - 5; Fungi - 0; Plants - 57; Viruses
- 0; Other Eukaryotes - 0 (source: NCBI BLink). | chr3:12130340-12132173
FORWARD LENGTH=1194
Length=1194
Score = 255 bits (138), Expect = 2e-67
Identities = 190/215 (88%), Gaps = 3/215 (1%)
Strand=Plus/Plus
Query 168 CGACAATCGTTTGAGACAACAATACAAGATACCATCGCTGGCTACAGAGAATTCCAACGA 227
|||||||| ||||| |||||||||||||| | |||| || ||| | ||||||||||||||
Sbjct 625 CGACAATCCTTTGAAACAACAATACAAGACAGCATCACTAGCTTCGGAGAATTCCAACGA 684
Query 228 CAAAGTTTTCAACAACTTCGTCCTAGTTGGTTTTGACCAAGATGATTATGATGAATTGAA 287
|||||||||||||||||||||||| || | |||||||||||||||||| |||||||| ||
Sbjct 685 CAAAGTTTTCAACAACTTCGTCCTCGT-GCTTTTGACCAAGATGATTACGATGAATTTAA 743
Query 288 AAAGGCGGAAGCGATATTCATTGCGCTAGACCTTCCTAAGCACA-TAGGATTTTATTGGG 346
|||||||||||||||||| | || ||||| ||||| ||||||| ||| |||| ||||||
Sbjct 744 AAAGGCGGAAGCGATATTTACCGCCCTAGATCTTCCCAAGCACACTAG-ATTTCATTGGG 802
Query 347 CATGCATTAATACACATAAGGAACTAGTATTTTGG 381
||||||||||| ||| |||||| ||||||||||||
Sbjct 803 CATGCATTAATGCACTTAAGGAGCTAGTATTTTGG 837
> AT3G44030.1 | Symbols: | pseudogene, similar to OSJNBb0043H09.1,
blastp match of 40% identity and 8.4e-38 P-value to GP|21740634|emb|CAD40195.1||AL606611
OSJNBb0043H09.1 {Oryza sativa
(japonica cultivar-group)} | chr3:15803993-15806948 FORWARD
LENGTH=2956
Length=2956
Score = 250 bits (135), Expect = 1e-65
Identities = 190/216 (88%), Gaps = 5/216 (2%)
Strand=Plus/Plus
Query 168 CGACAATCGTTTGAGACAACAATACAAGATACCATCGCTGGCTACAGAGAATTCCAACGA 227
|||||||| ||||| |||||||||||||| | |||| ||||| ||||||||||||||||
Sbjct 987 CGACAATCCTTTGAAACAACAATACAAGACAGCATCATTGGCTTCAGAGAATTCCAACGA 1046
Query 228 CAAAGTTTTCAACAACTTCGTCCTAGTTGGTTTTGACCAAGATGATTATGATGAATTGAA 287
||||||||||||||||||||| || || | |||||||||||||||||| |||||||| ||
Sbjct 1047 CAAAGTTTTCAACAACTTCGTTCTGGT-GCTTTTGACCAAGATGATTACGATGAATTTAA 1105
Query 288 AAAGGCGGAAGCGATATTCATTGCGCTAGACCTTCC-TAAGCACA-TAGGATTTTATTGG 345
||| |||||||||||||| || |||||||| ||||| || ||||| ||| |||| |||||
Sbjct 1106 AAAAGCGGAAGCGATATTTATCGCGCTAGATCTTCCCTA-GCACACTAG-ATTTCATTGG 1163
Query 346 GCATGCATTAATACACATAAGGAACTAGTATTTTGG 381
|||||||||||| ||| ||| || ||||||||||||
Sbjct 1164 GCATGCATTAATGCACTTAAAGAGCTAGTATTTTGG 1199
Score = 220 bits (119), Expect = 9e-57
Identities = 149/164 (91%), Gaps = 0/164 (0%)
Strand=Plus/Plus
Query 382 GGAGGTCTATCATCTGGAAGTCCATCTTCTGTGGGTAATAATTTAGGAGGGCGAAACTCA 441
|||||| |||||||||||||||||||||| |||||||||||||| ||||||| ||| |||
Sbjct 1341 GGAGGTATATCATCTGGAAGTCCATCTTCAGTGGGTAATAATTTGGGAGGGCAAAATTCA 1400
Query 442 CCCGGTTGTTGGGGTCCCGTTTATCCACAGTGGGGAACACCACCAAATGCTCCACAGTGG 501
||||||| |||||||||| |||||||||||||||||||||||||||| | | ||||||||
Sbjct 1401 CCCGGTTTTTGGGGTCCCATTTATCCACAGTGGGGAACACCACCAAACGTTGCACAGTGG 1460
Query 502 AGAACACCACCAAATGCTCCACAGTGGGGTACACCACCAAATGC 545
|||||| ||||||| ||||| ||||||| ||||||||||||||
Sbjct 1461 GGAACACAACCAAATACTCCATAGTGGGGAACACCACCAAATGC 1504
> AT5G36080.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT3G32904.1);
Has 1807 Blast hits to 1807 proteins in 277 species: Archae
- 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink).
| chr5:14184423-14185277 FORWARD LENGTH=462
Length=462
Score = 172 bits (93), Expect = 2e-42
Identities = 138/160 (86%), Gaps = 1/160 (1%)
Strand=Plus/Plus
Query 157 GAAATCGTTGGCGACAATCGTTTGAGACAACAATACAAGATACCATCGCTGGCTACAGAG 216
|||||||| | ||||||| ||||| ||||| |||||||| || || ||||| || ||||
Sbjct 304 GAAATCGTAGAAGACAATCTTTTGAAACAACTATACAAGACACTATTGCTGGTTATAGAG 363
Query 217 AATTCCAACGACAAAGTTTTCAACAACTTCGTCCTAGTTGGTTTTGACCAAGATGATTAT 276
|||| ||||||||||||||||||||||||||||| | || ||||||||||||||||||
Sbjct 364 AATTTCAACGACAAAGTTTTCAACAACTTCGTCCC-GATGCTTTTGACCAAGATGATTAC 422
Query 277 GATGAATTGAAAAAGGCGGAAGCGATATTCATTGCGCTAG 316
||||||| |||||||| ||||||||||| |||||||||
Sbjct 423 AATGAATTTAAAAAGGCAGAAGCGATATTTGTTGCGCTAG 462
> AT1G35820.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT3G30520.1);
Has 30201 Blast hits to 17322 proteins in 780 species: Archae
- 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants
- 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink). | chr1:13308119-13309257 REVERSE LENGTH=864
Length=864
Score = 122 bits (66), Expect = 2e-27
Identities = 78/84 (93%), Gaps = 0/84 (0%)
Strand=Plus/Plus
Query 168 CGACAATCGTTTGAGACAACAATACAAGATACCATCGCTGGCTACAGAGAATTCCAACGA 227
|||||||| ||||| |||||||||||||| | |||| |||||| ||||||||||||||||
Sbjct 541 CGACAATCCTTTGAAACAACAATACAAGACATCATCACTGGCTTCAGAGAATTCCAACGA 600
Query 228 CAAAGTTTTCAACAACTTCGTCCT 251
||||||||||||||||||||||||
Sbjct 601 CAAAGTTTTCAACAACTTCGTCCT 624
Lambda K H
1.33 0.621 1.12
Gapped
Lambda K H
1.28 0.460 0.850
Effective search space used: 26473395469
Database: TAIR10_cdna_20110103_representative_gene_model_updated
Posted date: Sep 25, 2014 6:13 PM
Number of letters in database: 51,074,197
Number of sequences in database: 33,602
Matrix: blastn matrix 1 -2
Gap Penalties: Existence: 0, Extension: 2.5