BLASTN 2.2.26+


Reference:
Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000),
"A greedy algorithm for aligning DNA sequences", J Comput Biol 2000;
7(1-2):203-14.



Database: TAIR10_cdna_20110103_representative_gene_model_updated
           33,602 sequences; 51,074,197 total letters



Query= Ahg906119

Length=552
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

  AT1G43720.1 | Symbols:  | unknown protein; BEST Arabidopsis tha...   272    2e-72
  AT3G30520.1 | Symbols:  | unknown protein; BEST Arabidopsis tha...   255    2e-67
  AT3G44030.1 | Symbols:  | pseudogene, similar to OSJNBb0043H09....   250    1e-65
  AT5G36080.1 | Symbols:  | unknown protein; BEST Arabidopsis tha...   172    2e-42
  AT1G35820.1 | Symbols:  | unknown protein; BEST Arabidopsis tha...   122    2e-27


> AT1G43720.1 | Symbols:  | unknown protein; BEST Arabidopsis thaliana 
protein match is: unknown protein (TAIR:AT3G42870.1); 
Has 51 Blast hits to 51 proteins in 3 species: Archae - 0; 
Bacteria - 0; Metazoa - 2; Fungi - 0; Plants - 49; Viruses 
- 0; Other Eukaryotes - 0 (source: NCBI BLink). | chr1:16495080-16496078 
FORWARD LENGTH=945
Length=945

 Score =  272 bits (147),  Expect = 2e-72
 Identities = 193/215 (90%), Gaps = 3/215 (1%)
 Strand=Plus/Plus

Query  168  CGACAATCGTTTGAGACAACAATACAAGATACCATCGCTGGCTACAGAGAATTCCAACGA  227
            |||||||| ||||| |||||||||||||| | |||| |||||| ||||||||||||||||
Sbjct  295  CGACAATCCTTTGAAACAACAATACAAGACAGCATCACTGGCTTCAGAGAATTCCAACGA  354

Query  228  CAAAGTTTTCAACAACTTCGTCCTAGTTGGTTTTGACCAAGATGATTATGATGAATTGAA  287
            |||||||||||||||||||||||| || | |||||| ||||||||||| |||||||| ||
Sbjct  355  CAAAGTTTTCAACAACTTCGTCCTGGT-GCTTTTGATCAAGATGATTACGATGAATTTAA  413

Query  288  AAAGGCGGAAGCGATATTCATTGCGCTAGACCTTCCTAAGCACA-TAGGATTTTATTGGG  346
            |||||||||||||||||| || |||||| | ||||||||||||| ||| |||| ||||||
Sbjct  414  AAAGGCGGAAGCGATATTTATCGCGCTAAATCTTCCTAAGCACACTAG-ATTTCATTGGG  472

Query  347  CATGCATTAATACACATAAGGAACTAGTATTTTGG  381
            ||||||||||| ||| |||||| ||||||||||||
Sbjct  473  CATGCATTAATGCACTTAAGGAGCTAGTATTTTGG  507


> AT3G30520.1 | Symbols:  | unknown protein; BEST Arabidopsis thaliana 
protein match is: unknown protein (TAIR:AT3G42870.1); 
Has 70 Blast hits to 70 proteins in 9 species: Archae - 0; 
Bacteria - 8; Metazoa - 5; Fungi - 0; Plants - 57; Viruses 
- 0; Other Eukaryotes - 0 (source: NCBI BLink). | chr3:12130340-12132173 
FORWARD LENGTH=1194
Length=1194

 Score =  255 bits (138),  Expect = 2e-67
 Identities = 190/215 (88%), Gaps = 3/215 (1%)
 Strand=Plus/Plus

Query  168  CGACAATCGTTTGAGACAACAATACAAGATACCATCGCTGGCTACAGAGAATTCCAACGA  227
            |||||||| ||||| |||||||||||||| | |||| || ||| | ||||||||||||||
Sbjct  625  CGACAATCCTTTGAAACAACAATACAAGACAGCATCACTAGCTTCGGAGAATTCCAACGA  684

Query  228  CAAAGTTTTCAACAACTTCGTCCTAGTTGGTTTTGACCAAGATGATTATGATGAATTGAA  287
            |||||||||||||||||||||||| || | |||||||||||||||||| |||||||| ||
Sbjct  685  CAAAGTTTTCAACAACTTCGTCCTCGT-GCTTTTGACCAAGATGATTACGATGAATTTAA  743

Query  288  AAAGGCGGAAGCGATATTCATTGCGCTAGACCTTCCTAAGCACA-TAGGATTTTATTGGG  346
            |||||||||||||||||| |  || ||||| ||||| ||||||| ||| |||| ||||||
Sbjct  744  AAAGGCGGAAGCGATATTTACCGCCCTAGATCTTCCCAAGCACACTAG-ATTTCATTGGG  802

Query  347  CATGCATTAATACACATAAGGAACTAGTATTTTGG  381
            ||||||||||| ||| |||||| ||||||||||||
Sbjct  803  CATGCATTAATGCACTTAAGGAGCTAGTATTTTGG  837


> AT3G44030.1 | Symbols:  | pseudogene, similar to OSJNBb0043H09.1, 
blastp match of 40% identity and 8.4e-38 P-value to GP|21740634|emb|CAD40195.1||AL606611 
OSJNBb0043H09.1 {Oryza sativa 
(japonica cultivar-group)} | chr3:15803993-15806948 FORWARD 
LENGTH=2956
Length=2956

 Score =  250 bits (135),  Expect = 1e-65
 Identities = 190/216 (88%), Gaps = 5/216 (2%)
 Strand=Plus/Plus

Query  168   CGACAATCGTTTGAGACAACAATACAAGATACCATCGCTGGCTACAGAGAATTCCAACGA  227
             |||||||| ||||| |||||||||||||| | ||||  ||||| ||||||||||||||||
Sbjct  987   CGACAATCCTTTGAAACAACAATACAAGACAGCATCATTGGCTTCAGAGAATTCCAACGA  1046

Query  228   CAAAGTTTTCAACAACTTCGTCCTAGTTGGTTTTGACCAAGATGATTATGATGAATTGAA  287
             ||||||||||||||||||||| || || | |||||||||||||||||| |||||||| ||
Sbjct  1047  CAAAGTTTTCAACAACTTCGTTCTGGT-GCTTTTGACCAAGATGATTACGATGAATTTAA  1105

Query  288   AAAGGCGGAAGCGATATTCATTGCGCTAGACCTTCC-TAAGCACA-TAGGATTTTATTGG  345
             ||| |||||||||||||| || |||||||| ||||| || ||||| ||| |||| |||||
Sbjct  1106  AAAAGCGGAAGCGATATTTATCGCGCTAGATCTTCCCTA-GCACACTAG-ATTTCATTGG  1163

Query  346   GCATGCATTAATACACATAAGGAACTAGTATTTTGG  381
             |||||||||||| ||| ||| || ||||||||||||
Sbjct  1164  GCATGCATTAATGCACTTAAAGAGCTAGTATTTTGG  1199


 Score =  220 bits (119),  Expect = 9e-57
 Identities = 149/164 (91%), Gaps = 0/164 (0%)
 Strand=Plus/Plus

Query  382   GGAGGTCTATCATCTGGAAGTCCATCTTCTGTGGGTAATAATTTAGGAGGGCGAAACTCA  441
             |||||| |||||||||||||||||||||| |||||||||||||| ||||||| ||| |||
Sbjct  1341  GGAGGTATATCATCTGGAAGTCCATCTTCAGTGGGTAATAATTTGGGAGGGCAAAATTCA  1400

Query  442   CCCGGTTGTTGGGGTCCCGTTTATCCACAGTGGGGAACACCACCAAATGCTCCACAGTGG  501
             ||||||| |||||||||| |||||||||||||||||||||||||||| | | ||||||||
Sbjct  1401  CCCGGTTTTTGGGGTCCCATTTATCCACAGTGGGGAACACCACCAAACGTTGCACAGTGG  1460

Query  502   AGAACACCACCAAATGCTCCACAGTGGGGTACACCACCAAATGC  545
              |||||| ||||||| ||||| ||||||| ||||||||||||||
Sbjct  1461  GGAACACAACCAAATACTCCATAGTGGGGAACACCACCAAATGC  1504


> AT5G36080.1 | Symbols:  | unknown protein; BEST Arabidopsis thaliana 
protein match is: unknown protein (TAIR:AT3G32904.1); 
Has 1807 Blast hits to 1807 proteins in 277 species: Archae 
- 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; 
Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). 
| chr5:14184423-14185277 FORWARD LENGTH=462
Length=462

 Score =  172 bits (93),  Expect = 2e-42
 Identities = 138/160 (86%), Gaps = 1/160 (1%)
 Strand=Plus/Plus

Query  157  GAAATCGTTGGCGACAATCGTTTGAGACAACAATACAAGATACCATCGCTGGCTACAGAG  216
            |||||||| |  ||||||| ||||| ||||| |||||||| || || ||||| || ||||
Sbjct  304  GAAATCGTAGAAGACAATCTTTTGAAACAACTATACAAGACACTATTGCTGGTTATAGAG  363

Query  217  AATTCCAACGACAAAGTTTTCAACAACTTCGTCCTAGTTGGTTTTGACCAAGATGATTAT  276
            |||| |||||||||||||||||||||||||||||  | || |||||||||||||||||| 
Sbjct  364  AATTTCAACGACAAAGTTTTCAACAACTTCGTCCC-GATGCTTTTGACCAAGATGATTAC  422

Query  277  GATGAATTGAAAAAGGCGGAAGCGATATTCATTGCGCTAG  316
             ||||||| |||||||| |||||||||||  |||||||||
Sbjct  423  AATGAATTTAAAAAGGCAGAAGCGATATTTGTTGCGCTAG  462


> AT1G35820.1 | Symbols:  | unknown protein; BEST Arabidopsis thaliana 
protein match is: unknown protein (TAIR:AT3G30520.1); 
Has 30201 Blast hits to 17322 proteins in 780 species: Archae 
- 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants 
- 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI 
BLink). | chr1:13308119-13309257 REVERSE LENGTH=864
Length=864

 Score =  122 bits (66),  Expect = 2e-27
 Identities = 78/84 (93%), Gaps = 0/84 (0%)
 Strand=Plus/Plus

Query  168  CGACAATCGTTTGAGACAACAATACAAGATACCATCGCTGGCTACAGAGAATTCCAACGA  227
            |||||||| ||||| |||||||||||||| | |||| |||||| ||||||||||||||||
Sbjct  541  CGACAATCCTTTGAAACAACAATACAAGACATCATCACTGGCTTCAGAGAATTCCAACGA  600

Query  228  CAAAGTTTTCAACAACTTCGTCCT  251
            ||||||||||||||||||||||||
Sbjct  601  CAAAGTTTTCAACAACTTCGTCCT  624



Lambda     K      H
    1.33    0.621     1.12 

Gapped
Lambda     K      H
    1.28    0.460    0.850 

Effective search space used: 26473395469


  Database: TAIR10_cdna_20110103_representative_gene_model_updated
    Posted date:  Sep 25, 2014  6:13 PM
  Number of letters in database: 51,074,197
  Number of sequences in database:  33,602



Matrix: blastn matrix 1 -2
Gap Penalties: Existence: 0, Extension: 2.5