BLASTN 2.2.26+


Reference:
Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000),
"A greedy algorithm for aligning DNA sequences", J Comput Biol 2000;
7(1-2):203-14.



Database: TAIR10_cdna_20110103_representative_gene_model_updated
           33,602 sequences; 51,074,197 total letters



Query= Ahg923458

Length=422
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

  AT1G50220.1 | Symbols:  | unknown protein; BEST Arabidopsis tha...   599    7e-171
  AT1G43171.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecu...   597    3e-170
  AT5G54067.1 | Symbols:  | unknown protein; BEST Arabidopsis tha...   303    6e-82 
  AT1G50190.1 | Symbols:  | Cysteine/Histidine-rich C1 domain fam...   279    1e-74 


> AT1G50220.1 | Symbols:  | unknown protein; BEST Arabidopsis thaliana 
protein match is: unknown protein (TAIR:AT1G43171.1); 
Has 31 Blast hits to 31 proteins in 2 species: Archae - 0; 
Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 31; Viruses 
- 0; Other Eukaryotes - 0 (source: NCBI BLink). | chr1:18602958-18604575 
REVERSE LENGTH=612
Length=612

 Score =  599 bits (324),  Expect = 7e-171
 Identities = 367/388 (95%), Gaps = 2/388 (1%)
 Strand=Plus/Plus

Query  15   AGCAAGATGAATTATGGTAGAGATGAACTAGACGGGGCTATGATCATATCAAAGACTCTA  74
            |||||||||||||||| |||||||||||||||||||||||||||||||||||||||||||
Sbjct  226  AGCAAGATGAATTATGCTAGAGATGAACTAGACGGGGCTATGATCATATCAAAGACTCTA  285

Query  75   TCAAAGAGTGACATTGTTGGTAATGTGGTATTACCAAAAACACAAGTGATGTCTGTCCTC  134
             ||||||||||||||||||||||||||| |||||||||| | ||||||||||||||||||
Sbjct  286  ACAAAGAGTGACATTGTTGGTAATGTGGCATTACCAAAAGCGCAAGTGATGTCTGTCCTC  345

Query  135  ACGAGGATGAATGGTGTTACAGATGAGGGTTTGGACAACGGTTTTGAAGTGCAAGTCCAC  194
            ||||||||||||||||| ||||||||||||||||||||||||||||||||||||||||||
Sbjct  346  ACGAGGATGAATGGTGTCACAGATGAGGGTTTGGACAACGGTTTTGAAGTGCAAGTCCAC  405

Query  195  GACATAATAGAAGACGATTTATGCACAGTTACCCTGAAAAGAATTGACGACACT-AAGTA  253
            |||||||| ||||||||| | | ||||||||||||||||||||||||||||| | |||||
Sbjct  406  GACATAATGGAAGACGATCTGTACACAGTTACCCTGAAAAGAATTGACGACA-TGAAGTA  464

Query  254  TTATTTCGGGACTGGTTGGAGTATTATGAAGCATTCGTTAGATCTCGTAGAAGGCGATGT  313
            ||||||||||||||||||||||| |||||||||||| |||||||||||||||||||||||
Sbjct  465  TTATTTCGGGACTGGTTGGAGTACTATGAAGCATTCATTAGATCTCGTAGAAGGCGATGT  524

Query  314  TCTGAAGCTTTACTGGGATCAGTTTGAAAACAAATTCATTGTTCTTAATTTTCAGTATAA  373
            || |||||| || |||||||||||||||||||||||||||||||| ||||||||| ||||
Sbjct  525  TCAGAAGCTCTATTGGGATCAGTTTGAAAACAAATTCATTGTTCTCAATTTTCAGCATAA  584

Query  374  GACTATGGGAATAATGATTAATGTGTAG  401
            |||||||||||||||||||  |||||||
Sbjct  585  GACTATGGGAATAATGATTCCTGTGTAG  612


> AT1G43171.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecular_function 
unknown; INVOLVED IN: biological_process unknown; 
LOCATED IN: cellular_component unknown; BEST Arabidopsis 
thaliana protein match is: unknown protein (TAIR:AT1G50220.1); 
Has 30201 Blast hits to 17322 proteins in 780 species: 
Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; 
Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: 
NCBI BLink). | chr1:16269075-16270513 FORWARD LENGTH=1439
Length=1439

 Score =  597 bits (323),  Expect = 3e-170
 Identities = 388/418 (93%), Gaps = 9/418 (2%)
 Strand=Plus/Plus

Query  3     GTTAAATTTTTTAGCAAGATGAATTATGGTAGAGATGAACTAGACGGGGCTATGATCATA  62
             ||||||  |||||||||||||||||||| |  ||||||||||||||||||||||||||||
Sbjct  891   GTTAAA-ATTTTAGCAAGATGAATTATGCT--AGATGAACTAGACGGGGCTATGATCATA  947

Query  63    TCAAAGACTCTATCAAAGAGTGACATTGTTGGTAATGTGGTATTACCAAAAACACAAGTG  122
             |||||||||||| |||||| |||||||||||||||||||| |||||||||| ||||||||
Sbjct  948   TCAAAGACTCTAACAAAGACTGACATTGTTGGTAATGTGGCATTACCAAAAGCACAAGTG  1007

Query  123   ATGTCTGTCCTCACGAGGATGAATGGTGTTACAGATGAGGGTTTGGACAACGGTTTTGAA  182
             |||||||||||||| |||||||||||||| ||||||||||||||||||||||||||||||
Sbjct  1008  ATGTCTGTCCTCACAAGGATGAATGGTGTCACAGATGAGGGTTTGGACAACGGTTTTGAA  1067

Query  183   GTGCAAGTCCACGACATAATAGAAGACGATTTATGCACAGTTACCCTGAAAAGAATTGAC  242
             |||||||||||||||||||| ||||||||| ||| |||||||||||||||||||||||||
Sbjct  1068  GTGCAAGTCCACGACATAATGGAAGACGATCTATACACAGTTACCCTGAAAAGAATTGAC  1127

Query  243   GACACT-AAGTATTATTTCGGGACTGGTTGGAGTATTATGAAGCATTCGTTAGATCTCGT  301
             |||| | |||||||||||||||||||||||||||| |||||||||||| |||||||||||
Sbjct  1128  GACA-TGAAGTATTATTTCGGGACTGGTTGGAGTACTATGAAGCATTCATTAGATCTCGT  1186

Query  302   AGAAGGCGATGTTCTGAAGCTTTACTGGGATCAGTTTGAAAACAAATTCATTGTTCTTAA  361
             ||||||||||||||||||||| || |||||||||||||||||||||||||||||||| ||
Sbjct  1187  AGAAGGCGATGTTCTGAAGCTCTATTGGGATCAGTTTGAAAACAAATTCATTGTTCTCAA  1246

Query  362   TTTTCAGTATAAGACTATGGGAATAATGATTAATGTGTAGCTCGCTCGCTTACATCTA  419
             |||| || ||||||||||||||| |||||||  |||||||||   | |||||||||||
Sbjct  1247  TTTTTAGCATAAGACTATGGGAACAATGATTCCTGTGTAGCT---T-GCTTACATCTA  1300


> AT5G54067.1 | Symbols:  | unknown protein; BEST Arabidopsis thaliana 
protein match is: unknown protein (TAIR:AT1G50220.1); 
Has 30201 Blast hits to 17322 proteins in 780 species: Archae 
- 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants 
- 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI 
BLink). | chr5:21941602-21942146 REVERSE LENGTH=545
Length=545

 Score =  303 bits (164),  Expect = 6e-82
 Identities = 308/373 (83%), Gaps = 27/373 (7%)
 Strand=Plus/Plus

Query  50   GGCTATGATCATATCAAAGACT-CTATCAAAGAGTGACATTGTTGGTAATGTGGTATTAC  108
            ||||||||||||| |||| | | |||||||||||||| || |||||||||||||||||||
Sbjct  27   GGCTATGATCATAACAAA-AGTGCTATCAAAGAGTGATATCGTTGGTAATGTGGTATTAC  85

Query  109  CAAAAACACAAGTGATGTCTGTCCTCACGAGGATGAATGGTGTTACAGATGAGGGTTTGG  168
            | ||| || |||||||||||||||||||||||||||||| | | || ||  | | |||| 
Sbjct  86   CGAAAGCAGAAGTGATGTCTGTCCTCACGAGGATGAATG-T-TAAC-GACCAAGATTTGC  142

Query  169  -ACAACGGTTTTGAAGTGCAAGTCCACGACATAATAGAAGACGATTTATGCACAGTTACC  227
             | |||||| |||||||||||||| |||||||||| |||||||| |||| ||||||||| 
Sbjct  143  TA-AACGGTGTTGAAGTGCAAGTCGACGACATAATGGAAGACGACTTATACACAGTTACG  201

Query  228  CTGAAAAGAATT-G--A-CGACA--C-TAAGTATTATTTCGGGACTGGTTGGAGTATTAT  280
            || ||| | ||  |  | ||| |  | ||| ||||||||||| ||||||||||||| |||
Sbjct  202  CTCAAA-GTATCAGGTATCGATAAACCTAAATATTATTTCGGTACTGGTTGGAGTACTAT  260

Query  281  GAAGCATTCGTTAGATCTCGT-AGAAGGCGATGTTCTGAAGCTTTACTGG-GATCAGTTT  338
            ||||||||||||||||||| | |||||||||||||||||| || |||||| | ||| || 
Sbjct  261  GAAGCATTCGTTAGATCTC-TCAGAAGGCGATGTTCTGAAACTCTACTGGAG-TCACTTG  318

Query  339  GAAAACAAATTCATTGTTCTT-AATTTTCAGTATA-AG-ACTATGGGAATAATGATTA-A  394
            || ||||| ||| ||||| || ||||||||||||  || ||| |   | ||||||||  |
Sbjct  319  GACAACAAGTTCGTTGTT-TTGAATTTTCAGTATTCAGTACT-TCC-ATTAATGATTCCA  375

Query  395  TGTGTAGCTCGCT  407
             ||||||||||||
Sbjct  376  -GTGTAGCTCGCT  387


> AT1G50190.1 | Symbols:  | Cysteine/Histidine-rich C1 domain family 
protein | chr1:18588229-18590799 REVERSE LENGTH=1860
Length=1860

 Score =  279 bits (151),  Expect = 1e-74
 Identities = 163/169 (96%), Gaps = 0/169 (0%)
 Strand=Plus/Plus

Query  17    CAAGATGAATTATGGTAGAGATGAACTAGACGGGGCTATGATCATATCAAAGACTCTATC  76
             |||||||||||||| ||||||||||||||||||||||||||||||||||||||||||| |
Sbjct  1689  CAAGATGAATTATGCTAGAGATGAACTAGACGGGGCTATGATCATATCAAAGACTCTAAC  1748

Query  77    AAAGAGTGACATTGTTGGTAATGTGGTATTACCAAAAACACAAGTGATGTCTGTCCTCAC  136
             |||||||||||||||||||||||||| |||||||||| | ||||||||||||||||||||
Sbjct  1749  AAAGAGTGACATTGTTGGTAATGTGGCATTACCAAAAGCGCAAGTGATGTCTGTCCTCAC  1808

Query  137   GAGGATGAATGGTGTTACAGATGAGGGTTTGGACAACGGTTTTGAAGTG  185
             ||||||||||||||| |||||||||||||||||||||||||||||||||
Sbjct  1809  GAGGATGAATGGTGTCACAGATGAGGGTTTGGACAACGGTTTTGAAGTG  1857



Lambda     K      H
    1.33    0.621     1.12 

Gapped
Lambda     K      H
    1.28    0.460    0.850 

Effective search space used: 20006564102


  Database: TAIR10_cdna_20110103_representative_gene_model_updated
    Posted date:  Sep 25, 2014  6:13 PM
  Number of letters in database: 51,074,197
  Number of sequences in database:  33,602



Matrix: blastn matrix 1 -2
Gap Penalties: Existence: 0, Extension: 2.5