BLASTN 2.2.26+


Reference:
Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000),
"A greedy algorithm for aligning DNA sequences", J Comput Biol 2000;
7(1-2):203-14.



Database: TAIR10_cdna_20110103_representative_gene_model_updated
           33,602 sequences; 51,074,197 total letters



Query= Ahg917075

Length=324
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

  AT5G54062.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecu...   381    2e-105
  AT5G53905.1 | Symbols:  | unknown protein; CONTAINS InterPro DO...   213    8e-55 
  AT5G53742.1 | Symbols:  | Protein of unknown function (DUF1278)...   174    4e-43 


> AT5G54062.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecular_function 
unknown; INVOLVED IN: biological_process unknown; 
LOCATED IN: endomembrane system; CONTAINS InterPro DOMAIN/s: 
Protein of unknown function DUF1278 (InterPro:IPR010701); 
BEST Arabidopsis thaliana protein match is: Protein of unknown 
function (DUF1278) (TAIR:AT5G53742.1); Has 30201 Blast 
hits to 17322 proteins in 780 species: Archae - 12; Bacteria 
- 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses 
- 0; Other Eukaryotes - 2996 (source: NCBI BLink). | chr5:21938793-21939623 
FORWARD LENGTH=831
Length=831

 Score =  381 bits (206),  Expect = 2e-105
 Identities = 263/289 (91%), Gaps = 10/289 (3%)
 Strand=Plus/Plus

Query  1    TCTCGCAAAATGTTGGTCATCGCTCATC-ACATTCAAGGTTGCGAA-GTCGAAATCTTTA  58
            ||||||||||||||||||||||||| || ||||||| |||||| ||  ||||||||||||
Sbjct  196  TCTCGCAAAATGTTGGTCATCGCTCCTCAACATTCACGGTTGC-AATATCGAAATCTTTA  254

Query  59   AATCTGTTTTAACCGGTAAGATTGAAAACGTTGGA-CCAACATGCTGCAAGGCGTTTACG  117
            |||||||||||||||||||| |||||||||||||| || || ||||||||||| ||||| 
Sbjct  255  AATCTGTTTTAACCGGTAAGTTTGAAAACGTTGGATCC-ACGTGCTGCAAGGCTTTTAC-  312

Query  118  A-AAGTGGATGCAAACTGTTGGCCAAAAATGTTTTCGTTGAATCCGTTATTCCCTCCTCT  176
            | ||||||||||||| |||||||||||||||||| |||||||||| ||||||||||||||
Sbjct  313  AGAAGTGGATGCAAAGTGTTGGCCAAAAATGTTTCCGTTGAATCCATTATTCCCTCCTCT  372

Query  177  TCTCAAGGATGGTTGCTCTCGCATCATCGCAGGTGCACCCGCTGCACACACGACACCTCA  236
            |||||||||||||||||||||||||||| |||||||||| ||  |||| | |||||| ||
Sbjct  373  TCTCAAGGATGGTTGCTCTCGCATCATCTCAGGTGCACCAGCA-CACA-A-GACACCGCA  429

Query  237  GTTCCCTGTCATCTCTGGTTCTCCGGTCGATCTCACAAAATGTTTGTCA  285
             |||||||||||| ||||||||||| |||||||||||||||||||||||
Sbjct  430  ATTCCCTGTCATCCCTGGTTCTCCGATCGATCTCACAAAATGTTTGTCA  478


> AT5G53905.1 | Symbols:  | unknown protein; CONTAINS InterPro 
DOMAIN/s: Protein of unknown function DUF1278 (InterPro:IPR010701); 
BEST Arabidopsis thaliana protein match is: unknown 
protein (TAIR:AT5G54062.1); Has 30201 Blast hits to 17322 proteins 
in 780 species: Archae - 12; Bacteria - 1396; Metazoa 
- 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes 
- 2996 (source: NCBI BLink). | chr5:21888148-21889254 
REVERSE LENGTH=507
Length=507

 Score =  213 bits (115),  Expect = 8e-55
 Identities = 213/256 (83%), Gaps = 23/256 (9%)
 Strand=Plus/Plus

Query  1    TCTCGCAAAATGTTGGTCATCGCTCATC-ACATTCAAGGTTGCGAA-GTCGAAATCTTT-  57
            |||| |||||||||||||||||||| || |  ||||||||||| ||  ||||||| ||| 
Sbjct  72   TCTCACAAAATGTTGGTCATCGCTCTTCAATGTTCAAGGTTGC-AATATCGAAAT-TTTA  129

Query  58   AAATCTGTTTTAACCGGTAAGATTGAAAACGTTGGACCAACATGCTGCAAGGCGTTTACG  117
            |||| ||||||||||| ||||  |       || |   || | ||||||| |||||||||
Sbjct  130  AAATATGTTTTAACCGCTAAG--T-------TT-G---AA-A-GCTGCAAAGCGTTTACG  174

Query  118  AAAGTGGATGCAAACTGTTGGCCAAAAATGTTTTCGTTGAATCCGTTATTCCCTCCTCTT  177
             |||||||||||||||||||||||||||||||| |  ||||||| || ||||||||||||
Sbjct  175  GAAGTGGATGCAAACTGTTGGCCAAAAATGTTTCCACTGAATCCATTTTTCCCTCCTCTT  234

Query  178  CTCAAGGATGGTTGCTCTCGCATCATCGCAGGTGCACCCGCTGCACACACGACACCTCAG  237
             |||||||||||||||||||||||||| || ||||||||||   ||||||||||||||| 
Sbjct  235  GTCAAGGATGGTTGCTCTCGCATCATCTCAAGTGCACCCGC---ACACACGACACCTCAA  291

Query  238  TTCCCTGTCATCTCTG  253
             ||||||||||| |||
Sbjct  292  CTCCCTGTCATCCCTG  307


> AT5G53742.1 | Symbols:  | Protein of unknown function (DUF1278) 
| chr5:21813671-21814018 FORWARD LENGTH=348
Length=348

 Score =  174 bits (94),  Expect = 4e-43
 Identities = 126/141 (89%), Gaps = 3/141 (2%)
 Strand=Plus/Plus

Query  145  ATGTTTTCGTTGAATCCGTTATTCCCTCCTCTTCTCAAGGATGGTTGCTCTCGCATCATC  204
            |||||| |||||||||| ||||||||||||||||||||||||||||||||||||||||||
Sbjct  1    ATGTTTCCGTTGAATCCATTATTCCCTCCTCTTCTCAAGGATGGTTGCTCTCGCATCATC  60

Query  205  GCAGGTGCACCCGCTGCACACACGACACCTCAGTTCCCTGTCATCTCTGGTTCTCCGGTC  264
             | || |||||  |   |||||||| |||||| |||| ||||||| ||||||||||| ||
Sbjct  61   TCTGGCGCACCAAC---ACACACGAAACCTCAATTCCTTGTCATCCCTGGTTCTCCGATC  117

Query  265  GATCTCACAAAATGTTTGTCA  285
            |||||||||||||||||||||
Sbjct  118  GATCTCACAAAATGTTTGTCA  138



Lambda     K      H
    1.33    0.621     1.12 

Gapped
Lambda     K      H
    1.28    0.460    0.850 

Effective search space used: 15080324700


  Database: TAIR10_cdna_20110103_representative_gene_model_updated
    Posted date:  Sep 25, 2014  6:13 PM
  Number of letters in database: 51,074,197
  Number of sequences in database:  33,602



Matrix: blastn matrix 1 -2
Gap Penalties: Existence: 0, Extension: 2.5