BLASTN 2.2.26+


Reference:
Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000),
"A greedy algorithm for aligning DNA sequences", J Comput Biol 2000;
7(1-2):203-14.



Database: TAIR10_cdna_20110103_representative_gene_model_updated
           33,602 sequences; 51,074,197 total letters



Query= Ahg923815

Length=278
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

  AT1G52825.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecu...   353    4e-97
  AT4G14615.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecu...   244    2e-64


> AT1G52825.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecular_function 
unknown; INVOLVED IN: biological_process unknown; 
LOCATED IN: endomembrane system; CONTAINS InterPro DOMAIN/s: 
Protein of unknown function DUF2346 (InterPro:IPR018625); 
BEST Arabidopsis thaliana protein match is: unknown protein 
(TAIR:AT4G14615.1); Has 52 Blast hits to 49 proteins in 
19 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; 
Plants - 51; Viruses - 0; Other Eukaryotes - 1 (source: NCBI 
BLink). | chr1:19670489-19670967 REVERSE LENGTH=479
Length=479

 Score =  353 bits (191),  Expect = 4e-97
 Identities = 242/266 (91%), Gaps = 6/266 (2%)
 Strand=Plus/Plus

Query  5    TTGGAGTCTCTTACAAATGTCGTCAGTAGGAACATCAAAGGGGATTCTTGAAATCGCCAA  64
            |||||||||| |||||||||| |||||||||||||||||||||||||| |||||||||||
Sbjct  58   TTGGAGTCTCATACAAATGTCATCAGTAGGAACATCAAAGGGGATTCTAGAAATCGCCAA  117

Query  65   ATTCGGTGTATACGTCGCTGTTCCGATCGTCCTAATGTATACATTCGCCAACAACAGCAC  124
            |||||||||||||||||| || || ||||||||||| |||||||||||||||||||||||
Sbjct  118  ATTCGGTGTATACGTCGCAGTACCAATCGTCCTAATCTATACATTCGCCAACAACAGCAC  177

Query  125  CAATATCAAGAAATTCATGGGCAATCATTCATACGTTGTTTATCCTAAAGAAGCTCCTCG  184
            ||| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  178  CAACATCAAGAAATTCATGGGCAATCATTCATACGTTGTTTATCCTAAAGAAGCTCCTCG  237

Query  185  TCCTCCTTCGCCTGAAGAGCTACGAGAGATGGCTAAAGAAATTGCTCGCAACAAGAACA-  243
            ||||||||| |||||||||||||| |||||||||||| || |||| | ||| || || | 
Sbjct  238  TCCTCCTTCACCTGAAGAGCTACGTGAGATGGCTAAACAACTTGCCCTCAAAAAAAAAAA  297

Query  244  ---TCCCTTCAAATTGATTCTACTTG  266
               ||||||| ||||||| ||| |||
Sbjct  298  AAATCCCTTCTAATTGAT-CTA-TTG  321


> AT4G14615.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecular_function 
unknown; INVOLVED IN: biological_process unknown; 
LOCATED IN: endomembrane system; EXPRESSED IN: 24 plant 
structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro 
DOMAIN/s: Protein of unknown function DUF2346 (InterPro:IPR018625); 
BEST Arabidopsis thaliana protein match is: unknown 
protein (TAIR:AT1G52825.1); Has 30201 Blast hits to 17322 
proteins in 780 species: Archae - 12; Bacteria - 1396; 
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other 
Eukaryotes - 2996 (source: NCBI BLink). | chr4:8383764-8385079 
FORWARD LENGTH=574
Length=574

 Score =  244 bits (132),  Expect = 2e-64
 Identities = 198/230 (86%), Gaps = 4/230 (2%)
 Strand=Plus/Plus

Query  19   AAATGTCGTCAGTAGGAACATC-AAAGGGGATTCTTGAAATCGCCAAATTCGGTGTATAC  77
            ||||||| || || |||||||| ||||||| |||| || |||| ||| |||||||| || 
Sbjct  77   AAATGTCATCTGTTGGAACATCGAAAGGGG-TTCTGGAGATCGTCAAGTTCGGTGTCTAT  135

Query  78   GTCGCTGTTCCGATCGTCCTAATGTATACATTCGCCAACAACAGCACCAATATCAAGAAA  137
            |||||||||||||||||||| ||||| |||||||||||||||||||||||||||||||||
Sbjct  136  GTCGCTGTTCCGATCGTCCTTATGTACACATTCGCCAACAACAGCACCAATATCAAGAAA  195

Query  138  TTCATGGGCAATCATTCATACGTTGTTTATCCTAAAGAAGCTCCTCGTCCTCCTTCGCCT  197
            ||||||||||||| |||||| |||||||||||| |||| || ||| | |||||||| || 
Sbjct  196  TTCATGGGCAATCGTTCATATGTTGTTTATCCTGAAGAGGCACCTAGACCTCCTTCACCC  255

Query  198  GAAGAGCTACGAGAGATGGCTAA-AGAAATTGCTCGCAACAAGAACATCC  246
            || |||||| |||||||||| |  |||  |||| || || ||||||||||
Sbjct  256  GATGAGCTAAGAGAGATGGC-ACGAGAGCTTGCCCGTAAGAAGAACATCC  304



Lambda     K      H
    1.33    0.621     1.12 

Gapped
Lambda     K      H
    1.28    0.460    0.850 

Effective search space used: 12768008246


  Database: TAIR10_cdna_20110103_representative_gene_model_updated
    Posted date:  Sep 25, 2014  6:13 PM
  Number of letters in database: 51,074,197
  Number of sequences in database:  33,602



Matrix: blastn matrix 1 -2
Gap Penalties: Existence: 0, Extension: 2.5