BLASTN 2.2.26+
Reference:
Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000),
"A greedy algorithm for aligning DNA sequences", J Comput Biol 2000;
7(1-2):203-14.
Database: TAIR10_cdna_20110103_representative_gene_model_updated
33,602 sequences; 51,074,197 total letters
Query= Ahg913233
Length=177
Score E
Sequences producing significant alignments: (Bits) Value
AT3G43870.1 | Symbols: | unknown protein; BEST Arabidopsis tha... 215 1e-55
AT3G32990.1 | Symbols: | pseudogene, ATP synthase C subunit, b... 193 5e-49
AT3G44210.1 | Symbols: | unknown protein; BEST Arabidopsis tha... 193 5e-49
AT1G35790.1 | Symbols: | transposable element gene | chr1:1329... 139 7e-33
AT1G34315.1 | Symbols: | unknown protein; FUNCTIONS IN: molecu... 58.4 2e-08
> AT3G43870.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT3G44210.1);
Has 18 Blast hits to 18 proteins in 2 species: Archae - 0;
Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 18; Viruses
- 0; Other Eukaryotes - 0 (source: NCBI BLink). | chr3:15728176-15729150
FORWARD LENGTH=564
Length=564
Score = 215 bits (116), Expect = 1e-55
Identities = 134/142 (94%), Gaps = 4/142 (3%)
Strand=Plus/Plus
Query 14 TGATGGACCGACGGGTTAACAGCTGGTCAAGAACGGCTGACGAGCCCGGAATGGGAGTGT 73
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 368 TGATGGACCGACGGGTTAACAGCTGGTCAAGAACGGCTGACGAGCCCGGAATGGGAGTGT 427
Query 74 CTGAGCCACGAGGGCCAGCCTCCAC-TCAGGAGGTTCTGGTCATCACGAGGATGACTGGA 132
||||||||||||||||||||||||| | | |||||||| ||| |||||||||||||||||
Sbjct 428 CTGAGCCACGAGGGCCAGCCTCCACCT-ATGAGGTTCTAGTCGTCACGAGGATGACTGGA 486
Query 133 CCTAACGATGTATGTATGGTTA 154
||||||||||| ||||| |||
Sbjct 487 CCTAACGATGT--GTATGTTTA 506
> AT3G32990.1 | Symbols: | pseudogene, ATP synthase C subunit,
blastp match of 86% identity and 9.3e-27 P-value to SP|P06286|ATPH_TOBAC
ATP synthase C chain (EC 3.6.3.14) (Lipid-binding
protein) (Subunit III). (Common tobacco) {Nicotiana tabacum}
| chr3:13534183-13536711 REVERSE LENGTH=2529
Length=2529
Score = 193 bits (104), Expect = 5e-49
Identities = 129/141 (91%), Gaps = 2/141 (1%)
Strand=Plus/Plus
Query 14 TGATGGACCGACGGGTTAACAGCTGGTCAAGAACGGCTGACGAGCCCGGAATGGGAGTGT 73
||||||||||||||||||||||||||||||||||||||||||||||| |||| ||||||
Sbjct 1945 TGATGGACCGACGGGTTAACAGCTGGTCAAGAACGGCTGACGAGCCCAGAATATGAGTGT 2004
Query 74 CTGAGCCACGAGGGCCAGCCTCCACTCAGGAGGTTCTGGTCATCACGAGGATGACTGGAC 133
|||||||| ||||||||||||||||||| |||||||| ||| ||||| |||||||| |||
Sbjct 2005 CTGAGCCATGAGGGCCAGCCTCCACTCATGAGGTTCTAGTCGTCACGGGGATGACTTGAC 2064
Query 134 CTAACGATGTATGTATGGTTA 154
|||||||||| ||||| |||
Sbjct 2065 CTAACGATGT--GTATGTTTA 2083
> AT3G44210.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT3G43870.1);
Has 15 Blast hits to 15 proteins in 2 species: Archae - 0;
Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 15; Viruses
- 0; Other Eukaryotes - 0 (source: NCBI BLink). | chr3:15916270-15917332
FORWARD LENGTH=432
Length=432
Score = 193 bits (104), Expect = 5e-49
Identities = 120/128 (94%), Gaps = 0/128 (0%)
Strand=Plus/Plus
Query 14 TGATGGACCGACGGGTTAACAGCTGGTCAAGAACGGCTGACGAGCCCGGAATGGGAGTGT 73
||||||||||||||||||||||||||||||||||||||||| ||||| ||||||||||||
Sbjct 260 TGATGGACCGACGGGTTAACAGCTGGTCAAGAACGGCTGACAAGCCCAGAATGGGAGTGT 319
Query 74 CTGAGCCACGAGGGCCAGCCTCCACTCAGGAGGTTCTGGTCATCACGAGGATGACTGGAC 133
|||||||||||||||||||||||||| | |||||||| || ||||||||||||||||||
Sbjct 320 CTGAGCCACGAGGGCCAGCCTCCACTTATGAGGTTCTAGTTGTCACGAGGATGACTGGAC 379
Query 134 CTAACGAT 141
|| |||||
Sbjct 380 CTTACGAT 387
> AT1G35790.1 | Symbols: | transposable element gene | chr1:13295190-13300109
REVERSE LENGTH=4920
Length=4920
Score = 139 bits (75), Expect = 7e-33
Identities = 81/84 (96%), Gaps = 0/84 (0%)
Strand=Plus/Minus
Query 14 TGATGGACCGACGGGTTAACAGCTGGTCAAGAACGGCTGACGAGCCCGGAATGGGAGTGT 73
|||||||||||||||||||||||| |||||||||||||||||||||||||||| ||||||
Sbjct 1484 TGATGGACCGACGGGTTAACAGCTAGTCAAGAACGGCTGACGAGCCCGGAATGAGAGTGT 1425
Query 74 CTGAGCCACGAGGGCCAGCCTCCA 97
||||||||||||||||||| ||||
Sbjct 1424 CTGAGCCACGAGGGCCAGCTTCCA 1401
Score = 69.4 bits (37), Expect = 1e-11
Identities = 55/63 (87%), Gaps = 4/63 (6%)
Strand=Plus/Minus
Query 93 CTCCA-CTCAGGAGGTTCTGGTCATCACGAGGATGACTGGACCTAACGATGTATGTATGG 151
||||| || | |||||||| ||| |||||||||||||||||||||||||||| |||||
Sbjct 1352 CTCCACCT-ATGAGGTTCTAGTCGTCACGAGGATGACTGGACCTAACGATGT--GTATGT 1296
Query 152 TTA 154
|||
Sbjct 1295 TTA 1293
> AT1G34315.1 | Symbols: | unknown protein; FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown;
LOCATED IN: endomembrane system; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT3G31400.1); Has
4 Blast hits to 4 proteins in 1 species: Archae - 0; Bacteria
- 0; Metazoa - 0; Fungi - 0; Plants - 4; Viruses - 0; Other
Eukaryotes - 0 (source: NCBI BLink). | chr1:12514106-12516521
REVERSE LENGTH=819
Length=819
Score = 58.4 bits (31), Expect = 2e-08
Identities = 44/50 (88%), Gaps = 2/50 (4%)
Strand=Plus/Plus
Query 105 GGTTCTGGTCATCACGAGGATGACTGGACCTAACGATGTATGTATGGTTA 154
|||||| || |||||||||||||||||||||||||||| ||||| |||
Sbjct 423 GGTTCTAGTTGTCACGAGGATGACTGGACCTAACGATGT--GTATGTTTA 470
Lambda K H
1.33 0.621 1.12
Gapped
Lambda K H
1.28 0.460 0.850
Effective search space used: 7746408054
Database: TAIR10_cdna_20110103_representative_gene_model_updated
Posted date: Sep 25, 2014 6:13 PM
Number of letters in database: 51,074,197
Number of sequences in database: 33,602
Matrix: blastn matrix 1 -2
Gap Penalties: Existence: 0, Extension: 2.5