BLASTN 2.2.26+
Reference:
Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000),
"A greedy algorithm for aligning DNA sequences", J Comput Biol 2000;
7(1-2):203-14.
Database: TAIR10_cdna_20110103_representative_gene_model_updated
33,602 sequences; 51,074,197 total letters
Query= Ahg479617
Length=273
                                                                    Score    E
Sequences producing significant alignments:                        (Bits)  Value
AT3G20898.1 | Symbols: | unknown protein; FUNCTIONS IN: molecu...     196  7e-50
AT3G20900.1 | Symbols: | unknown protein; Has 2 Blast hits to ...     195  2e-49
AT1G51355.1 | Symbols: | unknown protein; FUNCTIONS IN: molecu...    89.8  1e-17
> AT3G20898.1 | Symbols: | unknown protein; FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown;
LOCATED IN: cellular_component unknown; BEST Arabidopsis
thaliana protein match is: unknown protein (TAIR:AT1G51355.1);
Has 66 Blast hits to 66 proteins in 10 species: Archae
- 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 66; Viruses
- 0; Other Eukaryotes - 0 (source: NCBI BLink). | chr3:7323683-7324634
FORWARD LENGTH=871
Length=871
Score = 196 bits (106), Expect = 7e-50
Identities = 121/128 (95%), Gaps = 2/128 (2%)
Strand=Plus/Minus
Query  83   GAGCAAAGAAAACGATCTTTCTCCGTCGTAACACACAGTTCTTT-GTGACCCTTTGCTTC  141
            |||||||||||||||||| ||||||||||||||||||||| ||| | |||||||||||||
Sbjct  396  GAGCAAAGAAAACGATCTGTCTCCGTCGTAACACACAGTT-TTTCGAGACCCTTTGCTTC  338
Query  142  TTCGGCGCCGGTGGACACGTCAGCATCTCCGGTATCCTAGACTTCTTAGCTTTTGGAGTA  201
            ||||||||||||||||||||||||||||||||||||||||||||||| ||||| || |||
Sbjct  337  TTCGGCGCCGGTGGACACGTCAGCATCTCCGGTATCCTAGACTTCTTGGCTTTCGGGGTA  278
Query  202  CAACAACC  209
            ||||||||
Sbjct  277  CAACAACC  270
> AT3G20900.1 | Symbols: | unknown protein; Has 2 Blast hits to
2 proteins in 1 species: Archae - 0; Bacteria - 0; Metazoa
- 0; Fungi - 0; Plants - 2; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr3:7323984-7324670 REVERSE LENGTH=207
Length=207
Score = 195 bits (105), Expect = 2e-49
Identities = 120/127 (94%), Gaps = 2/127 (2%)
Strand=Plus/Plus
Query  84   AGCAAAGAAAACGATCTTTCTCCGTCGTAACACACAGTTCTTT-GTGACCCTTTGCTTCT  142
            ||||||||||||||||| ||||||||||||||||||||| ||| | ||||||||||||||
Sbjct  33   AGCAAAGAAAACGATCTGTCTCCGTCGTAACACACAGTT-TTTCGAGACCCTTTGCTTCT  91
Query  143  TCGGCGCCGGTGGACACGTCAGCATCTCCGGTATCCTAGACTTCTTAGCTTTTGGAGTAC  202
            |||||||||||||||||||||||||||||||||||||||||||||| ||||| || ||||
Sbjct  92   TCGGCGCCGGTGGACACGTCAGCATCTCCGGTATCCTAGACTTCTTGGCTTTCGGGGTAC  151
Query  203  AACAACC  209
            |||||||
Sbjct  152  AACAACC  158
> AT1G51355.1 | Symbols: | unknown protein; FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown;
LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein
match is: unknown protein (TAIR:AT3G20898.1); Has 52 Blast
hits to 52 proteins in 9 species: Archae - 0; Bacteria -
0; Metazoa - 2; Fungi - 0; Plants - 50; Viruses - 0; Other Eukaryotes
- 0 (source: NCBI BLink). | chr1:19041397-19042146
FORWARD LENGTH=750
Length=750
Score = 89.8 bits (48), Expect = 1e-17
Identities = 92/111 (83%), Gaps = 12/111 (11%)
Strand=Plus/Minus
Query  83   GAGCAAAGAAAACGATCTT-TCTCCGT-CGTAACA-CACAGTTCTTTGTG--ACCCTTTG  137
            ||||||||||| |||| || ||||| | | ||| | | ||| | |||| |  ||| ||||
Sbjct  373  GAGCAAAGAAAGCGAT-TTGTCTCC-TCCTTAA-AGCGCAG-T-TTTGCGCCACCTTTTG  319
Query  138  CTTCTTCGGCGCCGGTGGACACGTCAGCATCTCCGGTATCC-TAGACTTCT  187
            ||||||||| ||||||||||||||||||||||||||||| | | |||||||
Sbjct  318  CTTCTTCGGTGCCGGTGGACACGTCAGCATCTCCGGTATTCGT-GACTTCT  269
Lambda      K        H
  1.33    0.621     1.12
Gapped
Lambda      K        H
  1.28    0.460    0.850
Effective search space used: 12516669501
Database: TAIR10_cdna_20110103_representative_gene_model_updated
Posted date: Sep 25, 2014 6:13 PM
Number of letters in database: 51,074,197
Number of sequences in database: 33,602
Matrix: blastn matrix 1 -2
Gap Penalties: Existence: 0, Extension: 2.5