BLASTN 2.2.26+
Reference:
Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000),
"A greedy algorithm for aligning DNA sequences", J Comput Biol 2000;
7(1-2):203-14.
Database: TAIR10_cdna_20110103_representative_gene_model_updated
33,602 sequences; 51,074,197 total letters
Query= Ahg923815
Length=278
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value
AT1G52825.1 | Symbols: | unknown protein; FUNCTIONS IN: molecu...     353    4e-97
AT4G14615.1 | Symbols: | unknown protein; FUNCTIONS IN: molecu...     244    2e-64
> AT1G52825.1 | Symbols: | unknown protein; FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown;
LOCATED IN: endomembrane system; CONTAINS InterPro DOMAIN/s:
Protein of unknown function DUF2346 (InterPro:IPR018625);
BEST Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G14615.1); Has 52 Blast hits to 49 proteins in
19 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0;
Plants - 51; Viruses - 0; Other Eukaryotes - 1 (source: NCBI
BLink). | chr1:19670489-19670967 REVERSE LENGTH=479
Length=479
Score = 353 bits (191), Expect = 4e-97
Identities = 242/266 (91%), Gaps = 6/266 (2%)
Strand=Plus/Plus
Query  5    TTGGAGTCTCTTACAAATGTCGTCAGTAGGAACATCAAAGGGGATTCTTGAAATCGCCAA  64
            |||||||||| |||||||||| |||||||||||||||||||||||||| |||||||||||
Sbjct  58   TTGGAGTCTCATACAAATGTCATCAGTAGGAACATCAAAGGGGATTCTAGAAATCGCCAA  117

Query  65   ATTCGGTGTATACGTCGCTGTTCCGATCGTCCTAATGTATACATTCGCCAACAACAGCAC  124
            |||||||||||||||||| || || ||||||||||| |||||||||||||||||||||||
Sbjct  118  ATTCGGTGTATACGTCGCAGTACCAATCGTCCTAATCTATACATTCGCCAACAACAGCAC  177

Query  125  CAATATCAAGAAATTCATGGGCAATCATTCATACGTTGTTTATCCTAAAGAAGCTCCTCG  184
            ||| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  178  CAACATCAAGAAATTCATGGGCAATCATTCATACGTTGTTTATCCTAAAGAAGCTCCTCG  237

Query  185  TCCTCCTTCGCCTGAAGAGCTACGAGAGATGGCTAAAGAAATTGCTCGCAACAAGAACA-  243
            ||||||||| |||||||||||||| |||||||||||| || |||| | ||| || || |
Sbjct  238  TCCTCCTTCACCTGAAGAGCTACGTGAGATGGCTAAACAACTTGCCCTCAAAAAAAAAAA  297

Query  244  ---TCCCTTCAAATTGATTCTACTTG  266
               ||||||| ||||||| ||| |||
Sbjct  298  AAATCCCTTCTAATTGAT-CTA-TTG  321
> AT4G14615.1 | Symbols: | unknown protein; FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown;
LOCATED IN: endomembrane system; EXPRESSED IN: 24 plant
structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro
DOMAIN/s: Protein of unknown function DUF2346 (InterPro:IPR018625);
BEST Arabidopsis thaliana protein match is: unknown
protein (TAIR:AT1G52825.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other
Eukaryotes - 2996 (source: NCBI BLink). | chr4:8383764-8385079
FORWARD LENGTH=574
Length=574
Score = 244 bits (132), Expect = 2e-64
Identities = 198/230 (86%), Gaps = 4/230 (2%)
Strand=Plus/Plus
Query  19   AAATGTCGTCAGTAGGAACATC-AAAGGGGATTCTTGAAATCGCCAAATTCGGTGTATAC  77
            ||||||| || || |||||||| ||||||| |||| || |||| ||| |||||||| ||
Sbjct  77   AAATGTCATCTGTTGGAACATCGAAAGGGG-TTCTGGAGATCGTCAAGTTCGGTGTCTAT  135

Query  78   GTCGCTGTTCCGATCGTCCTAATGTATACATTCGCCAACAACAGCACCAATATCAAGAAA  137
            |||||||||||||||||||| ||||| |||||||||||||||||||||||||||||||||
Sbjct  136  GTCGCTGTTCCGATCGTCCTTATGTACACATTCGCCAACAACAGCACCAATATCAAGAAA  195

Query  138  TTCATGGGCAATCATTCATACGTTGTTTATCCTAAAGAAGCTCCTCGTCCTCCTTCGCCT  197
            ||||||||||||| |||||| |||||||||||| |||| || ||| | |||||||| ||
Sbjct  196  TTCATGGGCAATCGTTCATATGTTGTTTATCCTGAAGAGGCACCTAGACCTCCTTCACCC  255

Query  198  GAAGAGCTACGAGAGATGGCTAA-AGAAATTGCTCGCAACAAGAACATCC  246
            || |||||| |||||||||| |  |||  |||| || || ||||||||||
Sbjct  256  GATGAGCTAAGAGAGATGGC-ACGAGAGCTTGCCCGTAAGAAGAACATCC  304
Lambda      K        H
   1.33    0.621     1.12
Gapped
Lambda      K        H
   1.28    0.460    0.850
Effective search space used: 12768008246
Database: TAIR10_cdna_20110103_representative_gene_model_updated
Posted date: Sep 25, 2014 6:13 PM
Number of letters in database: 51,074,197
Number of sequences in database: 33,602
Matrix: blastn matrix 1 -2
Gap Penalties: Existence: 0, Extension: 2.5
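
The bit scores and Expect values in this report can be cross-checked against the statistical parameters printed above. The following is a minimal sketch, not part of the BLAST output: it assumes the standard Karlin-Altschul relations (bit score = (Lambda * S - ln K) / ln 2, E = search space * 2^-bits), and it assumes the "Gapped" Lambda and K apply because both alignments contain gaps. The function names are illustrative only.

```python
import math

# Values copied from the report above. Using the gapped parameters is an
# assumption (both reported alignments contain gaps).
GAPPED_LAMBDA = 1.28
GAPPED_K = 0.460
SEARCH_SPACE = 12768008246  # "Effective search space used"


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, space=SEARCH_SPACE, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Expected number of chance hits with at least this raw score."""
    return space * 2.0 ** (-bit_score(raw_score, lam, k))


# Raw scores are the parenthesised values on the "Score =" lines.
for name, raw in (("AT1G52825.1", 191), ("AT4G14615.1", 132)):
    print(f"{name}: raw={raw} bits={bit_score(raw):.1f} E={expect_value(raw):.0e}")
```

Because Lambda and K are printed with limited precision, the recomputed bit scores (about 353.8 and 244.9) only approximate the internal values; their integer parts agree with the reported 353 and 244, and the recomputed Expect values round to the reported 4e-97 and 2e-64.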