BLASTN 2.2.26+
Reference:
Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000),
"A greedy algorithm for aligning DNA sequences", J Comput Biol 2000;
7(1-2):203-14.
Database: TAIR10_cdna_20110103_representative_gene_model_updated
33,602 sequences; 51,074,197 total letters
Query= Ahg917075
Length=324
                                                                  Score      E
Sequences producing significant alignments:                       (Bits)   Value
AT5G54062.1 | Symbols: | unknown protein; FUNCTIONS IN: molecu...   381    2e-105
AT5G53905.1 | Symbols: | unknown protein; CONTAINS InterPro DO...   213    8e-55
AT5G53742.1 | Symbols: | Protein of unknown function (DUF1278)...   174    4e-43
> AT5G54062.1 | Symbols: | unknown protein; FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown;
LOCATED IN: endomembrane system; CONTAINS InterPro DOMAIN/s:
Protein of unknown function DUF1278 (InterPro:IPR010701);
BEST Arabidopsis thaliana protein match is: Protein of unknown
function (DUF1278) (TAIR:AT5G53742.1); Has 30201 Blast
hits to 17322 proteins in 780 species: Archae - 12; Bacteria
- 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses
- 0; Other Eukaryotes - 2996 (source: NCBI BLink). | chr5:21938793-21939623
FORWARD LENGTH=831
Length=831
Score = 381 bits (206), Expect = 2e-105
Identities = 263/289 (91%), Gaps = 10/289 (3%)
Strand=Plus/Plus
Query  1    TCTCGCAAAATGTTGGTCATCGCTCATC-ACATTCAAGGTTGCGAA-GTCGAAATCTTTA  58
            ||||||||||||||||||||||||| || ||||||| |||||| ||  ||||||||||||
Sbjct  196  TCTCGCAAAATGTTGGTCATCGCTCCTCAACATTCACGGTTGC-AATATCGAAATCTTTA  254
Query  59   AATCTGTTTTAACCGGTAAGATTGAAAACGTTGGA-CCAACATGCTGCAAGGCGTTTACG  117
            |||||||||||||||||||| |||||||||||||| || || ||||||||||| |||||
Sbjct  255  AATCTGTTTTAACCGGTAAGTTTGAAAACGTTGGATCC-ACGTGCTGCAAGGCTTTTAC-  312
Query  118  A-AAGTGGATGCAAACTGTTGGCCAAAAATGTTTTCGTTGAATCCGTTATTCCCTCCTCT  176
            | ||||||||||||| |||||||||||||||||| |||||||||| ||||||||||||||
Sbjct  313  AGAAGTGGATGCAAAGTGTTGGCCAAAAATGTTTCCGTTGAATCCATTATTCCCTCCTCT  372
Query  177  TCTCAAGGATGGTTGCTCTCGCATCATCGCAGGTGCACCCGCTGCACACACGACACCTCA  236
            |||||||||||||||||||||||||||| |||||||||| ||  |||| | |||||| ||
Sbjct  373  TCTCAAGGATGGTTGCTCTCGCATCATCTCAGGTGCACCAGCA-CACA-A-GACACCGCA  429
Query  237  GTTCCCTGTCATCTCTGGTTCTCCGGTCGATCTCACAAAATGTTTGTCA  285
             |||||||||||| ||||||||||| |||||||||||||||||||||||
Sbjct  430  ATTCCCTGTCATCCCTGGTTCTCCGATCGATCTCACAAAATGTTTGTCA  478
> AT5G53905.1 | Symbols: | unknown protein; CONTAINS InterPro
DOMAIN/s: Protein of unknown function DUF1278 (InterPro:IPR010701);
BEST Arabidopsis thaliana protein match is: unknown
protein (TAIR:AT5G54062.1); Has 30201 Blast hits to 17322 proteins
in 780 species: Archae - 12; Bacteria - 1396; Metazoa
- 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes
- 2996 (source: NCBI BLink). | chr5:21888148-21889254
REVERSE LENGTH=507
Length=507
Score = 213 bits (115), Expect = 8e-55
Identities = 213/256 (83%), Gaps = 23/256 (9%)
Strand=Plus/Plus
Query  1    TCTCGCAAAATGTTGGTCATCGCTCATC-ACATTCAAGGTTGCGAA-GTCGAAATCTTT-  57
            |||| |||||||||||||||||||| || |  ||||||||||| ||  ||||||| |||
Sbjct  72   TCTCACAAAATGTTGGTCATCGCTCTTCAATGTTCAAGGTTGC-AATATCGAAAT-TTTA  129
Query  58   AAATCTGTTTTAACCGGTAAGATTGAAAACGTTGGACCAACATGCTGCAAGGCGTTTACG  117
            |||| ||||||||||| ||||  |       || |   || | ||||||| |||||||||
Sbjct  130  AAATATGTTTTAACCGCTAAG--T-------TT-G---AA-A-GCTGCAAAGCGTTTACG  174
Query  118  AAAGTGGATGCAAACTGTTGGCCAAAAATGTTTTCGTTGAATCCGTTATTCCCTCCTCTT  177
             |||||||||||||||||||||||||||||||| |  ||||||| || ||||||||||||
Sbjct  175  GAAGTGGATGCAAACTGTTGGCCAAAAATGTTTCCACTGAATCCATTTTTCCCTCCTCTT  234
Query  178  CTCAAGGATGGTTGCTCTCGCATCATCGCAGGTGCACCCGCTGCACACACGACACCTCAG  237
             |||||||||||||||||||||||||| || ||||||||||   |||||||||||||||
Sbjct  235  GTCAAGGATGGTTGCTCTCGCATCATCTCAAGTGCACCCGC---ACACACGACACCTCAA  291
Query  238  TTCCCTGTCATCTCTG  253
             ||||||||||| |||
Sbjct  292  CTCCCTGTCATCCCTG  307
> AT5G53742.1 | Symbols: | Protein of unknown function (DUF1278)
| chr5:21813671-21814018 FORWARD LENGTH=348
Length=348
Score = 174 bits (94), Expect = 4e-43
Identities = 126/141 (89%), Gaps = 3/141 (2%)
Strand=Plus/Plus
Query  145  ATGTTTTCGTTGAATCCGTTATTCCCTCCTCTTCTCAAGGATGGTTGCTCTCGCATCATC  204
            |||||| |||||||||| ||||||||||||||||||||||||||||||||||||||||||
Sbjct  1    ATGTTTCCGTTGAATCCATTATTCCCTCCTCTTCTCAAGGATGGTTGCTCTCGCATCATC  60
Query  205  GCAGGTGCACCCGCTGCACACACGACACCTCAGTTCCCTGTCATCTCTGGTTCTCCGGTC  264
             | || |||||  |   |||||||| |||||| |||| ||||||| ||||||||||| ||
Sbjct  61   TCTGGCGCACCAAC---ACACACGAAACCTCAATTCCTTGTCATCCCTGGTTCTCCGATC  117
Query  265  GATCTCACAAAATGTTTGTCA  285
            |||||||||||||||||||||
Sbjct  118  GATCTCACAAAATGTTTGTCA  138
Lambda   K      H
1.33     0.621  1.12
Gapped
Lambda   K      H
1.28     0.460  0.850
Effective search space used: 15080324700
Database: TAIR10_cdna_20110103_representative_gene_model_updated
Posted date: Sep 25, 2014 6:13 PM
Number of letters in database: 51,074,197
Number of sequences in database: 33,602
Matrix: blastn matrix 1 -2
Gap Penalties: Existence: 0, Extension: 2.5