BLASTP 2.2.5 [Nov-16-2002]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= ci0100131919
(462 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
1,798,171 sequences; 593,787,773 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|28175808|gb|AAH43028.1| TAF9 RNA polymerase II, TATA box... 170 5e-41
gi|18079237|ref|NP_081415.1| TAF9 RNA polymerase II, TATA b... 170 5e-41
gi|4507351|ref|NP_003178.1| TBP-associated factor 9; TAF9 R... 168 2e-40
gi|13097291|gb|AAH03400.1| TBP-associated factor 9 [Homo sa... 168 2e-40
gi|33086648|gb|AAP92636.1| Cb1-739 [Rattus norvegicus] 168 3e-40
gi|34576553|ref|NP_908937.1| TAF9 RNA polymerase II, TATA b... 165 2e-39
gi|20070280|ref|NP_057059.2| TBP-associated factor 9L; neur... 165 2e-39
gi|4689154|gb|AAD27786.1| neuronal cell death-related prote... 165 2e-39
gi|47086725|ref|NP_997822.1| general transcription factor I... 164 4e-39
gi|38086493|ref|XP_284729.2| similar to TBP-associated fact... 163 8e-39
gi|30186181|gb|AAH51635.1| Similar to TAF9-like RNA polymer... 163 8e-39
gi|19424332|ref|NP_598299.1| TAF9-like RNA polymerase II, T... 162 2e-38
gi|42542429|gb|AAH66223.1| Unknown (protein for MGC:76785) ... 162 2e-38
gi|26364308|dbj|BAB26216.2| unnamed protein product [Mus mu... 159 2e-37
gi|7706735|ref|NP_056943.1| TFIIA alpha, p55 isoform 1; gen... 156 8e-37
gi|31207633|ref|XP_312783.1| ENSANGP00000020370 [Anopheles ... 156 1e-36
gi|34099896|gb|AAP44969.1| transcription factor IIA large s... 154 3e-36
gi|11559984|ref|NP_071544.1| general transcription factor I... 154 5e-36
gi|17136922|ref|NP_476996.1| CG5930-PB [Drosophila melanoga... 149 1e-34
gi|17136920|ref|NP_476995.1| CG5930-PA [Drosophila melanoga... 149 2e-34
gi|1711662|sp|P52654|T2AA_DROME TRANSCRIPTION INITIATION FA... 147 5e-34
gi|34099900|gb|AAP44971.1| transcription factor IIA large s... 145 1e-33
gi|33312518|gb|AAQ04072.1| transcription factor IIA large s... 144 3e-33
gi|27819943|gb|AAL28886.2| LD26511p [Drosophila melanogaster] 141 3e-32
gi|45549141|ref|NP_523391.3| CG6474-PA [Drosophila melanoga... 141 3e-32
gi|17562510|ref|NP_504355.1| prion-like Q/N-rich domain pro... 122 1e-26
gi|39583968|emb|CAE64058.1| Hypothetical protein CBG08658 [... 122 2e-26
gi|15221048|ref|NP_175816.1| transcription initiation facto... 119 1e-25
gi|42733663|gb|AAS38620.1| similar to F55B11.3.p [Caenorhab... 118 2e-25
gi|13878225|ref|NP_113568.1| general transcription factor I... 112 2e-23
gi|42476101|ref|NP_963889.1| TFIIA alpha, p55 isoform 2; ge... 110 8e-23
gi|30141908|ref|NP_780544.1| general transcription factor I... 107 4e-22
gi|19075146|ref|NP_586747.1| TRANSCRIPTION INITIATION FACTO... 105 3e-21
gi|46102011|gb|EAK87244.1| hypothetical protein UM06387.1 [... 104 5e-21
gi|19113805|ref|NP_592893.1| hypothetical protein [Schizosa... 102 2e-20
gi|30017563|gb|AAP12985.1| putative transcription initiatio... 100 5e-20
gi|24650582|ref|NP_733208.1| CG5930-PC [Drosophila melanoga... 100 9e-20
gi|38492544|pdb|1NVP|C Chain C, Human TfiiaTBPDNA COMPLEX 100 9e-20
gi|32394470|gb|AAM93933.1| transcriptional activation facto... 97 1e-18
gi|34862156|ref|XP_237507.2| similar to general transcripti... 96 1e-18
gi|26787968|ref|NP_006863.2| TFIIA-alpha/beta-like factor i... 94 5e-18
gi|29553944|ref|NP_758515.1| stoned B/TFIIA-alpha/beta-like... 94 5e-18
gi|33312516|gb|AAQ04071.1| TFIIAa/b-like factor [Xenopus la... 94 5e-18
gi|46442061|gb|EAL01353.1| hypothetical protein CaO19.10197... 94 6e-18
gi|12963759|ref|NP_076119.1| general transcription factor I... 94 8e-18
gi|34098596|sp|Q8R4I4|T2AY_MOUSE TFIIA-alpha and beta like ... 94 8e-18
gi|13124585|sp|Q9UNN4|T2AY_HUMAN TFIIA-alpha and beta like ... 92 2e-17
gi|40555809|gb|AAH64585.1| ALF protein [Homo sapiens] 92 2e-17
gi|5091669|gb|AAD39617.1| SALF [Homo sapiens] 92 2e-17
gi|1942936|pdb|1TAF|A Chain A, Drosophila Tbp Associated Fa... 89 3e-16
gi|6324768|ref|NP_014837.1| Transcription factor IIA, large... 86 2e-15
gi|46107550|ref|XP_380834.1| hypothetical protein FG00658.1... 83 1e-14
gi|32422741|ref|XP_331814.1| hypothetical protein [Neurospo... 82 3e-14
gi|18390773|ref|NP_563790.1| transcription factor IIA large... 81 4e-14
gi|46097824|gb|EAK83057.1| hypothetical protein UM05183.1 [... 80 1e-13
gi|30680148|ref|NP_850937.1| transcription factor IIA large... 79 2e-13
gi|6323892|ref|NP_013963.1| Subunit (17 kDa) of TFIID and S... 79 3e-13
gi|24059851|dbj|BAC21319.1| hypothetical protein [Oryza sat... 77 1e-12
gi|45199045|ref|NP_986074.1| AFR527Wp [Eremothecium gossypi... 75 3e-12
gi|25406976|pir||D86209 protein F22G5.18 [imported] - Arabi... 75 3e-12
gi|38492543|pdb|1NVP|B Chain B, Human TfiiaTBPDNA COMPLEX 74 9e-12
gi|17555056|ref|NP_499814.1| TBP-Associated transcription F... 73 1e-11
gi|46432465|gb|EAK91945.1| hypothetical protein CaO19.8708 ... 72 2e-11
gi|46432443|gb|EAK91924.1| hypothetical protein CaO19.1111 ... 72 2e-11
gi|40746468|gb|EAA65624.1| hypothetical protein AN0794.2 [A... 72 3e-11
gi|39591770|emb|CAE71348.1| Hypothetical protein CBG18251 [... 70 7e-11
gi|19112462|ref|NP_595670.1| putative transcription initiat... 70 1e-10
gi|32420261|ref|XP_330574.1| hypothetical protein [Neurospo... 68 5e-10
gi|26787966|ref|NP_751946.1| TFIIA-alpha/beta-like factor i... 64 9e-09
gi|46107964|ref|XP_381040.1| hypothetical protein FG00864.1... 61 6e-08
gi|37526068|ref|NP_929412.1| hypothetical protein [Photorha... 61 6e-08
gi|38109765|gb|EAA55584.1| hypothetical protein MG01235.4 [... 60 8e-08
gi|40746490|gb|EAA65646.1| hypothetical protein AN0816.2 [A... 60 1e-07
gi|45201324|ref|NP_986894.1| AGR228Cp [Eremothecium gossypi... 59 2e-07
gi|38109853|gb|EAA55659.1| hypothetical protein MG01310.4 [... 59 2e-07
gi|17065294|gb|AAL32801.1| similar to TFIIA [Arabidopsis th... 59 2e-07
gi|1633309|pdb|1YTF|C Chain C, Yeast TfiiaTBPDNA COMPLEX >g... 59 2e-07
gi|1429226|emb|CAA67368.1| TFIIA [Arabidopsis thaliana] 58 4e-07
gi|15237841|ref|NP_200731.1| transcription factor-related [... 56 2e-06
gi|1633308|pdb|1YTF|B Chain B, Yeast TfiiaTBPDNA COMPLEX >g... 50 8e-05
gi|22994332|ref|ZP_00038840.1| hypothetical protein [Xylell... 49 2e-04
gi|29893524|gb|AAN16518.1| circumsporozoite protein [Plasmo... 49 2e-04
gi|24580579|ref|NP_722615.1| CG18497-PA [Drosophila melanog... 49 2e-04
gi|6979936|gb|AAF34661.1| split ends long isoform [Drosophi... 49 2e-04
gi|24580583|ref|NP_722616.1| CG18497-PC [Drosophila melanog... 49 2e-04
gi|6467825|gb|AAF13218.1| Spen RNP motif protein long isofo... 49 2e-04
gi|24580581|ref|NP_524718.2| CG18497-PB [Drosophila melanog... 49 2e-04
gi|26247178|ref|NP_753218.1| Minor curlin subunit precursor... 47 7e-04
gi|24112441|ref|NP_706951.1| minor curlin subunit precursor... 47 7e-04
gi|46112495|ref|ZP_00180737.2| COG1322: Uncharacterized pro... 47 7e-04
gi|20148954|gb|AAM12730.1| TFIIA alpha/beta-like protein AL... 47 7e-04
>gi|28175808|gb|AAH43028.1| TAF9 RNA polymerase II, TATA box binding protein (TBP)-associated
factor [Mus musculus]
Length = 264
Score = 170 bits (431), Expect = 5e-41
Identities = 77/144 (53%), Positives = 112/144 (77%), Gaps = 5/144 (3%)
Query: 1 KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
K++PKDA+ + +L+DMG+TEYEP+V++Q+LEF +RY++ +LD+AK+Y++HA ++ ++ D
Sbjct: 10 KSMPKDAQMMAQILKDMGITEYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKATVDAD 69
Query: 61 DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
D +LA+ + D SFT+PPPRDFL+D+ARQ+N LPL+K YS RLPPDRYCL + NYRL
Sbjct: 70 DVRLAIQCRADQSFTSPPPRDFLLDIARQRNQTPLPLIKPYSGPRLPPDRYCLTAPNYRL 129
Query: 121 KGSADKKVSSMAKAPGGQNSVAKL 144
K S KK A AP G+ +V +L
Sbjct: 130 K-SLQKK----APAPAGRITVPRL 148
>gi|18079237|ref|NP_081415.1| TAF9 RNA polymerase II, TATA box binding protein (TBP)-associated
factor; TAF9 RNA polymerase II, TATA box binding protein
(TBP)-associated factor, 32 kDa [Mus musculus]
gi|34222930|sp|Q8VI33|TAF9_MOUSE Transcription initiation factor TFIID subunit 9 (Transcription
initiation factor TFIID 31 kDa subunit) (TAFII-31)
(TAFII-32) (TAFII32)
gi|17901798|gb|AAL47693.1| RNA polymerase II TATA box binding protein-associated factor G [Mus
musculus]
gi|27503789|gb|AAH42723.1| TAF9 RNA polymerase II, TATA box binding protein (TBP)-associated
factor [Mus musculus]
Length = 264
Score = 170 bits (431), Expect = 5e-41
Identities = 77/144 (53%), Positives = 112/144 (77%), Gaps = 5/144 (3%)
Query: 1 KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
K++PKDA+ + +L+DMG+TEYEP+V++Q+LEF +RY++ +LD+AK+Y++HA ++ ++ D
Sbjct: 10 KSMPKDAQMMAQILKDMGITEYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKATVDAD 69
Query: 61 DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
D +LA+ + D SFT+PPPRDFL+D+ARQ+N LPL+K YS RLPPDRYCL + NYRL
Sbjct: 70 DVRLAIQCRADQSFTSPPPRDFLLDIARQRNQTPLPLIKPYSGPRLPPDRYCLTAPNYRL 129
Query: 121 KGSADKKVSSMAKAPGGQNSVAKL 144
K S KK A AP G+ +V +L
Sbjct: 130 K-SLQKK----APAPAGRITVPRL 148
>gi|4507351|ref|NP_003178.1| TBP-associated factor 9; TAF9 RNA polymerase II, TATA box binding
protein (TBP)-associated factor, 32 kD; TATA box binding
protein (TBP)-associated factor, RNA polymerase II, G,
32kD; transcription initiation factor TFIID 31 kD
subunit; adrenal gland protein AD-004 [Homo sapiens]
gi|2498981|sp|Q16594|TAF9_HUMAN Transcription initiation factor TFIID subunit 9 (Transcription
initiation factor TFIID 31 kDa subunit) (TAFII-31)
(TAFII-32) (TAFII32) (STAF31/32)
gi|2136305|pir||I39141 transcription factor TFIID 32K chain TAFII32 - human
gi|841308|gb|AAC50153.1| TAFII32 precursor
gi|882393|gb|AAA91318.1| TBP-associated factor TAFII31
gi|940151|gb|AAA84389.1| TAFII31
gi|26892203|gb|AAN84793.1| TAF9 RNA polymerase II, TATA box binding protein (TBP)-associated
factor, 32kDa [Homo sapiens]
gi|1098347|prf||2115400A transcription factor IID:SUBUNIT=32kD
Length = 264
Score = 168 bits (426), Expect = 2e-40
Identities = 73/132 (55%), Positives = 106/132 (80%), Gaps = 1/132 (0%)
Query: 1 KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
K++PKDA+ + +L+DMG+TEYEP+V++Q+LEF +RY++ +LD+AK+Y++HA ++ ++ D
Sbjct: 10 KSMPKDAQMMAQILKDMGITEYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKATVDAD 69
Query: 61 DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
D +LA+ + D SFT+PPPRDFL+D+ARQ+N LPL+K YS RLPPDRYCL + NYRL
Sbjct: 70 DVRLAIQCRADQSFTSPPPRDFLLDIARQRNQTPLPLIKPYSGPRLPPDRYCLTAPNYRL 129
Query: 121 KGSADKKVSSMA 132
K S KK S+ A
Sbjct: 130 K-SLQKKASTSA 140
>gi|13097291|gb|AAH03400.1| TBP-associated factor 9 [Homo sapiens]
Length = 264
Score = 168 bits (426), Expect = 2e-40
Identities = 73/132 (55%), Positives = 106/132 (80%), Gaps = 1/132 (0%)
Query: 1 KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
K++PKDA+ + +L+DMG+TEYEP+V++Q+LEF +RY++ +LD+AK+Y++HA ++ ++ D
Sbjct: 10 KSMPKDAQMMAQILKDMGITEYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKATVDAD 69
Query: 61 DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
D +LA+ + D SFT+PPPRDFL+D+ARQ+N LPL+K YS RLPPDRYCL + NYRL
Sbjct: 70 DVRLAIQCRADQSFTSPPPRDFLLDIARQRNQTPLPLIKPYSGPRLPPDRYCLTAPNYRL 129
Query: 121 KGSADKKVSSMA 132
K S KK S+ A
Sbjct: 130 K-SLQKKASTSA 140
>gi|33086648|gb|AAP92636.1| Cb1-739 [Rattus norvegicus]
Length = 259
Score = 168 bits (425), Expect = 3e-40
Identities = 76/144 (52%), Positives = 110/144 (76%), Gaps = 5/144 (3%)
Query: 1 KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
K++PKDA+ + +L+DMG+TEYEP+V++Q+LEF +RY++ +LD+AK+Y++HA + ++ D
Sbjct: 5 KSMPKDAQMMAQILKDMGITEYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKPTVDAD 64
Query: 61 DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
D +LA+ + D SFT+PPPRDFL+D+ARQ+N LPL+K YS RLPPDRYCL + NYRL
Sbjct: 65 DVRLAIQCRADQSFTSPPPRDFLLDIARQRNQTPLPLIKPYSGPRLPPDRYCLTAPNYRL 124
Query: 121 KGSADKKVSSMAKAPGGQNSVAKL 144
K S KK A P G+ +V +L
Sbjct: 125 K-SLQKK----APTPAGRITVPRL 143
>gi|34576553|ref|NP_908937.1| TAF9 RNA polymerase II, TATA box binding protein (TBP)-associated
factor [Rattus norvegicus]
gi|34068042|gb|AAQ56726.1| liver regeneration related protein; LRRG210 [Rattus norvegicus]
Length = 259
Score = 165 bits (417), Expect = 2e-39
Identities = 75/144 (52%), Positives = 109/144 (75%), Gaps = 5/144 (3%)
Query: 1 KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
K++PKDA+ + +L+DMG+TEYEP+V++Q+LEF RY++ +LD+AK+Y++HA + ++ D
Sbjct: 5 KSMPKDAQMMAQILKDMGITEYEPRVINQMLEFALRYVTTILDDAKIYSSHAKKPTVDAD 64
Query: 61 DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
D +L + + D+SFT+PPPRDFL+D+ARQ+N LPL+K YS RLPPDRYCL + NYRL
Sbjct: 65 DVRLPIQCRADHSFTSPPPRDFLLDIARQRNQTPLPLIKPYSGPRLPPDRYCLTAPNYRL 124
Query: 121 KGSADKKVSSMAKAPGGQNSVAKL 144
K S KK A P G+ +V +L
Sbjct: 125 K-SLQKK----APTPAGRITVPRL 143
>gi|20070280|ref|NP_057059.2| TBP-associated factor 9L; neuronal cell death-related protein;
TAF9-like RNA polymerase II, TATA box binding protein
(TBP)-associated factor, 31 kD; neuronal cell
death-related gene in neuron-7; transcription associated
factor TAFII31L; transcription initiation factor IID,
31kD subunit [Homo sapiens]
gi|9963821|gb|AAG09711.1| transcription associated factor TAFII31L [Homo sapiens]
gi|14714449|gb|AAH10350.1| TBP-associated factor 9L [Homo sapiens]
gi|16306983|gb|AAH09566.1| TBP-associated factor 9L [Homo sapiens]
Length = 251
Score = 165 bits (417), Expect = 2e-39
Identities = 70/126 (55%), Positives = 99/126 (78%)
Query: 1 KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
KN P+DA + +L+DMG+TEYEP+V++Q+LEF +RY++ +LD+AK+Y++HA + N++ D
Sbjct: 10 KNAPRDALVMAQILKDMGITEYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKPNVDAD 69
Query: 61 DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
D +LA+ + D SFT+PPPRDFL+D+ARQKN LPL+K Y+ RLPPDRYCL + NYRL
Sbjct: 70 DVRLAIQCRADQSFTSPPPRDFLLDIARQKNQTPLPLIKPYAGPRLPPDRYCLTAPNYRL 129
Query: 121 KGSADK 126
K K
Sbjct: 130 KSLIKK 135
>gi|4689154|gb|AAD27786.1| neuronal cell death-related protein [Homo sapiens]
Length = 251
Score = 165 bits (417), Expect = 2e-39
Identities = 70/126 (55%), Positives = 99/126 (78%)
Query: 1 KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
KN P+DA + +L+DMG+TEYEP+V++Q+LEF +RY++ +LD+AK+Y++HA + N++ D
Sbjct: 10 KNAPRDALVMAQILKDMGITEYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKPNVDAD 69
Query: 61 DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
D +LA+ + D SFT+PPPRDFL+D+ARQKN LPL+K Y+ RLPPDRYCL + NYRL
Sbjct: 70 DVRLAIQCRADQSFTSPPPRDFLLDIARQKNQTPLPLIKPYAGPRLPPDRYCLTAPNYRL 129
Query: 121 KGSADK 126
K K
Sbjct: 130 KSLIKK 135
>gi|47086725|ref|NP_997822.1| general transcription factor IIA, 1, 19/37kDa [Danio rerio]
gi|29124431|gb|AAH48894.1| Wu:fc15h02 protein [Danio rerio]
Length = 369
Score = 164 bits (415), Expect = 4e-39
Identities = 136/397 (34%), Positives = 173/397 (43%), Gaps = 108/397 (27%)
Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNE---LR 195
N V KLY++V+EDVI +RE FLD+GVDE VL ELK WE KL QSKAV+ E +
Sbjct: 8 NQVPKLYRSVMEDVIAEVRELFLDEGVDEQVLMELKTLWESKLMQSKAVDGFHTEEQQVL 67
Query: 196 HAQS-----------VMPLYVYANSMQGDKSQQPLVYHT----------MTAAGQQAAMS 234
HAQ P V Q QQ +V M+AA A ++
Sbjct: 68 HAQQQQAQAQQTQQQTQPQQVLLPPAQTTPQQQVIVQDAKIMQHMSATGMSAAATAATLA 127
Query: 235 LPNSVLY---------------------------QRQQHPNQQVYAPYQPG---TSTNQQ 264
LP V Q+Q QQV QPG QQ
Sbjct: 128 LPTGVAPFQQLITPQGQILQVVRAANGAQYIIQPQQQMLLQQQVLPQMQPGGVQAPVIQQ 187
Query: 265 STS---GGGKQHPSVIIQAD-----GANDQRITQV---------DGANDQRITQVDGAND 307
+ GG Q VIIQ G + TQV G D ++ A
Sbjct: 188 VLTPLQGGLPQQTGVIIQPQQIVLTGNKVPQNTQVMQAAAMAVQSGPGDAQVQTQPQAQQ 247
Query: 308 QRITQVEEANDQ--RITQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDGANDQRI 365
Q+ Q ++A+ Q + QVDGA D + +E D+ D ++ DG D ++
Sbjct: 248 QQQQQQQQASQQPPMMLQVDGAGDTSSEEDEEEEDEYDEDEDEDKEK-----DGGEDGQV 302
Query: 366 TQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDDPVELFDTDNVVVCQY 425
+ PL S DD SD++ ELFDT+NVVVCQY
Sbjct: 303 EE------------------------------EPLNSGDDVSDEEDQELFDTENVVVCQY 332
Query: 426 DKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
DKIHR+K +WKF+LK GIMNLNG+D VF KA G+AEW
Sbjct: 333 DKIHRSKNKWKFHLKDGIMNLNGRDFVFSKAIGDAEW 369
>gi|38086493|ref|XP_284729.2| similar to TBP-associated factor 9L; neuronal cell death-related
protein; TAF9-like RNA polymerase II, TATA box binding
protein (TBP)-associated factor, 31 kD; neuronal cell
death-related gene in neuron-7; transcription associated
factor TAFII31L; ... [Mus musculus]
Length = 210
Score = 163 bits (412), Expect = 8e-39
Identities = 69/126 (54%), Positives = 98/126 (77%)
Query: 1 KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
KN P+DA + +L+DMG+TEYEP+V++Q+LEF +RY++ +LD+AK+Y++HA + ++ D
Sbjct: 10 KNAPRDALVMAQILKDMGITEYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKPTVDAD 69
Query: 61 DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
D +LA+ + D SFT+PPPRDFL+D+ARQKN LPL+K Y+ RLPPDRYCL + NYRL
Sbjct: 70 DVRLAIQCRADQSFTSPPPRDFLLDIARQKNQTPLPLIKPYAGPRLPPDRYCLTAPNYRL 129
Query: 121 KGSADK 126
K K
Sbjct: 130 KSLVKK 135
>gi|30186181|gb|AAH51635.1| Similar to TAF9-like RNA polymerase II, TATA box binding protein
(TBP)-associated factor, 31 kD [Mus musculus]
Length = 252
Score = 163 bits (412), Expect = 8e-39
Identities = 69/126 (54%), Positives = 98/126 (77%)
Query: 1 KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
KN P+DA + +L+DMG+TEYEP+V++Q+LEF +RY++ +LD+AK+Y++HA + ++ D
Sbjct: 13 KNAPRDALVMAQILKDMGITEYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKPTVDAD 72
Query: 61 DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
D +LA+ + D SFT+PPPRDFL+D+ARQKN LPL+K Y+ RLPPDRYCL + NYRL
Sbjct: 73 DVRLAIQCRADQSFTSPPPRDFLLDIARQKNQTPLPLIKPYAGPRLPPDRYCLTAPNYRL 132
Query: 121 KGSADK 126
K K
Sbjct: 133 KSLVKK 138
>gi|19424332|ref|NP_598299.1| TAF9-like RNA polymerase II, TATA box binding protein
(TBP)-associated factor, 31 kD; neuronal cell
death-related gene in neuron-7 [Rattus norvegicus]
gi|2498982|sp|Q62880|TAF9_RAT Transcription initiation factor TFIID subunit 9 (Transcription
initiation factor TFIID 31 kDa subunit) (TAFII-31)
(TAFII-32) (TAFII32) (Neuronal cell death related gene
in neuron 7) (DN-7)
gi|7514096|pir||JC5511 TATA-binding protein-associated factor II 31 - rat
gi|1103900|gb|AAC53201.1| induced upon programmed cell death in neuronally differentiated
PC12 cells
Length = 253
Score = 162 bits (409), Expect = 2e-38
Identities = 68/126 (53%), Positives = 98/126 (77%)
Query: 1 KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
KN P+DA + +L+DMG+T+YEP+V++Q+LEF +RY++ +LD+AK+Y++HA + ++ D
Sbjct: 5 KNAPRDALVMAQILKDMGITDYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKPTVDAD 64
Query: 61 DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
D +LA+ + D SFT+PPPRDFL+D+ARQKN LPL+K Y+ RLPPDRYCL + NYRL
Sbjct: 65 DVRLAIQCRADQSFTSPPPRDFLLDIARQKNQTPLPLIKPYAGPRLPPDRYCLTAPNYRL 124
Query: 121 KGSADK 126
K K
Sbjct: 125 KSLVKK 130
>gi|42542429|gb|AAH66223.1| Unknown (protein for MGC:76785) [Mus musculus]
Length = 249
Score = 162 bits (409), Expect = 2e-38
Identities = 69/126 (54%), Positives = 97/126 (76%)
Query: 1 KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
KN P+DA + +L+DMG TEYEP+V++Q+LEF +RY++ +LD+AK+Y++HA + ++ D
Sbjct: 10 KNAPRDALVMAQILKDMGTTEYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKPTVDAD 69
Query: 61 DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
D +LA+ + D SFT+PPPRDFL+D+ARQKN LPL+K Y+ RLPPDRYCL + NYRL
Sbjct: 70 DVRLAIQCRADQSFTSPPPRDFLLDIARQKNQTPLPLIKPYAGPRLPPDRYCLTAPNYRL 129
Query: 121 KGSADK 126
K K
Sbjct: 130 KSLVKK 135
>gi|26364308|dbj|BAB26216.2| unnamed protein product [Mus musculus]
Length = 247
Score = 159 bits (401), Expect = 2e-37
Identities = 72/132 (54%), Positives = 103/132 (78%), Gaps = 5/132 (3%)
Query: 13 VLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVDDTKLAVMSQLDN 72
+L+DMG+TEYEP+V++Q+LEF +RY++ +LD+AK+Y++HA ++ ++ DD +LA+ + D
Sbjct: 5 ILKDMGITEYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKATVDADDVRLAIQCRADQ 64
Query: 73 SFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRLKGSADKKVSSMA 132
SFT+PPPRDFL+D+ARQ+N LPL+K YS RLPPDRYCL + NYRLK S KK A
Sbjct: 65 SFTSPPPRDFLLDIARQRNQTPLPLIKPYSGPRLPPDRYCLTAPNYRLK-SLQKK----A 119
Query: 133 KAPGGQNSVAKL 144
AP G+ +V +L
Sbjct: 120 PAPAGRITVPRL 131
>gi|7706735|ref|NP_056943.1| TFIIA alpha, p55 isoform 1; general transcription factor IIA, 1
(37kD and 19kD subunits); TFIIA alpha, p55 [Homo
sapiens]
gi|1711663|sp|P52655|T2AA_HUMAN Transcription initiation factor IIA alpha and beta chains (TFIIA
p35 and p19 subunits) (TFIIA-42) (TFIIAL)
gi|627655|pir||A49077 transcription initiation factor IIA alpha chain precursor - human
gi|433500|emb|CAA53151.1| TFIIA [Homo sapiens]
gi|452272|emb|CAA54442.1| TFIIA/alpha, p55 [Homo sapiens]
gi|727197|dbj|BAA03604.1| 'TFIIA-42' [Homo sapiens]
gi|6721137|gb|AAF26776.1| TFIIA-42 [Homo sapiens]
Length = 376
Score = 156 bits (395), Expect = 8e-37
Identities = 135/405 (33%), Positives = 174/405 (42%), Gaps = 117/405 (28%)
Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNELR--- 195
N+V KLY++VIEDVIN++R+ FLDDGVDE VL ELK WE KL QS+AV+ +E +
Sbjct: 8 NTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFHSEEQQLL 67
Query: 196 ------------------HAQSVMPLYVYANSMQGDK-----SQQP-----------LVY 221
H Q P Q + SQQ L+
Sbjct: 68 LQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLIQ 127
Query: 222 H----TMTAAGQQAAMSLPNSV-------------------------LYQRQQHP--NQQ 250
H M+AA A ++LP V ++Q QQ QQ
Sbjct: 128 HMNASNMSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQSVVLQQQ 187
Query: 251 VYAPYQPG---TSTNQQSTS---GGGKQHPSVIIQAD-----GANDQRITQVDGANDQRI 299
V QPG QQ + GG VIIQ G Q I A
Sbjct: 188 VIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVAAPTPAQ 247
Query: 300 TQVDGANDQRITQVEEANDQR--ITQVDGANDQRITQVDEANDQRITQVDGANDQRITQV 357
Q+ Q+ Q + A Q + QVDG D T +E D+ D + + +
Sbjct: 248 AQITATGQQQ-PQAQPAQTQAPLVLQVDGTGD---TSSEEDEDEEEDYDDDEEEDK--EK 301
Query: 358 DGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDDPVELFDT 417
DGA D ++ + PL S DD SD++ ELFDT
Sbjct: 302 DGAEDGQVEE------------------------------EPLNSEDDVSDEEGQELFDT 331
Query: 418 DNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
+NVVVCQYDKIHR+K +WKF+LK GIMNLNG+D +F KA G+AEW
Sbjct: 332 ENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 376
>gi|31207633|ref|XP_312783.1| ENSANGP00000020370 [Anopheles gambiae]
gi|21296316|gb|EAA08461.1| ENSANGP00000020370 [Anopheles gambiae str. PEST]
Length = 266
Score = 156 bits (394), Expect = 1e-36
Identities = 67/136 (49%), Positives = 101/136 (74%)
Query: 1 KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
K++PKDA+ ++++L+++G T+YEP+V++QLLEFTYRY++ +LD+AK+YANHA + I +D
Sbjct: 19 KHIPKDAQVVMSILKELGATDYEPRVINQLLEFTYRYVTCILDDAKIYANHARKKVIELD 78
Query: 61 DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
D KLA LD +FT PPPRD L++++R +N LPL+K + RLPPDRYCL + NY+L
Sbjct: 79 DVKLATQMILDKAFTRPPPRDVLLEISRVRNATPLPLIKTHCGLRLPPDRYCLSACNYKL 138
Query: 121 KGSADKKVSSMAKAPG 136
+ + K S + G
Sbjct: 139 RAAQQPKRMSKSALEG 154
>gi|34099896|gb|AAP44969.1| transcription factor IIA large subunit-2 [Xenopus laevis]
Length = 367
Score = 154 bits (390), Expect = 3e-36
Identities = 134/401 (33%), Positives = 172/401 (42%), Gaps = 108/401 (26%)
Query: 134 APGGQNSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNE 193
A N V KLY++VIEDVIN++RE FLD+GVDE VL ELK WE KL QSKAV+ +E
Sbjct: 3 ASANANPVPKLYRSVIEDVINDVRELFLDEGVDEQVLMELKTLWESKLMQSKAVDGFHSE 62
Query: 194 -----------LRHAQS--------------------VMPLYVYANSMQGDKSQQPLVYH 222
+HAQ ++P A Q + L+ H
Sbjct: 63 EQQLLLQAQQQQQHAQHHHVQQHPQQQQAASHQTQQVLIPATHQAPQQQVLVQESKLIQH 122
Query: 223 T----MTAAGQQAAMSLPNSV-------------------------LYQRQQHP--NQQV 251
M+AA A ++LP V L Q QQ QQV
Sbjct: 123 MTPQGMSAAATAATLALPAGVTPVQQLITNSGQLLQVVRAANGAQYLIQPQQSMVLQQQV 182
Query: 252 YAPYQPG---TSTNQQSTS---GGGKQHPSVIIQADG----ANDQRITQVDGANDQRITQ 301
QPG QQ + GG Q VIIQ N ++ + Q TQ
Sbjct: 183 IPQMQPGGVQAPVIQQVLAPIPGGLSQQTGVIIQPQQILFTGNKTQVIPTTMSAPQAPTQ 242
Query: 302 VDGANDQRITQVEEANDQRITQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDGAN 361
+ Q + + QVDGA D ++ DE D+ + + ++ DG +
Sbjct: 243 GQLTVTSQQQQQGQQQTSIVLQVDGAGDTS-SEEDEDEDEDYDDEEEEDKEK----DGED 297
Query: 362 DQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDDPVELFDTDNVV 421
Q PL S DD SD++ ELFDT+NVV
Sbjct: 298 GQ-------------------------------VEEEPLNSEDDVSDEEGQELFDTENVV 326
Query: 422 VCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
VCQYDKIHR+K +WKF+LK GIMNLNG+D VF KA G+AEW
Sbjct: 327 VCQYDKIHRSKNKWKFHLKDGIMNLNGRDFVFSKAIGDAEW 367
>gi|11559984|ref|NP_071544.1| general transcription factor IIa precursor; general transcription
factor IIa, 1 (37kD and 19kD subunits); TFIIA large
subunit; general transcription factor 2a 1; general
transcription factor IIa 1 (37kD and 19kD subunits)
[Rattus norvegicus]
gi|2149996|gb|AAB58716.1| TFIIA large subunit [Rattus norvegicus]
Length = 377
Score = 154 bits (388), Expect = 5e-36
Identities = 134/406 (33%), Positives = 172/406 (42%), Gaps = 118/406 (29%)
Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAV----------- 187
N+V KLY++VIEDVIN++R+ FLDDGVDE VL ELK WE KL QS+AV
Sbjct: 8 NTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFHSEEQQLL 67
Query: 188 ----------ENSSNELRHAQSVMPLYVYANSMQGDK-----SQQP-----------LVY 221
+ + H Q P Q + SQQ L+
Sbjct: 68 LQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLIQ 127
Query: 222 HT-----MTAAGQQAAMSLPNSV-------------------------LYQRQQHP--NQ 249
H +AA A ++LP V ++Q QQ Q
Sbjct: 128 HMNASSITSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQSVVLQQ 187
Query: 250 QVYAPYQPG---TSTNQQSTS---GGGKQHPSVIIQAD-----GANDQRITQVDGANDQR 298
QV QPG QQ + GG VIIQ G Q I A
Sbjct: 188 QVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVAAPTPA 247
Query: 299 ITQVDGANDQRITQVEEANDQR--ITQVDGANDQRITQVDEANDQRITQVDGANDQRITQ 356
Q+ A Q+ Q + A Q + QVDG D T +E D+ D + + +
Sbjct: 248 QAQIPAAGQQQ-PQAQPAQQQAPLVLQVDGTGD---TSSEEDEDEEEDYDDDEEEDK--E 301
Query: 357 VDGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDDPVELFD 416
DGA D ++ + PL S DD SD++ ELFD
Sbjct: 302 KDGAEDGQVEE------------------------------EPLNSEDDVSDEEGQELFD 331
Query: 417 TDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
T+NVVVCQYDKIHR+K +WKF+LK GIMNLNG+D +F KA G+AEW
Sbjct: 332 TENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 377
>gi|17136922|ref|NP_476996.1| CG5930-PB [Drosophila melanogaster]
gi|23172421|gb|AAN14105.1| CG5930-PB [Drosophila melanogaster]
Length = 363
Score = 149 bits (377), Expect = 1e-34
Identities = 112/368 (30%), Positives = 168/368 (45%), Gaps = 52/368 (14%)
Query: 138 QNSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNE---- 193
Q SV K+Y VIEDVI N+R+AFLD+GVDE VLQE+KQ W KL SKAVE S +
Sbjct: 5 QTSVLKVYHAVIEDVITNVRDAFLDEGVDEQVLQEMKQVWRNKLLASKAVELSPDSGDGS 64
Query: 194 --------------LRHAQSVMPLYVYANSMQGDKSQQPLVYHTMTAAGQQAAMSLPNSV 239
+ ++ V ++ G S + ++AG A + N +
Sbjct: 65 HPPPIVANNPKAANAKAKKAAAATAVTSHQHIGGNSSMSSLVGLKSSAGMAAGSGIRNGL 124
Query: 240 LYQRQQHPNQQVYAPYQPGTSTN----QQSTSGGGKQHPSVIIQADGANDQRITQVD--- 292
+ +Q+ N Q P P ++ + QQ + G+ ++ D RI V+
Sbjct: 125 VPIKQE-VNSQNPPPLHPTSAASMMQKQQQAASSGQGSIPIVATLD---PNRIMPVNITL 180
Query: 293 ------GANDQRITQVD----GANDQRITQVEEANDQRITQVDGANDQRITQVDEANDQR 342
+++ R+ + + ++TQ+ A+ + T Q
Sbjct: 181 PSPAGSASSESRVLTIQVPASALQENQLTQILTAH-----LISSIMSLPTTLASSVLQQH 235
Query: 343 ITQ-VDGANDQRIT----QVDGA---NDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXX 394
+ + AN Q+ Q+DGA +D+ ++ N
Sbjct: 236 VNAALSSANHQKTLAAAKQLDGALDSSDEDESEESDDNIDNDDDDDLDKDDDEDAEHEDA 295
Query: 395 XXXXPLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQ 454
PL S DD +D+D E+FDTDNV+VCQYDKI R++ +WKF LK GIMN+ GKD VFQ
Sbjct: 296 AEEEPLNSEDDVTDEDSAEMFDTDNVIVCQYDKITRSRNKWKFYLKDGIMNMRGKDYVFQ 355
Query: 455 KATGEAEW 462
K+ G+AEW
Sbjct: 356 KSNGDAEW 363
>gi|17136920|ref|NP_476995.1| CG5930-PA [Drosophila melanogaster]
gi|16769308|gb|AAL28873.1| LD24213p [Drosophila melanogaster]
gi|23172420|gb|AAF56687.2| CG5930-PA [Drosophila melanogaster]
Length = 366
Score = 149 bits (375), Expect = 2e-34
Identities = 114/371 (30%), Positives = 169/371 (45%), Gaps = 55/371 (14%)
Query: 138 QNSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENS------- 190
Q SV K+Y VIEDVI N+R+AFLD+GVDE VLQE+KQ W KL SKAVE S
Sbjct: 5 QTSVLKVYHAVIEDVITNVRDAFLDEGVDEQVLQEMKQVWRNKLLASKAVELSPDSGDGS 64
Query: 191 -------SNELRHA-------QSVMPLYVYANSMQGDKSQQPLVYHTMTAAGQQAAMSLP 236
+N H ++ V ++ G S + ++AG A +
Sbjct: 65 HPPPIVANNPKSHKAANAKAKKAAAATAVTSHQHIGGNSSMSSLVGLKSSAGMAAGSGIR 124
Query: 237 NSVLYQRQQHPNQQVYAPYQPGTSTN----QQSTSGGGKQHPSVIIQADGANDQRITQVD 292
N ++ +Q+ N Q P P ++ + QQ + G+ ++ D RI V+
Sbjct: 125 NGLVPIKQE-VNSQNPPPLHPTSAASMMQKQQQAASSGQGSIPIVATLD---PNRIMPVN 180
Query: 293 ---------GANDQRITQVD----GANDQRITQVEEANDQRITQVDGANDQRITQVDEAN 339
+++ R+ + + ++TQ+ A+ + T
Sbjct: 181 ITLPSPAGSASSESRVLTIQVPASALQENQLTQILTAH-----LISSIMSLPTTLASSVL 235
Query: 340 DQRITQ-VDGANDQRIT----QVDGA---NDQRITQVDGANXXXXXXXXXXXXXXXXXXX 391
Q + + AN Q+ Q+DGA +D+ ++ N
Sbjct: 236 QQHVNAALSSANHQKTLAAAKQLDGALDSSDEDESEESDDNIDNDDDDDLDKDDDEDAEH 295
Query: 392 XXXXXXXPLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDL 451
PL S DD +D+D E+FDTDNV+VCQYDKI R++ +WKF LK GIMN+ GKD
Sbjct: 296 EDAAEEEPLNSEDDVTDEDSAEMFDTDNVIVCQYDKITRSRNKWKFYLKDGIMNMRGKDY 355
Query: 452 VFQKATGEAEW 462
VFQK+ G+AEW
Sbjct: 356 VFQKSNGDAEW 366
>gi|1711662|sp|P52654|T2AA_DROME TRANSCRIPTION INITIATION FACTOR IIA ALPHA AND BETA CHAINS (TFIIA
P30 AND P20 SUBUNITS) (DTFIIA-L)
gi|542595|pir||A49076 transcription factor TFIIA-L - fruit fly (Drosophila melanogaster)
gi|452955|gb|AAB28821.1| TFIIA-L [Drosophila sp.]
Length = 366
Score = 147 bits (371), Expect = 5e-34
Identities = 115/369 (31%), Positives = 166/369 (44%), Gaps = 51/369 (13%)
Query: 138 QNSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENS------- 190
Q SV K+Y VIEDVI N+R+ FLD+GVDE VLQE+KQ W KL SKAVE S
Sbjct: 5 QTSVLKVYHAVIEDVITNVRDGFLDEGVDEQVLQEMKQVWRNKLLASKAVELSPDSGDGS 64
Query: 191 -------SNELRHA-------QSVMPLYVYANSMQGDKSQQPLVYHTMTAAGQQAAMSLP 236
+N H ++ V ++ G S + ++AG A +
Sbjct: 65 HPPPIVANNPKSHKAANAKAKKAAAATAVTSHQHIGGNSSMSSLVGLKSSAGMAAGSGIR 124
Query: 237 NSVLYQRQQHPNQQVYAPYQP--GTSTNQQSTSGGGKQHPSVIIQADGANDQRITQVD-- 292
N ++ +Q+ N Q P P G S Q+ S+ I A + RI V+
Sbjct: 125 NGLVPIKQE-VNSQNPPPLHPTSGASMMQKQQQAASSGQGSIPIVAT-LDPNRIMPVNIT 182
Query: 293 -------GANDQRITQVD----GANDQRITQVEEANDQRITQVDGANDQRITQVDEANDQ 341
+++ R+ + + ++TQ+ A+ + T Q
Sbjct: 183 LPSPAGSASSESRVLTIQVPASALQENQLTQILTAH-----LISSIMSLPTTLASSVLQQ 237
Query: 342 RITQ-VDGANDQRIT----QVDGA---NDQRITQVDGANXXXXXXXXXXXXXXXXXXXXX 393
+ + AN Q+ Q+DGA +D+ ++ N
Sbjct: 238 HVNAALSSANHQKTLAAAKQLDGALDSSDEDESEESDDNIDNDDDDDLDKDDDEDAEHED 297
Query: 394 XXXXXPLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVF 453
PL S DD +D+D E+FDTDNV+VCQYDKI R++ +WKF LK GIMN+ GKD VF
Sbjct: 298 AAEEEPLNSEDDVTDEDSAEMFDTDNVIVCQYDKITRSRNKWKFYLKDGIMNMRGKDYVF 357
Query: 454 QKATGEAEW 462
QK+ G+AEW
Sbjct: 358 QKSNGDAEW 366
>gi|34099900|gb|AAP44971.1| transcription factor IIA large subunit-1 [Xenopus laevis]
Length = 370
Score = 145 bits (367), Expect = 1e-33
Identities = 123/373 (32%), Positives = 156/373 (41%), Gaps = 51/373 (13%)
Query: 134 APGGQNSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNE 193
A N V KLY++VI DVIN++RE FLD+GV E VL ELK WE KL QSKAV+ E
Sbjct: 5 ASANANPVPKLYRSVIGDVINDVREIFLDEGVGEQVLMELKTLWESKLMQSKAVDGFHTE 64
Query: 194 LR-------------------HAQSVMPLYVYANSMQGDKSQQPLVYHTMTAAGQQAAMS 234
+ H P A S Q +QQ L+ T A QQ +
Sbjct: 65 EQQLLLQAQQQQQQQQQHSQHHRVQQHPQQQQAASHQ---TQQVLIPATHQAQQQQVLVQ 121
Query: 235 LPNSVLYQRQQHPNQQVYAPYQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRITQVDGA 294
+ + Q + G + QQ + G Q V+ A+GA Q
Sbjct: 122 ESKMIHHMTPQGMSAATTLALPAGVTPVQQLITNSG-QLLQVVRAANGAQYLIQPQQSMV 180
Query: 295 NDQRIT---QVDGANDQRITQVEEANDQRITQVDGA------------------NDQRIT 333
Q++ Q G I QV ++Q G N
Sbjct: 181 LQQQVIPQMQQGGVQAPVIQQVLTPMPGGLSQQTGVIIQPQQILFTGNKTQVIPNTMSAP 240
Query: 334 QVDEANDQRITQVDGANDQR----ITQVDGANDQRITQVDGANXXXXXXXXXXXXXXXXX 389
Q +T Q+ + QVDGA D T +
Sbjct: 241 QAPIQGQLTVTSQQQQQGQQQAPLVLQVDGAGD---TSSEEDEDEDEDYDDEEEEDKEKD 297
Query: 390 XXXXXXXXXPLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGK 449
PL S DD SD++ ELFDT+NVVVCQYDKIHR+K +WKF+LK GIMNLNG+
Sbjct: 298 GEDGQVEEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGR 357
Query: 450 DLVFQKATGEAEW 462
D VF KA G+AEW
Sbjct: 358 DFVFSKAIGDAEW 370
>gi|33312518|gb|AAQ04072.1| transcription factor IIA large subunit [Xenopus laevis]
Length = 370
Score = 144 bits (364), Expect = 3e-33
Identities = 122/376 (32%), Positives = 159/376 (42%), Gaps = 57/376 (15%)
Query: 134 APGGQNSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNE 193
A N V KLY++VI DVIN++RE FLD+GV E VL ELK WE KL QSKAV+ E
Sbjct: 5 ASANANPVPKLYRSVIGDVINDVREIFLDEGVGEQVLMELKTLWESKLMQSKAVDGFHTE 64
Query: 194 LRHAQSVMPLYVYANSMQGDKSQQPLVYHTMTAAGQQAAMSLPNSVLYQRQQHPNQQVYA 253
+ L + A Q + Q + QQ A S + H QQ
Sbjct: 65 EQQ------LLLQAQQQQQQQQQHSQHHRVQQHPQQQQAASHQTQQVLIPATHQAQQQQV 118
Query: 254 PYQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDGAN------- 306
Q + T G ++ + A Q++ G Q + +GA
Sbjct: 119 LVQESKMIHHM-TPXGMSAATTLALPAGVTPVQQLITNSGQLLQVVRAANGAQYLIQPQQ 177
Query: 307 -----DQRITQVEEAN------DQRITQVDGANDQRI---------------TQV----- 335
Q I Q+++ Q +T + G Q+ TQV
Sbjct: 178 SMVLQQQVIPQMQQGGVQAPVIQQVLTPMPGGLSQQTGVIIQPQQILFTGNKTQVIPNTM 237
Query: 336 --DEANDQRITQVDGANDQR-------ITQVDGANDQRITQVDGANXXXXXXXXXXXXXX 386
+A Q V Q+ + QVDGA D T +
Sbjct: 238 SAPQAPIQGQLTVTSQQQQQGQQQAPLVLQVDGAGD---TSSEEDEDEDEDYDDEEEEDK 294
Query: 387 XXXXXXXXXXXXPLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNL 446
PL S DD SD++ ELFDT+NVVVCQYDKIHR+K +WKF+LK GIMNL
Sbjct: 295 EKDGEDGQVEEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNL 354
Query: 447 NGKDLVFQKATGEAEW 462
NG+D VF KA G+AEW
Sbjct: 355 NGRDFVFSKAIGDAEW 370
>gi|27819943|gb|AAL28886.2| LD26511p [Drosophila melanogaster]
Length = 274
Score = 141 bits (355), Expect = 3e-32
Identities = 66/136 (48%), Positives = 96/136 (70%)
Query: 1 KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
K++PKDA+ I+++L+++ V EYEP+VV+QLLEFT+RY++ +LD+AKVYANHA + I++D
Sbjct: 12 KHVPKDAQVIMSILKELNVQEYEPRVVNQLLEFTFRYVTCILDDAKVYANHARKKTIDLD 71
Query: 61 DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
D +LA LD SFT P R L VA +N++ LP +K + RLPPDRYCL VNY+L
Sbjct: 72 DVRLATEVTLDKSFTGPLERHVLAKVADVRNSMPLPPIKPHCGLRLPPDRYCLTGVNYKL 131
Query: 121 KGSADKKVSSMAKAPG 136
+ + K + + G
Sbjct: 132 RATNQPKKMTKSAVEG 147
>gi|45549141|ref|NP_523391.3| CG6474-PA [Drosophila melanogaster]
gi|2498980|sp|Q27272|TAF9_DROME Transcription initiation factor TFIID subunit 9 (Transcription
initiation factor TFIID 42 kDa subunit) (TAFII-42)
(TAFII40) (p42) (Enhancer of yellow 1 protein)
gi|1079142|pir||A49067 transcription initiation factor IID chain p42 - fruit fly
(Drosophila melanogaster)
gi|458680|gb|AAC47347.1| transcription initiation factor TFIID 42 kDa subunit
gi|463049|gb|AAA28488.1| TAFII40
gi|45447037|gb|AAF48767.3| CG6474-PA [Drosophila melanogaster]
Length = 278
Score = 141 bits (355), Expect = 3e-32
Identities = 66/136 (48%), Positives = 96/136 (70%)
Query: 1 KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
K++PKDA+ I+++L+++ V EYEP+VV+QLLEFT+RY++ +LD+AKVYANHA + I++D
Sbjct: 16 KHVPKDAQVIMSILKELNVQEYEPRVVNQLLEFTFRYVTCILDDAKVYANHARKKTIDLD 75
Query: 61 DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
D +LA LD SFT P R L VA +N++ LP +K + RLPPDRYCL VNY+L
Sbjct: 76 DVRLATEVTLDKSFTGPLERHVLAKVADVRNSMPLPPIKPHCGLRLPPDRYCLTGVNYKL 135
Query: 121 KGSADKKVSSMAKAPG 136
+ + K + + G
Sbjct: 136 RATNQPKKMTKSAVEG 151
>gi|17562510|ref|NP_504355.1| prion-like Q/N-rich domain protein PQN-51, Prion-like Q/N-rich
domain protein (pqn-51) [Caenorhabditis elegans]
gi|7505765|pir||T32660 hypothetical protein K11D12.2 - Caenorhabditis elegans
gi|2736452|gb|AAB94230.1| Prion-like-(q/n-rich)-domain-bearing protein protein 51
[Caenorhabditis elegans]
Length = 354
Score = 122 bits (307), Expect = 1e-26
Identities = 102/356 (28%), Positives = 164/356 (46%), Gaps = 38/356 (10%)
Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNEL---- 194
N +A+LY++V+ DVI N++EAFLD+ +D VL +L++ WE K+ S V+ SN
Sbjct: 5 NGIAELYKSVMADVIANMKEAFLDENIDVDVLSQLRKEWEDKVNSSGCVDLESNAPPPAP 64
Query: 195 RHAQSVMPLYVYANSM--QGDKSQQPLVYHTMTAAGQQAAMSLPNSVLYQRQQHPNQ-QV 251
R V P V N M Q +QQP+ + A + + Q QHP+Q ++
Sbjct: 65 RQQHHVPPSAVRPNPMPPQRPVAQQPVRALSALHAAHIGDAPIRMAYTGQPTQHPSQVRM 124
Query: 252 YAP-------YQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDG 304
+ P +QPG Q +G Q+ + I + RI + Q+ Q
Sbjct: 125 FNPQFQGGIHFQPGQVFVVQQPNG---QNIPMSIMPNQIPQHRI--IHQGQPQQAQQQQP 179
Query: 305 ANDQRITQVEEANDQRITQVDGANDQRITQVDEANDQRITQV--DGANDQRITQVDGAND 362
++T + + + ++ DG ++ + +V GA++++ +V G+
Sbjct: 180 QQGNQLTHMNQMDGNVGSESDGEGPSGPVKLAPKKTKCSLRVRGSGASEKKAMKVLGSLL 239
Query: 363 QRITQVDGANXXXXXXXXXXX---------------XXXXXXXXXXXXXXXPLCSADDGS 407
+ Q+DG PL S DD S
Sbjct: 240 KDF-QLDGGGGGMSDSSSEDEPDDDDDPLRRIADRMGNGEVEDGDQVAEEEPLNSEDDQS 298
Query: 408 DDDPVE-LFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
DD+ + LF+ DNVV+CQ++K++RA+ +WKF LK GIM+++ KD FQK TGEAEW
Sbjct: 299 DDEDLTMLFEADNVVMCQFEKVNRARTKWKFQLKDGIMHIDKKDYCFQKCTGEAEW 354
>gi|39583968|emb|CAE64058.1| Hypothetical protein CBG08658 [Caenorhabditis briggsae]
Length = 353
Score = 122 bits (305), Expect = 2e-26
Identities = 107/358 (29%), Positives = 152/358 (42%), Gaps = 43/358 (12%)
Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNELRHAQ 198
N +A +Y++VI DVI+N+REAFLD+ +D VL +LK+ WE K+ S V+ SN A
Sbjct: 5 NGIADIYKSVIADVISNMREAFLDENIDVDVLAQLKKEWEDKVNSSGCVDLESNAAPPAP 64
Query: 199 SVMPLYVYANSMQGD----------KSQQPLVYHTMTA----------AGQQAAMSLPNS 238
+V N ++ + + QQP T+ A A Q P
Sbjct: 65 RQQQPHVAPNPVRPNPIPQRVNPVQQQQQPRT--TLAALHMGDTPIRMAYTQPGQHQPQQ 122
Query: 239 VLYQRQQHPNQ------QVYAPYQPG-----TSTNQ-QSTSGGGKQHPSVIIQADGANDQ 286
V QQ NQ Q QPG NQ Q Q + Q
Sbjct: 123 VRMFPQQFQNQLQFPAGQFVVVQQPGGVPMSLMPNQLQHRLMPQVQQQQLQQQPQQQQSN 182
Query: 287 RITQVDGANDQRITQVDGANDQRITQVEEANDQRITQVDGANDQRITQVDEANDQ-RITQ 345
++T ++ + ++ DG +V A + T+V G + + R Q
Sbjct: 183 QLTHMNQMDGNGGSESDGEGCSEPLKVRRARAKASTKVRGTGASKKEAMKVLGSMLRDIQ 242
Query: 346 VDGANDQRITQVDGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADD 405
+DG D ++D + D PL S DD
Sbjct: 243 LDGGGG---GMSDSSSDDEVEDDDDP----LRRIADRMGNGEVEDGDQVAEEEPLNSEDD 295
Query: 406 GSDD-DPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
SDD D LFD DNVV+CQ++K++RA+ +WKF LK GIM+++ KD FQK TGEAEW
Sbjct: 296 QSDDEDLTMLFDADNVVMCQFEKVNRARTKWKFQLKDGIMHIDKKDYCFQKCTGEAEW 353
>gi|15221048|ref|NP_175816.1| transcription initiation factor IID (TFIID) 31 kDa subunit
(TAFII-31) family protein [Arabidopsis thaliana]
gi|25405699|pir||E96582 hypothetical protein F15I1.24 [imported] - Arabidopsis thaliana
gi|4587557|gb|AAD25788.1| Similar to gb|U21858 transcription initiation factor TFIID 31KD
subunit (TAFII32) from Homo sapiens. [Arabidopsis
thaliana]
gi|7638157|gb|AAF65406.1| putative TATA binding protein associated factor 21kDa subunit
[Arabidopsis thaliana]
gi|21593471|gb|AAM65438.1| transcriptional activation factor TAFII32, putative [Arabidopsis
thaliana]
gi|39545926|gb|AAR28026.1| TAF9 [Arabidopsis thaliana]
Length = 183
Score = 119 bits (299), Expect = 1e-25
Identities = 55/120 (45%), Positives = 86/120 (71%)
Query: 1 KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
+++P+DA+ + ++L+ MGV +YEP+V+HQ LE YRY+ EVL +A+VY+ HA + NI+ D
Sbjct: 7 EDVPRDAKIVKSLLKSMGVEDYEPRVIHQFLELWYRYVVEVLTDAQVYSEHASKPNIDCD 66
Query: 61 DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
D KLA+ S+++ SF+ PPPR+ L+++A +N I LP LPP++ LLS NY+L
Sbjct: 67 DVKLAIQSKVNFSFSQPPPREVLLELAASRNKIPLPKSIAGPGVPLPPEQDTLLSPNYQL 126
>gi|42733663|gb|AAS38620.1| similar to F55B11.3.p [Caenorhabditis elegans] [Dictyostelium
discoideum]
Length = 619
Score = 118 bits (296), Expect = 2e-25
Identities = 49/116 (42%), Positives = 81/116 (69%)
Query: 4 PKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVDDTK 63
P+D+ I ++L+ MG+ +EP+VV+QLLEF Y+Y+ EVL +A +Y H+G+++I+V D +
Sbjct: 213 PRDSRVIKSILKTMGIQAHEPRVVNQLLEFMYKYVFEVLQDAMIYGEHSGKTDIDVSDVR 272
Query: 64 LAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYR 119
L++ S+++ + PPPR+ L+ +A +KN LP + N LPPD YCL + NY+
Sbjct: 273 LSIQSRVNYQISTPPPRELLLSIAEEKNKTPLPPIPNRYGVYLPPDEYCLTNPNYQ 328
>gi|13878225|ref|NP_113568.1| general transcription factor II A, 1 isoform 1; general
transcription factor IIa, 1 (37kD and 19kD subunits);
TFIIA alpha/beta [Mus musculus]
gi|12313735|gb|AAG50431.1| TFIIA alpha/beta [Mus musculus]
Length = 378
Score = 112 bits (280), Expect = 2e-23
Identities = 117/407 (28%), Positives = 159/407 (39%), Gaps = 119/407 (29%)
Query: 139 NSVAKLYQTVIEDVINNIREAFLD-------------------------DGVDEHVLQEL 173
N+V KLY++VIEDVIN++R+ FLD DG Q L
Sbjct: 8 NTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFHSEEQQLL 67
Query: 174 KQTWEQKLAQSKAVENSSNELRHAQS-------------VMPLYVYANSMQGDKSQQPLV 220
Q +Q Q + + ++ + AQ ++P A + Q L+
Sbjct: 68 LQVQQQHQPQQQQHHHHHHQHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIGPDSKLL 127
Query: 221 YHT-----MTAAGQQAAMSLPNSVLYQRQQHPN--------------------------- 248
H +AA A ++LP V +Q N
Sbjct: 128 QHMNASSITSAAATAATLALPAGVTPVQQLLTNSGQLLQVVRAANGAQYILQPQQSVVLQ 187
Query: 249 QQVYAPYQPG---TSTNQQSTS---GGGKQHPSVIIQAD-----GANDQRITQVDGANDQ 297
QQV QPG QQ + GG VIIQ G Q I A
Sbjct: 188 QQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVAAPAP 247
Query: 298 RITQVDGANDQRITQVEEANDQR--ITQVDGANDQRITQVDEANDQRITQVDGANDQRIT 355
+ A Q+ Q + A Q + QVDG D T +E D+ D + +
Sbjct: 248 AQAPMPAAGQQQ-PQAQPAQQQAPLVLQVDGTGD---TSSEEDEDEEEDYDDDEEEDK-- 301
Query: 356 QVDGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDDPVELF 415
+ DGA D ++ + PL S DD S ++ ELF
Sbjct: 302 EKDGAEDGQVEE------------------------------EPLNSEDDVSGEEGQELF 331
Query: 416 DTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
D +NVVVCQYDKIHR+K +WKF+LK GIMNLNG+D +F KA G+AEW
Sbjct: 332 DAENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 378
>gi|42476101|ref|NP_963889.1| TFIIA alpha, p55 isoform 2; general transcription factor IIA, 1
(37kD and 19kD subunits); TFIIA alpha, p55 [Homo
sapiens]
gi|433501|emb|CAA53152.1| TFIIA [Homo sapiens]
gi|727195|dbj|BAA03603.1| TFIIA-37 [Homo sapiens]
Length = 337
Score = 110 bits (274), Expect = 8e-23
Identities = 94/292 (32%), Positives = 120/292 (41%), Gaps = 44/292 (15%)
Query: 178 EQKLAQSKAVENSSNELRHAQSVMPLYVYANSMQGDKSQQPLVYHTMTAAGQQAAMSLPN 237
+ KL Q N S A +P V S Q L+ A G Q
Sbjct: 83 DSKLIQHMNASNMSAAATAATLALPAGVTPVQQILTNSGQ-LLQVVRAANGAQYIFQPQQ 141
Query: 238 SVLYQRQQHPNQQVYAPYQPGTSTNQQSTSGGGKQHPSVIIQAD-----GANDQRITQVD 292
SV+ Q+Q P Q P GG VIIQ G Q I
Sbjct: 142 SVVLQQQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTV 201
Query: 293 GANDQRITQVDGANDQRITQVEEANDQR--ITQVDGANDQRITQVDEANDQRITQVDGAN 350
A Q+ Q+ Q + A Q + QVDG D T +E D+ D
Sbjct: 202 AAPTPAQAQITATGQQQ-PQAQPAQTQAPLVLQVDGTGD---TSSEEDEDEEEDYDDDEE 257
Query: 351 DQRITQVDGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDD 410
+ + + DGA D ++ + PL S DD SD++
Sbjct: 258 EDK--EKDGAEDGQVEE------------------------------EPLNSEDDVSDEE 285
Query: 411 PVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
ELFDT+NVVVCQYDKIHR+K +WKF+LK GIMNLNG+D +F KA G+AEW
Sbjct: 286 GQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 337
>gi|30141908|ref|NP_780544.1| general transcription factor II A, 1 isoform 2; general
transcription factor IIa, 1 (37kD and 19kD subunits);
TFIIA alpha/beta [Mus musculus]
gi|26339518|dbj|BAC33430.1| unnamed protein product [Mus musculus]
Length = 339
Score = 107 bits (268), Expect = 4e-22
Identities = 83/244 (34%), Positives = 107/244 (43%), Gaps = 43/244 (17%)
Query: 226 AAGQQAAMSLPNSVLYQRQQHPNQQVYAPYQPGTSTNQQSTSGGGKQHPSVIIQAD---- 281
A G Q + SV+ Q+Q P Q P GG VIIQ
Sbjct: 132 ANGAQYILQPQQSVVLQQQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILF 191
Query: 282 -GANDQRITQVDGANDQRITQVDGANDQRITQVEEANDQR--ITQVDGANDQRITQVDEA 338
G Q I A + A Q+ Q + A Q + QVDG D T +E
Sbjct: 192 TGNKTQVIPTTVAAPAPAQAPMPAAGQQQ-PQAQPAQQQAPLVLQVDGTGD---TSSEED 247
Query: 339 NDQRITQVDGANDQRITQVDGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXX 398
D+ D + + + DGA D ++ +
Sbjct: 248 EDEEEDYDDDEEEDK--EKDGAEDGQVEE------------------------------E 275
Query: 399 PLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATG 458
PL S DD SD++ ELFDT+NVVVCQYDKIHR+K +WKF+LK GIMNLNG+D +F KA G
Sbjct: 276 PLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIG 335
Query: 459 EAEW 462
+AEW
Sbjct: 336 DAEW 339
>gi|19075146|ref|NP_586747.1| TRANSCRIPTION INITIATION FACTOR OF THE TFIID FAMILY
[Encephalitozoon cuniculi]
gi|19068538|emb|CAD25006.1| TRANSCRIPTION INITIATION FACTOR OF THE TFIID FAMILY
[Encephalitozoon cuniculi]
Length = 137
Score = 105 bits (261), Expect = 3e-21
Identities = 49/125 (39%), Positives = 81/125 (64%), Gaps = 1/125 (0%)
Query: 4 PKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVDDTK 63
P+DA+ I +L+ +G+ E EPKV+ QLLEF YRY ++VL++A ++A H GR++I D K
Sbjct: 9 PRDAKVISVILRSLGIEECEPKVIIQLLEFAYRYTTDVLEDALLFAKHTGRTHITTSDVK 68
Query: 64 LAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYR-LKG 122
LA+ +++ F PPPR +L +++ N+ L + + R+PP LL+++Y L+
Sbjct: 69 LALQTKVGRHFVPPPPRQYLSEISTMVNSKPLTIPDGENLIRVPPSSSALLNLDYEVLRK 128
Query: 123 SADKK 127
+DKK
Sbjct: 129 DSDKK 133
>gi|46102011|gb|EAK87244.1| hypothetical protein UM06387.1 [Ustilago maydis 521]
Length = 273
Score = 104 bits (259), Expect = 5e-21
Identities = 51/122 (41%), Positives = 79/122 (64%), Gaps = 4/122 (3%)
Query: 3 LPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGR----SNIN 58
+P+DA I +L MGV++ EP V+ QLLEF +RY +VL +A VY++HA SN++
Sbjct: 25 VPRDARLIALILSSMGVSDVEPAVLLQLLEFAHRYTYDVLSDALVYSDHANSRQAGSNLS 84
Query: 59 VDDTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNY 118
+DD LA+ S+++ SFT PP +D L+ +A N I LP + + RLP ++CL +VN+
Sbjct: 85 LDDVNLAIQSRVNYSFTKPPEKDMLLALASTVNAIPLPPISDRHGVRLPAAQHCLTNVNF 144
Query: 119 RL 120
+
Sbjct: 145 SI 146
>gi|19113805|ref|NP_592893.1| hypothetical protein [Schizosaccharomyces pombe]
gi|1351629|sp|Q09869|YAG5_SCHPO Hypothetical protein C12G12.05c in chromosome I
gi|2130252|pir||S62536 hypothetical protein SPAC12G12.05c - fission yeast
(Schizosaccharomyces pombe)
gi|1052523|emb|CAA91500.1| SPAC12G12.05c [Schizosaccharomyces pombe]
Length = 163
Score = 102 bits (254), Expect = 2e-20
Identities = 48/117 (41%), Positives = 76/117 (64%), Gaps = 2/117 (1%)
Query: 4 PKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSN--INVDD 61
PKD I +L +GV Y V QLL F +RY +++ +++VYA H+ N I+V+D
Sbjct: 14 PKDVRLIHLILSSLGVPSYSQTVPLQLLTFAHRYTQQLIQDSQVYAEHSRGQNAPISVED 73
Query: 62 TKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNY 118
+LAV SQ+++SFT PPP++FL+++A ++N LP ++ RLPP++YCL N+
Sbjct: 74 VRLAVASQINHSFTGPPPKEFLLELAMERNRKPLPQIQPSYGFRLPPEKYCLTQPNW 130
>gi|30017563|gb|AAP12985.1| putative transcription initiation factor [Oryza sativa (japonica
cultivar-group)]
Length = 250
Score = 100 bits (250), Expect = 5e-20
Identities = 55/146 (37%), Positives = 86/146 (58%), Gaps = 29/146 (19%)
Query: 4 PKDAETIIAVLQDMGVTE--YEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVDD 61
P+DA + +L+ MG++E YEP+VVHQ L+ YRY+ +VL +A+VYA+HAG+ ++ DD
Sbjct: 34 PRDARVVRELLRSMGLSEGEYEPRVVHQFLDLAYRYVGDVLGDAQVYADHAGKPQLDADD 93
Query: 62 TKLAVMSQLDNSFTNPPPRD--------------------------FLVDVARQKNTIAL 95
+LA+ S+++ SF+ PPPR+ L++VAR +N I L
Sbjct: 94 VRLAIQSKVNFSFSQPPPRECSEFFHSDQDFRSRSLPSDNPLFFSMVLLEVARNRNKIPL 153
Query: 96 P-LVKNYSAARLPPDRYCLLSVNYRL 120
P + + LPP++ LLS NY+L
Sbjct: 154 PKSIAPPGSIPLPPEQDTLLSQNYQL 179
>gi|24650582|ref|NP_733208.1| CG5930-PC [Drosophila melanogaster]
gi|23172422|gb|AAN14106.1| CG5930-PC [Drosophila melanogaster]
Length = 324
Score = 100 bits (248), Expect = 9e-20
Identities = 87/333 (26%), Positives = 139/333 (41%), Gaps = 52/333 (15%)
Query: 173 LKQTWEQKLAQSKAVENSSNE------------------LRHAQSVMPLYVYANSMQGDK 214
+KQ W KL SKAVE S + + ++ V ++ G
Sbjct: 1 MKQVWRNKLLASKAVELSPDSGDGSHPPPIVANNPKAANAKAKKAAAATAVTSHQHIGGN 60
Query: 215 SQQPLVYHTMTAAGQQAAMSLPNSVLYQRQQHPNQQVYAPYQPGTSTN----QQSTSGGG 270
S + ++AG A + N ++ +Q+ N Q P P ++ + QQ + G
Sbjct: 61 SSMSSLVGLKSSAGMAAGSGIRNGLVPIKQE-VNSQNPPPLHPTSAASMMQKQQQAASSG 119
Query: 271 KQHPSVIIQADGANDQRITQVD---------GANDQRITQVD----GANDQRITQVEEAN 317
+ ++ D RI V+ +++ R+ + + ++TQ+ A+
Sbjct: 120 QGSIPIVATLD---PNRIMPVNITLPSPAGSASSESRVLTIQVPASALQENQLTQILTAH 176
Query: 318 DQRITQVDGANDQRITQVDEANDQRITQ-VDGANDQRIT----QVDGA---NDQRITQVD 369
+ T Q + + AN Q+ Q+DGA +D+ ++
Sbjct: 177 -----LISSIMSLPTTLASSVLQQHVNAALSSANHQKTLAAAKQLDGALDSSDEDESEES 231
Query: 370 GANXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDDPVELFDTDNVVVCQYDKIH 429
N PL S DD +D+D E+FDTDNV+VCQYDKI
Sbjct: 232 DDNIDNDDDDDLDKDDDEDAEHEDAAEEEPLNSEDDVTDEDSAEMFDTDNVIVCQYDKIT 291
Query: 430 RAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
R++ +WKF LK GIMN+ GKD VFQK+ G+AEW
Sbjct: 292 RSRNKWKFYLKDGIMNMRGKDYVFQKSNGDAEW 324
>gi|38492544|pdb|1NVP|C Chain C, Human TfiiaTBPDNA COMPLEX
Length = 76
Score = 100 bits (248), Expect = 9e-20
Identities = 45/64 (70%), Positives = 54/64 (84%)
Query: 399 PLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATG 458
PL S DD SD++ ELFDT+NVVVCQYDKIHR+K +WKF+LK GIMNLNG+D +F KA G
Sbjct: 13 PLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIG 72
Query: 459 EAEW 462
+AEW
Sbjct: 73 DAEW 76
>gi|32394470|gb|AAM93933.1| transcriptional activation factor TAFII32 [Griffithsia japonica]
Length = 180
Score = 96.7 bits (239), Expect = 1e-18
Identities = 50/128 (39%), Positives = 79/128 (61%), Gaps = 2/128 (1%)
Query: 17 MGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVDDTKLAVMSQLDNSFTN 76
MGV Y+P+V HQLLE YR++S VL +A+V +HA + I+ DD KLA+ S+ +FT
Sbjct: 27 MGVDSYDPRVTHQLLELLYRHVSTVLLDARVCCDHADKPAIDADDVKLALRSRNTFTFTQ 86
Query: 77 PPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRLKGSADK--KVSSMAKA 134
PPPRD + +A ++N + LP + + LPP+ L NYR+ +++ K S ++++
Sbjct: 87 PPPRDVTMRLAAERNAVPLPPIDARAGVALPPELGQLTRQNYRVMTQSNRPAKRSRLSQS 146
Query: 135 PGGQNSVA 142
P + S A
Sbjct: 147 PAARASYA 154
>gi|34862156|ref|XP_237507.2| similar to general transcription factor II A, 1-like factor;
TfIIa-alpha/beta-like factor; general transcription
factor IIA, 1-like factor [Rattus norvegicus]
Length = 791
Score = 96.3 bits (238), Expect = 1e-18
Identities = 58/184 (31%), Positives = 93/184 (50%), Gaps = 22/184 (11%)
Query: 286 QRITQVDGANDQRITQVDGANDQRIT-QVEEANDQRITQVDGANDQRITQ----VDEAND 340
+++ +++ + R+++ +++ + QV + + I Q+DG D + + +A++
Sbjct: 623 KQLRKIEEPSSLRVSEKSCTSERDLNIQVTDDDINEIIQIDGTGDNSSAEEGGSIRDADE 682
Query: 341 QRITQVDGANDQRITQ--VDGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXX 398
+ A D ++ + VD +++ T N
Sbjct: 683 NEFPGIIEAGDLKVLEEEVDSVSNEDSTANSSDNEDHQIDALEED--------------- 727
Query: 399 PLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATG 458
PL S DD S+ D +LFDTDNV+VCQYDKIHR+K RWKF LK G+M G+D VF KA G
Sbjct: 728 PLNSGDDVSEQDAPDLFDTDNVIVCQYDKIHRSKNRWKFYLKDGVMCFGGRDYVFAKAIG 787
Query: 459 EAEW 462
EAEW
Sbjct: 788 EAEW 791
Score = 59.7 bits (143), Expect = 1e-07
Identities = 25/47 (53%), Positives = 38/47 (80%)
Query: 143 KLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVEN 189
KLY++VIEDVI +R+ F ++G++E VL++LK+ WE K+ QSKA E+
Sbjct: 50 KLYRSVIEDVIEGVRDLFAEEGIEEQVLKDLKKLWETKVLQSKATED 96
>gi|26787968|ref|NP_006863.2| TFIIA-alpha/beta-like factor isoform 1; TFIIA large subunit isoform
ALF; GTF2A1-like factor [Homo sapiens]
gi|19684124|gb|AAH25991.1| TFIIA-alpha/beta-like factor, isoform 1 [Homo sapiens]
Length = 478
Score = 94.4 bits (233), Expect = 5e-18
Identities = 69/239 (28%), Positives = 107/239 (44%), Gaps = 35/239 (14%)
Query: 251 VYAPYQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRI-TQVDGANDQRITQV------- 302
V++P T++N +S G A +D+ + T GA Q +T +
Sbjct: 248 VFSPQVSQTNSNVESVLSGSAS------MAQNLHDESLSTSPHGALHQHVTDIQLHILKN 301
Query: 303 -----DGANDQRITQVEEANDQRITQVDGANDQRIT-QVDEANDQRITQVDGANDQRITQ 356
D R +EE ++ +++ D + ++ +V + + I QVDG+ D +
Sbjct: 302 RMYGCDSVKQPR--NIEEPSNIPVSEKDSNSQVDLSIRVTDDDIGEIIQVDGSGDTSSNE 359
Query: 357 VDGAN-----DQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXX--------XPLCSA 403
G+ ++ + +DG + PL S
Sbjct: 360 EIGSTRDADENEFLGNIDGGDLKVPEEEADSISNEDSATNSSDNEDPQVNIVEEDPLNSG 419
Query: 404 DDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
DD S+ D +LFDTDNV+VCQYDKIHR+K +WKF LK G+M G+D VF KA G+AEW
Sbjct: 420 DDVSEQDVPDLFDTDNVIVCQYDKIHRSKNKWKFYLKDGVMCFGGRDYVFAKAIGDAEW 478
Score = 63.5 bits (153), Expect = 9e-09
Identities = 28/51 (54%), Positives = 39/51 (76%)
Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVEN 189
N V KLY++VIEDVI +R F ++G++E VL++LKQ WE K+ QSKA E+
Sbjct: 5 NPVPKLYRSVIEDVIEGVRNLFAEEGIEEQVLKDLKQLWETKVLQSKATED 55
>gi|29553944|ref|NP_758515.1| stoned B/TFIIA-alpha/beta-like factor [Homo sapiens]
Length = 1182
Score = 94.4 bits (233), Expect = 5e-18
Identities = 69/239 (28%), Positives = 107/239 (44%), Gaps = 35/239 (14%)
Query: 251 VYAPYQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRI-TQVDGANDQRITQV------- 302
V++P T++N +S G A +D+ + T GA Q +T +
Sbjct: 952 VFSPQVSQTNSNVESVLSGSAS------MAQNLHDESLSTSPHGALHQHVTDIQLHILKN 1005
Query: 303 -----DGANDQRITQVEEANDQRITQVDGANDQRIT-QVDEANDQRITQVDGANDQRITQ 356
D R +EE ++ +++ D + ++ +V + + I QVDG+ D +
Sbjct: 1006 RMYGCDSVKQPR--NIEEPSNIPVSEKDSNSQVDLSIRVTDDDIGEIIQVDGSGDTSSNE 1063
Query: 357 VDGAN-----DQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXX--------XPLCSA 403
G+ ++ + +DG + PL S
Sbjct: 1064 EIGSTRDADENEFLGNIDGGDLKVPEEEADSISNEDSATNSSDNEDPQVNIVEEDPLNSG 1123
Query: 404 DDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
DD S+ D +LFDTDNV+VCQYDKIHR+K +WKF LK G+M G+D VF KA G+AEW
Sbjct: 1124 DDVSEQDVPDLFDTDNVIVCQYDKIHRSKNKWKFYLKDGVMCFGGRDYVFAKAIGDAEW 1182
Score = 60.8 bits (146), Expect = 6e-08
Identities = 27/51 (52%), Positives = 38/51 (74%)
Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVEN 189
N KLY++VIEDVI +R F ++G++E VL++LKQ WE K+ QSKA E+
Sbjct: 709 NIQPKLYRSVIEDVIEGVRNLFAEEGIEEQVLKDLKQLWETKVLQSKATED 759
>gi|33312516|gb|AAQ04071.1| TFIIAa/b-like factor [Xenopus laevis]
gi|34099894|gb|AAP44968.1| transcription factor ALF [Xenopus laevis]
Length = 472
Score = 94.4 bits (233), Expect = 5e-18
Identities = 57/166 (34%), Positives = 80/166 (48%), Gaps = 36/166 (21%)
Query: 299 ITQVDGANDQRITQVEEANDQRITQVDGANDQRITQVDEANDQRITQVD--GANDQRITQ 356
I QVDGA D ++D+ + V ++ + + D + + D G+N+ +
Sbjct: 341 IIQVDGAGDT-------SSDEDLGNVRDLDENEFLGIIDTEDLKALEEDDTGSNEDSTSN 393
Query: 357 VDGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDDPVELFD 416
D I ++ PL S DD S+ + +LFD
Sbjct: 394 SSDDEDPEIDVIE---------------------------EDPLNSGDDVSEQEVPDLFD 426
Query: 417 TDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
TDNV+VCQYDKIHR+K +WKF LK G+M+ GKD VF KA GEAEW
Sbjct: 427 TDNVIVCQYDKIHRSKNKWKFYLKDGVMSFGGKDYVFSKAIGEAEW 472
Score = 77.0 bits (188), Expect = 8e-13
Identities = 42/107 (39%), Positives = 63/107 (58%), Gaps = 4/107 (3%)
Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNELRHAQ 198
N V KLY+++IEDV+ ++RE F+++GVDE VL+ELKQ WE KL QSKA E + Q
Sbjct: 8 NPVPKLYKSIIEDVMESVREIFVEEGVDEQVLKELKQLWETKLLQSKATEGFFRDNSTPQ 67
Query: 199 SVMP----LYVYANSMQGDKSQQPLVYHTMTAAGQQAAMSLPNSVLY 241
V+ L+ ++ G+++ M +G +A SLP + Y
Sbjct: 68 FVLQLPQNLHHSLHTSTGNRNVTHFATGEMGTSGSNSAFSLPAGITY 114
>gi|46442061|gb|EAL01353.1| hypothetical protein CaO19.10197 [Candida albicans SC5314]
gi|46442302|gb|EAL01592.1| hypothetical protein CaO19.2682 [Candida albicans SC5314]
Length = 275
Score = 94.0 bits (232), Expect = 6e-18
Identities = 88/325 (27%), Positives = 138/325 (42%), Gaps = 60/325 (18%)
Query: 142 AKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNELRHAQSVM 201
+KLY++VIEDVIN+ R+ F ++G+DE LQEL++ W +KL QS + S +E + +
Sbjct: 7 SKLYESVIEDVINDSRQDFENNGIDESTLQELRRIWCEKLTQSGVAKFSWDEEEEEEEPV 66
Query: 202 P---LYVYANSMQGDKSQQPLVYHTMTAAGQQAAMSLPNSVLYQRQQHPNQQVYAPYQPG 258
+ V ++QP T A + + + + + N + P
Sbjct: 67 EQVDVDVEVGQSAQQAAEQPQAAINDTGATESNTSATTETTVKKEATASNSGL-----PD 121
Query: 259 TSTNQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDGANDQRITQVEEAND 318
+ +T G + PS I + AN T +G D G N I
Sbjct: 122 AQLSYNTTDSLGIEIPS--ISREKANGTEGTDKNGDGD------GGDNHGLIVP------ 167
Query: 319 QRITQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDG-ANDQRITQVDGANXXXXX 377
+I Q DG Q ++++++QVDG ++ + DG +D +++D
Sbjct: 168 -KIEQADGPIFFDNKQFK--SNKKLSQVDGESEFDEFESDGDLSDDINSELDSD------ 218
Query: 378 XXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKF 437
S + DDD E N +C +DK+ R K +WK
Sbjct: 219 ------------------------SDKEEEDDDEEET----NFALCLFDKVQRIKNKWKS 250
Query: 438 NLKVGIMNLNGKDLVFQKATGEAEW 462
L GI N+NGKD VF KA GE+EW
Sbjct: 251 TLVAGIANINGKDYVFHKANGESEW 275
>gi|12963759|ref|NP_076119.1| general transcription factor II A, 1-like factor;
TfIIa-alpha/beta-like factor; general transcription
factor IIA, 1-like factor [Mus musculus]
gi|12313737|gb|AAG50432.1| TFIIA-alpha/beta-like factor [Mus musculus]
Length = 468
Score = 93.6 bits (231), Expect = 8e-18
Identities = 57/183 (31%), Positives = 93/183 (50%), Gaps = 21/183 (11%)
Query: 286 QRITQVDGANDQRITQVDGANDQRIT-QVEEANDQRITQVDGANDQRITQ----VDEAND 340
+++ + + + R+++ + +++ + +V + + I Q+DG D T+ + +A++
Sbjct: 301 KQLRKAEEPSSLRVSEKNCTSERDLNIRVTDDDINEIIQIDGTGDNSSTEEMGSIRDADE 360
Query: 341 QRITQVDGANDQRITQ-VDGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXP 399
+ A D + + VD +++ T N P
Sbjct: 361 NEFPGIIDAGDLNVLEEVDSVSNEDSTANSSDNEDHQINAPEED---------------P 405
Query: 400 LCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGE 459
L S DD S+ D +LFDT+NV+VCQYDKIHR+K RWKF LK G+M G+D VF KA GE
Sbjct: 406 LNSGDDVSEQDVPDLFDTENVIVCQYDKIHRSKNRWKFYLKDGVMCFGGRDYVFAKAIGE 465
Query: 460 AEW 462
AEW
Sbjct: 466 AEW 468
Score = 63.9 bits (154), Expect = 7e-09
Identities = 28/51 (54%), Positives = 40/51 (78%)
Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVEN 189
N V KLYQ+VIEDVI +R+ F ++G++E VL++LK+ WE K+ QSKA E+
Sbjct: 5 NLVPKLYQSVIEDVIEGVRDLFAEEGIEEQVLKDLKKLWETKVLQSKATED 55
>gi|34098596|sp|Q8R4I4|T2AY_MOUSE TFIIA-alpha and beta like factor (General transcription factor II
A, 1-like factor)
gi|12838693|dbj|BAB24296.1| unnamed protein product [Mus musculus]
gi|30047818|gb|AAH50756.1| General transcription factor II A, 1-like factor [Mus musculus]
Length = 468
Score = 93.6 bits (231), Expect = 8e-18
Identities = 57/183 (31%), Positives = 93/183 (50%), Gaps = 21/183 (11%)
Query: 286 QRITQVDGANDQRITQVDGANDQRIT-QVEEANDQRITQVDGANDQRITQ----VDEAND 340
+++ + + + R+++ + +++ + +V + + I Q+DG D T+ + +A++
Sbjct: 301 KQLRKAEEPSSLRVSEKNCTSERDLNIRVTDDDINEIIQIDGTGDNSSTEEMGSIRDADE 360
Query: 341 QRITQVDGANDQRITQ-VDGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXP 399
+ A D + + VD +++ T N P
Sbjct: 361 NEFPGIIDAGDLNVLEEVDSVSNEDSTANSSDNEDHQINAPEED---------------P 405
Query: 400 LCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGE 459
L S DD S+ D +LFDT+NV+VCQYDKIHR+K RWKF LK G+M G+D VF KA GE
Sbjct: 406 LNSGDDVSEQDVPDLFDTENVIVCQYDKIHRSKNRWKFYLKDGVMCFGGRDYVFAKAIGE 465
Query: 460 AEW 462
AEW
Sbjct: 466 AEW 468
Score = 63.9 bits (154), Expect = 7e-09
Identities = 28/51 (54%), Positives = 40/51 (78%)
Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVEN 189
N V KLYQ+VIEDVI +R+ F ++G++E VL++LK+ WE K+ QSKA E+
Sbjct: 5 NLVPKLYQSVIEDVIEGVRDLFAEEGIEEQVLKDLKKLWETKVLQSKATED 55
>gi|13124585|sp|Q9UNN4|T2AY_HUMAN TFIIA-alpha and beta like factor (General transcription factor II
A, 1-like factor)
gi|5091688|gb|AAD39634.1| TFIIA large subunit isoform ALF [Homo sapiens]
Length = 478
Score = 92.4 bits (228), Expect = 2e-17
Identities = 68/239 (28%), Positives = 107/239 (44%), Gaps = 35/239 (14%)
Query: 251 VYAPYQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRI-TQVDGANDQRITQV------- 302
V++P T+++ +S G A +D+ + T GA Q +T +
Sbjct: 248 VFSPQVSQTNSDVESVLSGSAS------MAQNLHDESLSTSPHGALHQHVTDIQLHILKN 301
Query: 303 -----DGANDQRITQVEEANDQRITQVDGANDQRIT-QVDEANDQRITQVDGANDQRITQ 356
D R +EE ++ +++ D + ++ +V + + I QVDG+ D +
Sbjct: 302 RMYGCDSVKQPR--NIEEPSNIPVSEKDSNSQVDLSIRVTDDDIGEIIQVDGSGDTSSNE 359
Query: 357 VDGAN-----DQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXX--------XPLCSA 403
G+ ++ + +DG + PL S
Sbjct: 360 EIGSTRDADENEFLGNIDGGDLKVPEEEADSISNEDSATNSSDNEDPQVNIVEEDPLNSG 419
Query: 404 DDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
DD S+ D +LFDTDNV+VCQYDKIHR+K +WKF LK G+M G+D VF KA G+AEW
Sbjct: 420 DDVSEQDVPDLFDTDNVIVCQYDKIHRSKNKWKFYLKDGVMCFGGRDYVFAKAIGDAEW 478
Score = 63.5 bits (153), Expect = 9e-09
Identities = 28/51 (54%), Positives = 39/51 (76%)
Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVEN 189
N V KLY++VIEDVI +R F ++G++E VL++LKQ WE K+ QSKA E+
Sbjct: 5 NPVPKLYRSVIEDVIEGVRNLFAEEGIEEQVLKDLKQLWETKVLQSKATED 55
>gi|40555809|gb|AAH64585.1| ALF protein [Homo sapiens]
Length = 477
Score = 92.4 bits (228), Expect = 2e-17
Identities = 68/239 (28%), Positives = 107/239 (44%), Gaps = 35/239 (14%)
Query: 251 VYAPYQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRI-TQVDGANDQRITQV------- 302
V++P T+++ +S G A +D+ + T GA Q +T +
Sbjct: 247 VFSPQVSQTNSDVESVLSGSAS------MAQNLHDESLSTSPHGALHQHVTDIQLHILKN 300
Query: 303 -----DGANDQRITQVEEANDQRITQVDGANDQRIT-QVDEANDQRITQVDGANDQRITQ 356
D R +EE ++ +++ D + ++ +V + + I QVDG+ D +
Sbjct: 301 RMYGCDSVKQPR--NIEEPSNIPVSEKDSNSQVDLSIRVTDDDIGEIIQVDGSGDTSSNE 358
Query: 357 VDGAN-----DQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXX--------XPLCSA 403
G+ ++ + +DG + PL S
Sbjct: 359 EIGSTRDADENEFLGNIDGGDLKVPEEEADSISNEDSATNSSDNEDPQVNIVEEDPLNSG 418
Query: 404 DDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
DD S+ D +LFDTDNV+VCQYDKIHR+K +WKF LK G+M G+D VF KA G+AEW
Sbjct: 419 DDVSEQDVPDLFDTDNVIVCQYDKIHRSKNKWKFYLKDGVMCFGGRDYVFAKAIGDAEW 477
Score = 63.5 bits (153), Expect = 9e-09
Identities = 28/51 (54%), Positives = 39/51 (76%)
Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVEN 189
N V KLY++VIEDVI +R F ++G++E VL++LKQ WE K+ QSKA E+
Sbjct: 4 NPVPKLYRSVIEDVIEGVRNLFAEEGIEEQVLKDLKQLWETKVLQSKATED 54
>gi|5091669|gb|AAD39617.1| SALF [Homo sapiens]
Length = 1182
Score = 92.4 bits (228), Expect = 2e-17
Identities = 68/239 (28%), Positives = 107/239 (44%), Gaps = 35/239 (14%)
Query: 251 VYAPYQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRI-TQVDGANDQRITQV------- 302
V++P T+++ +S G A +D+ + T GA Q +T +
Sbjct: 952 VFSPQVSQTNSDVESVLSGSAS------MAQNLHDESLSTSPHGALHQHVTDIQLHILKN 1005
Query: 303 -----DGANDQRITQVEEANDQRITQVDGANDQRIT-QVDEANDQRITQVDGANDQRITQ 356
D R +EE ++ +++ D + ++ +V + + I QVDG+ D +
Sbjct: 1006 RMYGCDSVKQPR--NIEEPSNIPVSEKDSNSQVDLSIRVTDDDIGEIIQVDGSGDTSSNE 1063
Query: 357 VDGAN-----DQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXX--------XPLCSA 403
G+ ++ + +DG + PL S
Sbjct: 1064 EIGSTRDADENEFLGNIDGGDLKVPEEEADSISNEDSATNSSDNEDPQVNIVEEDPLNSG 1123
Query: 404 DDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
DD S+ D +LFDTDNV+VCQYDKIHR+K +WKF LK G+M G+D VF KA G+AEW
Sbjct: 1124 DDVSEQDVPDLFDTDNVIVCQYDKIHRSKNKWKFYLKDGVMCFGGRDYVFAKAIGDAEW 1182
Score = 60.8 bits (146), Expect = 6e-08
Identities = 27/51 (52%), Positives = 38/51 (74%)
Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVEN 189
N KLY++VIEDVI +R F ++G++E VL++LKQ WE K+ QSKA E+
Sbjct: 709 NIQPKLYRSVIEDVIEGVRNLFAEEGIEEQVLKDLKQLWETKVLQSKATED 759
>gi|1942936|pdb|1TAF|A Chain A, Drosophila Tbp Associated Factors Dtafii42DTAFII62
Heterotetramer
Length = 68
Score = 88.6 bits (218), Expect = 3e-16
Identities = 38/68 (55%), Positives = 56/68 (82%)
Query: 4 PKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVDDTK 63
PKDA+ I+++L+++ V EYEP+VV+QLLEFT+RY++ +LD+AKVYANHA + I++DD +
Sbjct: 1 PKDAQVIMSILKELNVQEYEPRVVNQLLEFTFRYVTSILDDAKVYANHARKKTIDLDDVR 60
Query: 64 LAVMSQLD 71
LA LD
Sbjct: 61 LATEVTLD 68
>gi|6324768|ref|NP_014837.1| Transcription factor IIA, large chain; Toa1p [Saccharomyces
cerevisiae]
gi|418108|sp|P32773|TOA1_YEAST Transcription initiation factor IIA large chain (TFIIA 32 kDa
subunit)
gi|285446|pir||B41810 transcription factor IIA large chain - yeast (Saccharomyces
cerevisiae)
gi|172893|gb|AAA19654.1| transcription factor IIA
gi|1420463|emb|CAA99407.1| TOA1 [Saccharomyces cerevisiae]
Length = 286
Score = 85.5 bits (210), Expect = 2e-15
Identities = 73/326 (22%), Positives = 140/326 (42%), Gaps = 52/326 (15%)
Query: 142 AKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNELRHAQSVM 201
+++Y+ ++E V+N +RE F + G+DE LQ+LK W++KL ++K S + + ++
Sbjct: 7 SRVYEIIVESVVNEVREDFENAGIDEQTLQDLKNIWQKKLTETKVTTFSWDNQFNEGNI- 65
Query: 202 PLYVYANSMQGDKSQQPLVYHTMTAAGQQAAMSLPNSVLYQRQQHPNQQVYAPYQPGTST 261
N +Q D + ++ + N L + N + P+ T+
Sbjct: 66 ------NGVQNDLNFNLATPGVNSSEFNIKEENTGNEGLILPNINSNNNI--PHSGETNI 117
Query: 262 NQQSTSGGGKQHPSVIIQADGANDQRIT---QVDGANDQRITQVDGANDQRITQVEEAND 318
N + ++ G + +T +++ + +T N+ IT VE +D
Sbjct: 118 NTNTVEATNNSGATLNTNTSGNTNADVTSQPKIEVKPEIELT----INNANITTVENIDD 173
Query: 319 QRITQVDGANDQRITQVDEANDQRITQV--DGANDQRITQVDGANDQRITQVDGANXXXX 376
+ + D ++ + + + +Q I QV ++R +D D+ +++D ++
Sbjct: 174 ESEKKDDEEKEEDVEKTRKEKEQ-IEQVKLQAKKEKRSALLD--TDEVGSELDDSDDDYL 230
Query: 377 XXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWK 436
+G +D P E N+++C YDK+ R K RWK
Sbjct: 231 IS--------------------------EGEEDGPDE-----NLMLCLYDKVTRTKARWK 259
Query: 437 FNLKVGIMNLNGKDLVFQKATGEAEW 462
+LK G++ +N D FQKA EAEW
Sbjct: 260 CSLKDGVVTINRNDYTFQKAQVEAEW 285
>gi|46107550|ref|XP_380834.1| hypothetical protein FG00658.1 [Gibberella zeae PH-1]
gi|42545545|gb|EAA68388.1| hypothetical protein FG00658.1 [Gibberella zeae PH-1]
Length = 413
Score = 82.8 bits (203), Expect = 1e-14
Identities = 94/411 (22%), Positives = 143/411 (34%), Gaps = 90/411 (21%)
Query: 140 SVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAV------------ 187
+V +Y+ +I++V+N+ R F + GV+E VL+EL+Q W+QKL Q
Sbjct: 5 AVGGVYKAIIDEVVNSSRVDFEESGVEESVLEELRQGWQQKLTQLDVAGFPWDPKPDPPP 64
Query: 188 -----ENSSNELRHAQS------------VMPLYVYANSMQGDKSQQPLVYHTMTAAGQQ 230
+ HAQ+ +P + G K + V +Q
Sbjct: 65 AQAPPPAMAAAQTHAQAQFSPHPGNPTGLSLPGMAQPQAQNGVKPEGSYVKTESPVPIKQ 124
Query: 231 AAMSLPNSVLYQRQQHPN---------------QQVYAPYQPGTSTNQQSTSGGGKQHPS 275
N + YQ+ +PN QQ+ A Y + + + + HP
Sbjct: 125 EPGLQGNPMAYQQYANPNINMNDKNNVAANRAAQQLQAQYGQRAAGSINALHQQQQPHPQ 184
Query: 276 VIIQADGANDQRITQVDGANDQRITQVDGANDQRITQVEEANDQRITQVDGANDQRITQV 335
Q + Q + Q+ Q G Q +TQ ++ Q Q A Q
Sbjct: 185 Q-GQQQQQQQPALPQQQQTHHQQQQQQPG-QQQNLTQQQQQQQQMYRQQMAAATAHQQQQ 242
Query: 336 DEANDQ---RITQVDGAND---------QRITQVDGANDQRI--------------TQVD 369
+N Q + Q DGA D QR D R+ ++
Sbjct: 243 HPSNGQPHIQNGQTDGAGDVDDYEGVLMQRAASGDFCELGRVEIDRMLHEQILAKAKSME 302
Query: 370 GANXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDD----------DPVELFDTD- 418
G DD +D DP + D D
Sbjct: 303 GGGLMVPLKEATRHSSASTSRKAKGKQPAAFDGGDDDEEDDDDAINSDLDDPEDEHDDDD 362
Query: 419 -------NVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
N+++C YDK+ R K +WK LK G++ +NGK+ VF KATGE EW
Sbjct: 363 VDDDGLGNIMLCMYDKVQRVKNKWKCTLKDGVLTVNGKEYVFHKATGEYEW 413
>gi|32422741|ref|XP_331814.1| hypothetical protein [Neurospora crassa]
gi|28926818|gb|EAA35782.1| hypothetical protein [Neurospora crassa]
gi|38567155|emb|CAE76449.1| related to transcription factor TFIIA-L [Neurospora crassa]
Length = 421
Score = 81.6 bits (200), Expect = 3e-14
Identities = 98/422 (23%), Positives = 156/422 (36%), Gaps = 104/422 (24%)
Query: 140 SVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQ---------------- 183
+V +Y+ +I +VIN +R F ++GVD+ VL+ELK+ W+ KL+Q
Sbjct: 5 AVGTVYEHIINEVINAVRVDFEENGVDDSVLEELKKGWQHKLSQLNIAQFPWDPKPEAPP 64
Query: 184 -SKAVENSSNELRHAQSVMPLYVYANSMQGDKSQQPLVYHTMTAAGQ--------QAAMS 234
++A +N+S+ + +AQ+ AN Q S Q GQ ++
Sbjct: 65 PAQATQNTSSAV-NAQAAATQQPTANYTQSTLSPQTAAQSPSLPGGQPNGNGVAIKSEPG 123
Query: 235 LPNSVLYQRQQHPNQQVYAPYQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRITQVDGA 294
+ N +++ Q + P PG N + G Q + I A+ Q ++
Sbjct: 124 MANEPTIKQEPGTMQPMMHPAYPG---NNAARGGMAAQRAAQAI-ANRYGSQAQASINAI 179
Query: 295 NDQ-RITQVDG---------ANDQRITQVEEANDQRITQVDGANDQRITQVDEAN-DQRI 343
+ Q + Q+ G Q+ Q ++ Q+ Q + Q+ Q A QRI
Sbjct: 180 HQQTQQPQIPGQAPPPPLQQQQQQQQQQQQQQQQQQQQQQQFSPQQQYQQTVAAQLQQRI 239
Query: 344 TQVDG----------ANDQRITQVDGAND------------------------------- 362
Q G N QVDG++D
Sbjct: 240 PQQPGQPGQPQPPNQQNGLGGAQVDGSSDGFDGVMMRRDADGNPVEMGRVEIDNLLHAQL 299
Query: 363 -QRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXP--LCSADDGSD----------- 408
+R Q +G P L D+ D
Sbjct: 300 LERAKQREGGGLMLPLKEATKQRSNATKSRRAAAAGGPNQLDGNDEDDDEFDEDAINSDL 359
Query: 409 DDPVELFDTDN--------VVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEA 460
DDP + D D+ +++C YDK+ R K +WK LK G++ +NGK+ VF KA GE
Sbjct: 360 DDPEDERDDDDEEDEASGWIMLCLYDKVQRVKNKWKCTLKDGVLTVNGKEYVFHKANGEY 419
Query: 461 EW 462
EW
Sbjct: 420 EW 421
>gi|18390773|ref|NP_563790.1| transcription factor IIA large subunit, putative / TFIIA large
subunit, putative [Arabidopsis thaliana]
gi|14532726|gb|AAK64164.1| putative transcription factor IIA large subunit [Arabidopsis
thaliana]
gi|15146226|gb|AAK83596.1| At1g07470/F22G5_13 [Arabidopsis thaliana]
gi|15983370|gb|AAL11553.1| At1g07470/F22G5_13 [Arabidopsis thaliana]
gi|21554033|gb|AAM63114.1| transcription factor IIA large subunit [Arabidopsis thaliana]
gi|22136784|gb|AAM91736.1| putative transcription factor IIA large subunit [Arabidopsis
thaliana]
gi|39545872|gb|AAR27999.1| TFIIA-L2 [Arabidopsis thaliana]
Length = 375
Score = 81.3 bits (199), Expect = 4e-14
Identities = 98/384 (25%), Positives = 151/384 (39%), Gaps = 67/384 (17%)
Query: 136 GGQNSVAKLYQTVIEDVINNIREAFLDDG-VDEHVLQELKQTWEQKLAQSKAVENSSNEL 194
G + + +Y VIEDV+N +RE F+++G E VL EL+ WE K+ Q+ V N E
Sbjct: 2 GTTTTTSAVYIHVIEDVVNKVREEFINNGGPGESVLSELQGIWETKMMQA-GVLNGPIER 60
Query: 195 RHAQSVMPLYVYANSMQ-----GDKSQQPLVYHTMTAAGQQAAMSLP------NSVLYQR 243
AQ P + + ++ + P Q + P NS +Y
Sbjct: 61 SSAQKPTPGGPLTHDLNVPYEGTEEYETPTAEMLFPPTPLQTPLPTPLPGTADNSSMYNI 120
Query: 244 QQHPNQQVYAPYQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRITQVDGAND------- 296
+ + G + + ++ PS + D + VDG ++
Sbjct: 121 PTGSSDYPTPGTENGVNIDVKARPSPYMPPPSP--WTNPRLDVNVAYVDGRDEPERGNSN 178
Query: 297 QRITQ-----VDGANDQRITQVEEANDQRITQVDGANDQRITQVDEANDQ----RITQVD 347
Q+ TQ G + + N I Q DGA+D + EAN + RIT V
Sbjct: 179 QQFTQDLFVPSSGKRKRDDSSAHYQNGGSIPQQDGASDA----IPEANFECAALRITYV- 233
Query: 348 GANDQRITQVDGANDQRITQVDGA------------NXXXXXXXXXXXXXXXXXXXXXXX 395
D++I + + +I QVDG N
Sbjct: 234 --GDRKIPRDLIGSSSKIPQVDGPMPDPYDEMLSTPNIYSYQGPNEEFNEARTPAPNEIQ 291
Query: 396 XXXPLCSADD---------GSDDDPVELFD--------TDNVVVCQYDKIHRAKCRWKFN 438
P+ +D DDD EL D T ++V+ Q+DK+ R K RWK +
Sbjct: 292 TSTPVAVPNDIIEDDEELLNEDDDDDELDDLESGEDMNTQHLVLAQFDKVTRTKSRWKCS 351
Query: 439 LKVGIMNLNGKDLVFQKATGEAEW 462
LK GIM++N KD++F KATGE ++
Sbjct: 352 LKDGIMHINDKDILFNKATGEFDF 375
>gi|46097824|gb|EAK83057.1| hypothetical protein UM05183.1 [Ustilago maydis 521]
Length = 214
Score = 79.7 bits (195), Expect = 1e-13
Identities = 75/322 (23%), Positives = 131/322 (40%), Gaps = 113/322 (35%)
Query: 141 VAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNELRHAQSV 200
V++LY+++I+DV+NN+R+ F + G+++ VL+EL+++WE +K V E A ++
Sbjct: 6 VSQLYRSIIDDVVNNVRQDFEEMGIEKEVLEELQRSWE-----AKIVATQVAEFEGAPAL 60
Query: 201 MPLYVYANSMQGDKSQQPLVYHTMTAAGQQAAMSLPNSVLYQRQQHPNQQVYAPYQPGTS 260
P KS+ ++Q+ N+ + +
Sbjct: 61 PPA----------KSRSS-----------------------KKQESKNE---VKTEDDDA 84
Query: 261 TNQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDGANDQRITQVEEANDQR 320
Q S S G QH + + + DG+ D DGA DG + +A D
Sbjct: 85 NGQNSRSNGADQHRAKVARGDGSTDV----ADGA-------ADGKGQDGVAGSVKAEDD- 132
Query: 321 ITQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDGANDQRITQVDGANXXXXXXXX 380
D A Q+ ++ + +D+ + +D ++D+ +DG VDG
Sbjct: 133 ----DAAAGQKRKRISD-DDEIGSDLDDSDDE---DLDGG-------VDG---------- 167
Query: 381 XXXXXXXXXXXXXXXXXXPLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLK 440
D D++V+C YDK+ R K +WK LK
Sbjct: 168 -----------------------------------DNDDMVLCLYDKVQRVKNKWKCVLK 192
Query: 441 VGIMNLNGKDLVFQKATGEAEW 462
G+ +++G+D +F K GE EW
Sbjct: 193 DGVASIDGRDYLFAKCNGEFEW 214
>gi|30680148|ref|NP_850937.1| transcription factor IIA large subunit / TFIIA large subunit
(TFIIA-L) [Arabidopsis thaliana]
gi|30680153|ref|NP_172228.3| transcription factor IIA large subunit / TFIIA large subunit
(TFIIA-L) [Arabidopsis thaliana]
gi|11358887|pir||T51333 transcription factor IIA large chain [validated] - Arabidopsis
thaliana
gi|2826884|emb|CAA11525.1| transcription factor IIA large subunit [Arabidopsis thaliana]
gi|39545932|gb|AAR28029.1| TFIIA-L1 [Arabidopsis thaliana]
Length = 375
Score = 79.3 bits (194), Expect = 2e-13
Identities = 93/380 (24%), Positives = 147/380 (38%), Gaps = 59/380 (15%)
Query: 136 GGQNSVAKLYQTVIEDVINNIREAFLDDG-VDEHVLQELKQTWEQKLAQSKAVENSSNEL 194
G + + +Y VIEDV+N +RE F+++G E VL EL+ WE K+ Q+ V N E
Sbjct: 2 GTTTTTSAVYIHVIEDVVNKVREEFINNGGPGESVLSELQGIWETKMMQA-GVLNGPIER 60
Query: 195 RHAQSVMPLYVYANSMQ-----GDKSQQPLVYHTMTAAGQQAAMSLP------NSVLYQR 243
AQ P + + ++ + P Q + P NS +Y
Sbjct: 61 SSAQKPTPGGPLTHDLNVPYEGTEEYETPTAEMLFPPTPLQTPLPTPLPGTADNSSMYNI 120
Query: 244 QQHPNQQVYAPYQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRITQVDGAND------- 296
+ + G + + ++ PS + D + VDG ++
Sbjct: 121 PTGSSDYPTPGTENGVNIDVKARPSPYMPPPSP--WTNPRLDVNVAYVDGRDEPERGNSN 178
Query: 297 QRITQ-----VDGANDQRITQVEEANDQRITQVDGANDQRITQVDEANDQRITQVDGAND 351
Q+ TQ G + + N I Q DGA D E + RIT + D
Sbjct: 179 QQFTQDLFVPSSGKRKRDDSSGHYQNGGSIPQQDGAGDAIPEANFECDAFRITSI---GD 235
Query: 352 QRITQVDGANDQRITQVDGA------------NXXXXXXXXXXXXXXXXXXXXXXXXXXP 399
+++ + ++ +I QVDG N P
Sbjct: 236 RKVPRDFFSSSSKIPQVDGPMPDPYDEMLSTPNIYSYQGPSEEFNEARTPAPNEIQTSTP 295
Query: 400 LCSADD---------GSDDDPVELFD--------TDNVVVCQYDKIHRAKCRWKFNLKVG 442
+ +D DDD EL D T ++V+ Q+DK+ R K RWK +LK G
Sbjct: 296 VAVQNDIIEDDEELLNEDDDDDELDDLESGEDMNTQHLVLAQFDKVTRTKSRWKCSLKDG 355
Query: 443 IMNLNGKDLVFQKATGEAEW 462
IM++N KD++F KA GE ++
Sbjct: 356 IMHINDKDILFNKAAGEFDF 375
>gi|6323892|ref|NP_013963.1| Subunit (17 kDa) of TFIID and SAGA complexes, involved in RNA
polymerase II transcription initiation and in chromatin
modification, similar to histone H3; Taf9p
[Saccharomyces cerevisiae]
gi|2497197|sp|Q05027|TAF9_YEAST Transcription initiation factor TFIID subunit 9 (TBP-associated
factor 9) (TBP-associated factor 17 kDa) (TAFII-17)
gi|1362400|pir||S57603 hypothetical protein YMR236w - yeast (Saccharomyces cerevisiae)
gi|887617|emb|CAA90207.1| unknown [Saccharomyces cerevisiae]
Length = 157
Score = 78.6 bits (192), Expect = 3e-13
Identities = 39/123 (31%), Positives = 69/123 (56%), Gaps = 5/123 (4%)
Query: 4 PKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSN-----IN 58
P+D + +L + +YE +V QL++F +RY VL +A VY ++AG N +
Sbjct: 30 PRDVRLLHLLLASQSIHQYEDQVPLQLMDFAHRYTQGVLKDALVYNDYAGSGNSAGSGLG 89
Query: 59 VDDTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNY 118
V+D +LA+ ++ F P++ ++ +A ++N ALP V RLPP++YCL + +
Sbjct: 90 VEDIRLAIAARTQYQFKPTAPKELMLQLAAERNKKALPQVMGTWGVRLPPEKYCLTAKEW 149
Query: 119 RLK 121
L+
Sbjct: 150 DLE 152
>gi|24059851|dbj|BAC21319.1| hypothetical protein [Oryza sativa (japonica cultivar-group)]
Length = 425
Score = 76.6 bits (187), Expect = 1e-12
Identities = 38/95 (40%), Positives = 58/95 (61%), Gaps = 3/95 (3%)
Query: 4 PKDAETIIAVLQDMGVTE--YEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVDD 61
P+D + +L +G+ E YE VH+LL F +RY +VL EAK YA HAGR ++ DD
Sbjct: 27 PRDLRVVREILHSLGLREGDYEEAAVHKLLLFAHRYAGDVLGEAKAYAGHAGRESLQADD 86
Query: 62 TKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALP 96
+LA+ ++ S PP R+ ++D+A + N I +P
Sbjct: 87 VRLAIQAR-GMSSAAPPSREEMLDIAHKCNEIPIP 120
>gi|45199045|ref|NP_986074.1| AFR527Wp [Eremothecium gossypii]
gi|44985120|gb|AAS53898.1| AFR527Wp [Eremothecium gossypii]
Length = 137
Score = 75.1 bits (183), Expect = 3e-12
Identities = 37/124 (29%), Positives = 70/124 (56%), Gaps = 4/124 (3%)
Query: 1 KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGR----SN 56
++ P+D + +L + YE +V QL++F +RY VL +A +Y +A ++
Sbjct: 5 EDTPRDVRLLHLLLASQSIHAYEDQVPLQLMDFAHRYTRGVLKDAMMYNEYANNGSAPAD 64
Query: 57 INVDDTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSV 116
+ V+D +LA+ S+ F P++ L+ +A+++N LP V + RLPP++YCL +
Sbjct: 65 LTVEDVRLAIGSRTQYQFKPTAPKELLLQLAQERNKKPLPQVMSMWGVRLPPEKYCLTAK 124
Query: 117 NYRL 120
++L
Sbjct: 125 EWQL 128
>gi|25406976|pir||D86209 protein F22G5.18 [imported] - Arabidopsis thaliana
gi|8778565|gb|AAF79573.1| F22G5.18 [Arabidopsis thaliana]
Length = 475
Score = 75.1 bits (183), Expect = 3e-12
Identities = 101/402 (25%), Positives = 155/402 (38%), Gaps = 84/402 (20%)
Query: 136 GGQNSVAKLYQTVIEDVINNIREAFLDDG-VDEHVLQELKQTWEQKLAQSKAVENSSNEL 194
G + + +Y VIEDV+N +RE F+++G E VL EL+ WE K+ Q+ V N E
Sbjct: 2 GTTTTTSAVYIHVIEDVVNKVREEFINNGGPGESVLSELQGIWETKMMQA-GVLNGPIER 60
Query: 195 RHAQSVMPLYVYANSMQ-----GDKSQQPLVYHTMTAAGQQAAMSLP------NSVLYQ- 242
AQ P + + ++ + P Q + P NS +Y
Sbjct: 61 SSAQKPTPGGPLTHDLNVPYEGTEEYETPTAEMLFPPTPLQTPLPTPLPGTADNSSMYNI 120
Query: 243 -----------RQQHPNQQVYAPYQPGTSTNQQSTSGG-GKQHPSVIIQADGAN------ 284
+ N V A P ++ + GG K S ++Q +
Sbjct: 121 PTGSSDYPTPGTENGVNIDVKARPSPYMLCSETAYLGGLQKLRLSTLLQPPPSPWTNPRL 180
Query: 285 DQRITQVDGAND-------QRITQ-----VDGANDQRITQVEEANDQRITQVDGANDQRI 332
D + VDG ++ Q+ TQ G + + N I Q DGA+D
Sbjct: 181 DVNVAYVDGRDEPERGNSNQQFTQDLFVPSSGKRKRDDSSAHYQNGGSIPQQDGASDA-- 238
Query: 333 TQVDEANDQ----RITQVDGANDQRITQVDGANDQRITQVDGA------------NXXXX 376
+ EAN + RIT V D++I + + +I QVDG N
Sbjct: 239 --IPEANFECAALRITYV---GDRKIPRDLIGSSSKIPQVDGPMPDPYDEMLSTPNIYSY 293
Query: 377 XXXXXXXXXXXXXXXXXXXXXXPLCSADD---------GSDDDPVELFD--------TDN 419
P+ +D DDD EL D T +
Sbjct: 294 QGPNEEFNEARTPAPNEIQTSTPVAVPNDIIEDDEELLNEDDDDDELDDLESGEDMNTQH 353
Query: 420 VVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAE 461
+V+ Q+DK+ R K RWK +LK GIM++N KD++F KA+ ++
Sbjct: 354 LVLAQFDKVTRTKSRWKCSLKDGIMHINDKDILFNKASSTSD 395
>gi|38492543|pdb|1NVP|B Chain B, Human TfiiaTBPDNA COMPLEX
Length = 57
Score = 73.6 bits (179), Expect = 9e-12
Identities = 34/50 (68%), Positives = 42/50 (84%)
Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVE 188
N+V KLY++VIEDVIN++R+ FLDDGVDE VL ELK WE KL QS+AV+
Sbjct: 7 NTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVD 56
>gi|17555056|ref|NP_499814.1| TBP-Associated transcription Factor family member (20.6 kD) (taf-9)
[Caenorhabditis elegans]
gi|7507733|pir||T24863 hypothetical protein T12D8.7 - Caenorhabditis elegans
gi|3879796|emb|CAB03347.1| C. elegans TAF-9 protein (corresponding sequence T12D8.7)
[Caenorhabditis elegans]
Length = 183
Score = 72.8 bits (177), Expect = 1e-11
Identities = 44/163 (26%), Positives = 80/163 (49%), Gaps = 2/163 (1%)
Query: 5 KDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVDDTKL 64
K+A +I++L + G+ E++P+VV L++ Y S++L + + HA ++ I +D +
Sbjct: 21 KEALAVISMLHECGIQEFDPRVVSMLMDVQYAVTSKILQMSSGLSRHAEKAQIEAEDVQT 80
Query: 65 AVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRLKGSA 124
A L TN P R+ ++ +A KN LP +++ +LP DR+C L N+ K
Sbjct: 81 AA-DMLGVLTTNAPDREKILQLANDKNQQPLPQIRHNYGLKLPNDRFCQLQQNFVYKADD 139
Query: 125 DKKVSSMAKAPGGQNSVAKLYQTV-IEDVINNIREAFLDDGVD 166
+ M + + + Q + E V N ++ DD D
Sbjct: 140 SYQPMEMQQVTHQPHIIEPPTQVLRPEHVQNMLKRRAPDDDFD 182
>gi|46432465|gb|EAK91945.1| hypothetical protein CaO19.8708 [Candida albicans SC5314]
Length = 229
Score = 72.4 bits (176), Expect = 2e-11
Identities = 38/130 (29%), Positives = 69/130 (53%), Gaps = 12/130 (9%)
Query: 1 KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSN---- 56
K +P+D + + V Y+ V QL++F +RY VL +A VY +H+ N
Sbjct: 75 KIIPRDVRLLHLIFATHAVHNYQDHVPLQLMDFAHRYTKSVLKDALVYNDHSKPINANTN 134
Query: 57 --------INVDDTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPP 108
+N DD +LA+ ++ + F PP++ L+++A+++N+ LP V LPP
Sbjct: 135 LNSNTNATVNTDDIRLAIAARSNYQFKPVPPKELLLELAQERNSRPLPPVIPRWGLNLPP 194
Query: 109 DRYCLLSVNY 118
++YCL + ++
Sbjct: 195 EKYCLTAKDW 204
>gi|46432443|gb|EAK91924.1| hypothetical protein CaO19.1111 [Candida albicans SC5314]
Length = 232
Score = 72.4 bits (176), Expect = 2e-11
Identities = 38/130 (29%), Positives = 69/130 (53%), Gaps = 12/130 (9%)
Query: 1 KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSN---- 56
K +P+D + + V Y+ V QL++F +RY VL +A VY +H+ N
Sbjct: 77 KIIPRDVRLLHLIFATHAVHNYQDHVPLQLMDFAHRYTKSVLKDALVYNDHSKPINANTN 136
Query: 57 --------INVDDTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPP 108
+N DD +LA+ ++ + F PP++ L+++A+++N+ LP V LPP
Sbjct: 137 LNSNTNATVNTDDIRLAIAARSNYQFKPVPPKELLLELAQERNSRPLPPVIPRWGLNLPP 196
Query: 109 DRYCLLSVNY 118
++YCL + ++
Sbjct: 197 EKYCLTAKDW 206
>gi|40746468|gb|EAA65624.1| hypothetical protein AN0794.2 [Aspergillus nidulans FGSC A4]
Length = 287
Score = 72.0 bits (175), Expect = 3e-11
Identities = 50/173 (28%), Positives = 78/173 (45%), Gaps = 36/173 (20%)
Query: 4 PKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVDDT- 62
P+D I +L GVT Y+ +V QLL+F YRY S VL +A V+ G + D T
Sbjct: 73 PRDVRLIHMLLAAQGVTAYQERVPLQLLDFAYRYTSGVLQDA-VHLTTEGYAGTGADTTT 131
Query: 63 ------------------KLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALP-LVKNYSA 103
+L++ S+L F P++FL+DVA ++N +ALP + + A
Sbjct: 132 SSGNKGPPEVNSVTLPALRLSIASRLHYQFQTGLPKEFLMDVAAERNRVALPGATRGFDA 191
Query: 104 A---------------RLPPDRYCLLSVNYRLKGSADKKVSSMAKAPGGQNSV 141
RLPP+R+CL +++ + + AP +N V
Sbjct: 192 GAGKPAASNSVLMAGMRLPPERFCLTGTGWKMSDEWESEGEEDIAAPEEKNDV 244
>gi|39591770|emb|CAE71348.1| Hypothetical protein CBG18251 [Caenorhabditis briggsae]
Length = 184
Score = 70.5 bits (171), Expect = 7e-11
Identities = 43/164 (26%), Positives = 80/164 (48%), Gaps = 3/164 (1%)
Query: 5 KDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVDDTKL 64
K+A I+++L + GV E++P+VV L++ Y S++L + + HA + I+ +D +
Sbjct: 21 KEAMAIMSLLGECGVEEFDPRVVSMLMDVQYAVTSKILQVSSGLSRHADKQKIDSEDVQT 80
Query: 65 AVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRLKGSA 124
A L +N P R+ ++ +A KN LP +++ +LP DR+C L N+ K
Sbjct: 81 AA-DMLGVLSSNAPDREKILQMANDKNQQPLPQIRHNYGLKLPNDRFCQLQQNFVYKADD 139
Query: 125 DKKVSSMAKAPGGQNSVAKLYQTVI--EDVINNIREAFLDDGVD 166
+ + + + +V+ E V N ++ DD D
Sbjct: 140 NSQQMETQQVAHTPRIIEPPQSSVLRPEQVQNMLKRRAPDDDFD 183
>gi|19112462|ref|NP_595670.1| putative transcription initiation factor IIA large subunit
[Schizosaccharomyces pombe]
gi|7493058|pir||T40052 probable transcription initiation factor IIA large subunit -
fission yeast (Schizosaccharomyces pombe)
gi|6018693|emb|CAB57938.1| SPBC28F2.09 [Schizosaccharomyces pombe]
Length = 369
Score = 70.1 bits (170), Expect = 1e-10
Identities = 91/391 (23%), Positives = 138/391 (35%), Gaps = 97/391 (24%)
Query: 141 VAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSK--------------- 185
V ++Y VI DVI N R F ++GVD+ L+EL+ W+ KL +
Sbjct: 6 VGEVYHHVILDVIANSRSDFEENGVDDATLRELQNLWQSKLVATDVATFPWAQAPVGTFP 65
Query: 186 ---------------------AVENSS--NELRHAQSVMPLYVYANSMQGDKSQQPLVYH 222
AV NS N + ++V + +A P
Sbjct: 66 IGQLFDPVSGLRTDSLDVTAPAVANSPILNNIAAIRAVQQMDTFAQQHGNSNYYSP---P 122
Query: 223 TMTAAGQQAAMSLPNSVLYQRQQHPNQQVYAPYQPGTS----TNQQSTSGGGKQHPSVII 278
T + +S +S + Q +PN P S TNQ + S H + +
Sbjct: 123 TPSLPQSATNISFDSSAIPNVQSNPNNTAPFPSYSSNSLQLPTNQTADSPIINDHSTANV 182
Query: 279 QADG---ANDQRITQVDGA------NDQRITQVDGANDQRITQVEEANDQRITQVDGA-- 327
+ G A D T G N + +++ T ND + Q DGA
Sbjct: 183 TSTGQEHAPDSSSTNSFGGLLLPNQNSPKKSELGETESSNTTPANSRND--VPQTDGAIH 240
Query: 328 --NDQRITQVDEANDQRITQVDGAN------DQRITQVDGA----NDQRITQVDGANXXX 375
+D E+N I Q A RI Q+DG D++ VD +
Sbjct: 241 DLDDAGSPSNFESNRFAIAQKADAEIYEVLKKNRILQIDGTIEDNEDEKKPPVDTPSDEA 300
Query: 376 XXXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDDPVELFDTDNV----VVCQYDKIHRA 431
DD D+ E + ++ V+C YDK++
Sbjct: 301 INS-----------------------DLDDPDSDEAPETEEGSDIGQAIVLCLYDKVNHH 337
Query: 432 KCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
K +WK + G++ +NGKD +F KA GE EW
Sbjct: 338 KNKWKCVFRDGVVGVNGKDYLFFKANGEFEW 368
>gi|32420261|ref|XP_330574.1| hypothetical protein [Neurospora crassa]
gi|28925958|gb|EAA34951.1| hypothetical protein [Neurospora crassa]
gi|38567266|emb|CAE76556.1| related to TFIID and SAGA subunit [Neurospora crassa]
Length = 320
Score = 67.8 bits (164), Expect = 5e-10
Identities = 50/148 (33%), Positives = 77/148 (52%), Gaps = 29/148 (19%)
Query: 4 PKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDE-----AKVYANHA------ 52
P+DA TI +L GVT +E +V LL+F YR+ S VL + A Y +HA
Sbjct: 96 PRDARTIELLLTAQGVTAFEQRVPLLLLDFAYRHTSAVLSDALHLSADPYTSHAGARPSA 155
Query: 53 --GRSNINVDD-------TKLAVMSQLDNSFTNPP--------PRDFLVDVARQKNTIAL 95
G + +NV D +LA+ S+L+ F +++++++AR+KN +AL
Sbjct: 156 SSGAAPVNVGDATITSNAVQLAIASRLNFQFRGGSGGGSGGGVSKEWMMEMAREKNKVAL 215
Query: 96 PLVK-NYSAARLPPDRYCLLSVNYRLKG 122
P V N RLP +R+ L V++ LKG
Sbjct: 216 PKVSVNEWGVRLPSERFVLNGVSWGLKG 243
>gi|26787966|ref|NP_751946.1| TFIIA-alpha/beta-like factor isoform 2; TFIIA large subunit isoform
ALF; GTF2A1-like factor [Homo sapiens]
Length = 453
Score = 63.5 bits (153), Expect = 9e-09
Identities = 28/51 (54%), Positives = 39/51 (76%)
Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVEN 189
N V KLY++VIEDVI +R F ++G++E VL++LKQ WE K+ QSKA E+
Sbjct: 5 NPVPKLYRSVIEDVIEGVRNLFAEEGIEEQVLKDLKQLWETKVLQSKATED 55
Score = 49.3 bits (116), Expect = 2e-04
Identities = 48/204 (23%), Positives = 81/204 (39%), Gaps = 35/204 (17%)
Query: 251 VYAPYQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRI-TQVDGANDQRITQV------- 302
V++P T++N +S G A +D+ + T GA Q +T +
Sbjct: 248 VFSPQVSQTNSNVESVLSGSAS------MAQNLHDESLSTSPHGALHQHVTDIQLHILKN 301
Query: 303 -----DGANDQRITQVEEANDQRITQVDGANDQRIT-QVDEANDQRITQVDGANDQRITQ 356
D R +EE ++ +++ D + ++ +V + + I QVDG+ D +
Sbjct: 302 RMYGCDSVKQPR--NIEEPSNIPVSEKDSNSQVDLSIRVTDDDIGEIIQVDGSGDTSSNE 359
Query: 357 VDGAN-----DQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXX--------XPLCSA 403
G+ ++ + +DG + PL S
Sbjct: 360 EIGSTRDADENEFLGNIDGGDLKVPEEEADSISNEDSATNSSDNEDPQVNIVEEDPLNSG 419
Query: 404 DDGSDDDPVELFDTDNVVVCQYDK 427
DD S+ D +LFDTDNV+VCQYDK
Sbjct: 420 DDVSEQDVPDLFDTDNVIVCQYDK 443
>gi|46107964|ref|XP_381040.1| hypothetical protein FG00864.1 [Gibberella zeae PH-1]
gi|42547614|gb|EAA70457.1| hypothetical protein FG00864.1 [Gibberella zeae PH-1]
Length = 256
Score = 60.8 bits (146), Expect = 6e-08
Identities = 54/201 (26%), Positives = 94/201 (46%), Gaps = 33/201 (16%)
Query: 4 PKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEA--------------KVYA 49
P+D+ I +L GVT YEP+V LL+F YR+ S VL +A K A
Sbjct: 56 PRDSRLIELLLTSQGVTAYEPRVPLLLLDFAYRHTSSVLSDALHLSGDPYVTQAGSKPSA 115
Query: 50 N------HAGRSNINVDDTKLAVMSQLDNSFTNPP-----PRDFLVDVARQKNTIALP-L 97
N AG + + + KLA+ ++L ++++ ++AR++N +ALP +
Sbjct: 116 NAGAISAPAGDAAVTANAVKLAIAARLSYQLRGGNAGGGISKEYMQELARERNKVALPKI 175
Query: 98 VKNYSAARLPPDRYCLLSVNYRLKGSADKKVSSMAKAPGGQNSVAKLYQTVIEDVINN-- 155
V + RLP +R+ L ++ LK D+ + G +++ + EDV +
Sbjct: 176 VPSEWGVRLPSERFVLSGTSWGLKDVWDEDDEEDDEMDQGGDAMEGIEGPEPEDVGGDGV 235
Query: 156 ----IREAFLDDGVDEHVLQE 172
+ + F +D VDE + +E
Sbjct: 236 EGGTVEDMFGED-VDEEMAEE 255
>gi|37526068|ref|NP_929412.1| hypothetical protein [Photorhabdus luminescens subsp. laumondii
TTO1]
gi|36785498|emb|CAE14445.1| unnamed protein product [Photorhabdus luminescens subsp. laumondii
TTO1]
Length = 247
Score = 60.8 bits (146), Expect = 6e-08
Identities = 27/83 (32%), Positives = 51/83 (61%)
Query: 287 RITQVDGANDQRITQVDGANDQRITQVEEANDQRITQVDGANDQRITQVDEANDQRITQV 346
R+T+ + D + T+V+G D R+T+ E D + T+V+G D R+T+ + D + T+V
Sbjct: 104 RLTKSENRFDSKFTEVEGKFDARLTKSENRFDSKFTEVEGKFDARLTKSENRFDSKFTEV 163
Query: 347 DGANDQRITQVDGANDQRITQVD 369
+G D R+T+ + D ++T+V+
Sbjct: 164 EGKFDARLTKSENRFDSKLTEVE 186
Score = 52.8 bits (125), Expect = 2e-05
Identities = 26/104 (25%), Positives = 55/104 (52%)
Query: 262 NQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDGANDQRITQVEEANDQRI 321
N S +Q + + +++ D + T+V+G D R+T+ + D + T+VE D R+
Sbjct: 90 NTLSKKPSEEQLNARLTKSENRFDSKFTEVEGKFDARLTKSENRFDSKFTEVEGKFDARL 149
Query: 322 TQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDGANDQRI 365
T+ + D + T+V+ D R+T+ + D ++T+V+ + +
Sbjct: 150 TKSENRFDSKFTEVEGKFDARLTKSENRFDSKLTEVESKQTESV 193
Score = 48.1 bits (113), Expect = 4e-04
Identities = 21/62 (33%), Positives = 38/62 (61%)
Query: 309 RITQVEEANDQRITQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDGANDQRITQV 368
R+T+ E D + T+V+G D R+T+ + D + T+V+G D R+T+ + D + T+V
Sbjct: 104 RLTKSENRFDSKFTEVEGKFDARLTKSENRFDSKFTEVEGKFDARLTKSENRFDSKFTEV 163
Query: 369 DG 370
+G
Sbjct: 164 EG 165
>gi|38109765|gb|EAA55584.1| hypothetical protein MG01235.4 [Magnaporthe grisea 70-15]
Length = 407
Score = 60.5 bits (145), Expect = 8e-08
Identities = 29/58 (50%), Positives = 38/58 (65%), Gaps = 3/58 (5%)
Query: 405 DGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
D SD+D EL ++C YDK+ R K +WK LK G++ +NGK+ VF KATGE EW
Sbjct: 353 DESDEDGDEL---QQQMLCLYDKVQRVKNKWKCTLKDGVLCVNGKEYVFHKATGEYEW 407
>gi|40746490|gb|EAA65646.1| hypothetical protein AN0816.2 [Aspergillus nidulans FGSC A4]
Length = 419
Score = 60.1 bits (144), Expect = 1e-07
Identities = 62/244 (25%), Positives = 87/244 (35%), Gaps = 37/244 (15%)
Query: 227 AGQQAAMSLPNSVLYQRQQHPNQQVYAPYQPGTSTNQ--QSTSGGGKQHPSVI-IQADGA 283
A Q+ M LP Q Q Q P Q Q T HPSV Q DG
Sbjct: 205 AQSQSPMGLPRPSSMQHMQQMQQMQQMQQMPNGQNPQIKQETGYHPVSHPSVSNAQTDGT 264
Query: 284 NDQRITQVDG-ANDQRITQVDGANDQRITQVEEANDQRITQVDGANDQRITQVDEANDQ- 341
+++ +R DG D+ + + Q++ +++G + +DE Q
Sbjct: 265 ASDALSEWKAEVTRRREAAQDGEGDRALRNHLK---QQMLRLEGGG--LMVPLDERESQP 319
Query: 342 ---RITQVDGANDQRITQVDGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXX 398
R D + + Q DG + D +
Sbjct: 320 KAPRFASADSSGSK--AQFDGPGGDDTKEEDDEDAINSDLDDPDDLV------------- 364
Query: 399 PLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATG 458
A D DDD V V++C YDK+ R K +WK LK GI+ GK+ VF K G
Sbjct: 365 ----AQDDEDDDAV-----GQVMLCTYDKVQRVKNKWKCTLKDGILTTGGKEYVFHKGQG 415
Query: 459 EAEW 462
E EW
Sbjct: 416 EFEW 419
>gi|45201324|ref|NP_986894.1| AGR228Cp [Eremothecium gossypii]
gi|44986178|gb|AAS54718.1| AGR228Cp [Eremothecium gossypii]
Length = 200
Score = 59.3 bits (142), Expect = 2e-07
Identities = 26/63 (41%), Positives = 40/63 (63%), Gaps = 5/63 (7%)
Query: 405 DGSDDDPVELFDTDN-----VVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGE 459
D +DD+ V + +N +++C Y+K+ R K +WK NLK G+ +N KD FQK+ GE
Sbjct: 138 DDTDDEYVNFGEEENGSDVNIMLCLYEKVLRVKNKWKCNLKDGVATINNKDYAFQKSQGE 197
Query: 460 AEW 462
+EW
Sbjct: 198 SEW 200
>gi|38109853|gb|EAA55659.1| hypothetical protein MG01310.4 [Magnaporthe grisea 70-15]
Length = 278
Score = 59.3 bits (142), Expect = 2e-07
Identities = 41/135 (30%), Positives = 67/135 (49%), Gaps = 25/135 (18%)
Query: 4 PKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEA-----KVYANHAGR---- 54
P+DA TI +L GVT +EP+V LL+F YR+ + VL +A Y HAG
Sbjct: 67 PRDARTIELLLTAQGVTAFEPRVPLLLLDFAYRHTAAVLSDALHLAGDPYTTHAGAKPSA 126
Query: 55 -------------SNINVDDTKLAVMSQLDNSFT--NPPPRDFLVDVARQKNTIALPLVK 99
++++ ++A+ S+L F ++FL++ AR++N ALP +
Sbjct: 127 AQNATASAPSGGDASVSASAIQVAIASRLAFQFRGGGSTSKEFLLETARERNKAALPRIL 186
Query: 100 NYS-AARLPPDRYCL 113
+ RLP +R+ L
Sbjct: 187 PHEWGVRLPSERFVL 201
>gi|17065294|gb|AAL32801.1| similar to TFIIA [Arabidopsis thaliana]
gi|30023680|gb|AAP13373.1| At1g07480 [Arabidopsis thaliana]
Length = 218
Score = 58.9 bits (141), Expect = 2e-07
Identities = 49/175 (28%), Positives = 72/175 (41%), Gaps = 32/175 (18%)
Query: 317 NDQRITQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDGANDQRITQVDGA----- 371
N I Q DGA D E + RIT + D+++ + ++ +I QVDG
Sbjct: 47 NGGSIPQQDGAGDAIPEANFECDAFRITSI---GDRKVPRDFFSSSSKIPQVDGPMPDPY 103
Query: 372 -------NXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADD---------GSDDDPVELF 415
N P+ +D DDD EL
Sbjct: 104 DEMLSTPNIYSYQGPSEEFNEARTPAPNEIQTSTPVAVQNDIIEDDEELLNEDDDDDELD 163
Query: 416 D--------TDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
D T ++V+ Q+DK+ R K RWK +LK GIM++N KD++F KA GE ++
Sbjct: 164 DLESGEDMNTQHLVLAQFDKVTRTKSRWKCSLKDGIMHINDKDILFNKAAGEFDF 218
>gi|1633309|pdb|1YTF|C Chain C, Yeast TfiiaTBPDNA COMPLEX
gi|38492532|pdb|1NH2|C Chain C, Crystal Structure Of A Yeast TfiiaTBPDNA COMPLEX
Length = 79
Score = 58.9 bits (141), Expect = 2e-07
Identities = 27/58 (46%), Positives = 37/58 (63%), Gaps = 5/58 (8%)
Query: 405 DGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
+G +D P E N+++C YDK+ R K RWK +LK G++ +N D FQKA EAEW
Sbjct: 26 EGEEDGPDE-----NLMLCLYDKVTRTKARWKCSLKDGVVTINRNDYTFQKAQVEAEW 78
>gi|1429226|emb|CAA67368.1| TFIIA [Arabidopsis thaliana]
Length = 375
Score = 58.2 bits (139), Expect = 4e-07
Identities = 28/60 (46%), Positives = 42/60 (70%), Gaps = 1/60 (1%)
Query: 404 DDGSDD-DPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
DD DD + E +T ++V+ Q+DK+ R K RWK +LK GIM++N KD++F KA GE ++
Sbjct: 316 DDELDDLESGEDMNTQHLVLAQFDKVTRTKSRWKCSLKDGIMHINDKDILFNKAAGEFDF 375
>gi|15237841|ref|NP_200731.1| transcription factor-related [Arabidopsis thaliana]
gi|9759244|dbj|BAB09768.1| unnamed protein product [Arabidopsis thaliana]
gi|39545874|gb|AAR28000.1| TFIIA-L3 [Arabidopsis thaliana]
Length = 186
Score = 55.8 bits (133), Expect = 2e-06
Identities = 32/107 (29%), Positives = 51/107 (47%), Gaps = 26/107 (24%)
Query: 354 ITQVDGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDD-PV 412
I Q DGA D+ I VD PL DD +DD
Sbjct: 102 IPQQDGARDEAIVDVD-------------------------ENEEPLNEDDDDEEDDIDD 136
Query: 413 ELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGE 459
+ + ++V+CQ+DK+ R+K +W+ G+M +NGK+++F +ATG+
Sbjct: 137 DDMNIQHLVMCQFDKVKRSKNKWECKFNAGVMQINGKNVLFSQATGD 183
>gi|1633308|pdb|1YTF|B Chain B, Yeast TfiiaTBPDNA COMPLEX
gi|38492531|pdb|1NH2|B Chain B, Crystal Structure Of A Yeast TfiiaTBPDNA COMPLEX
Length = 53
Score = 50.4 bits (119), Expect = 8e-05
Identities = 18/44 (40%), Positives = 33/44 (75%)
Query: 142 AKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSK 185
+++Y+ ++E V+N +RE F + G+DE LQ+LK W++KL ++K
Sbjct: 6 SRVYEIIVESVVNEVREDFENAGIDEQTLQDLKNIWQKKLTETK 49
>gi|22994332|ref|ZP_00038840.1| hypothetical protein [Xylella fastidiosa Dixon]
Length = 222
Score = 49.3 bits (116), Expect = 2e-04
Identities = 34/89 (38%), Positives = 49/89 (55%), Gaps = 8/89 (8%)
Query: 285 DQRITQVDGANDQRITQVDGANDQRITQVEEANDQRITQVDGANDQ---RITQVDEANDQ 341
DQR QVD QR ++D ++I Q E DQR QVD +Q Q+D+ DQ
Sbjct: 114 DQRFAQVD----QRFEKID-QRFEKIDQRFEKIDQRFAQVDQRFEQIAKDFAQLDKNMDQ 168
Query: 342 RITQVDGANDQRITQVDGANDQRITQVDG 370
R+ Q+D A +QR Q++ A +QR + +G
Sbjct: 169 RLGQLDKALEQRFGQLEKALEQRFAKTEG 197
>gi|29893524|gb|AAN16518.1| circumsporozoite protein [Plasmodium coatneyi]
Length = 341
Score = 49.3 bits (116), Expect = 2e-04
Identities = 32/91 (35%), Positives = 33/91 (36%)
Query: 281 DGANDQRITQVDGANDQRITQVDGANDQRITQVEEANDQRITQVDGANDQRITQVDEAND 340
DGA D DGA D DGA D + A D DGA D D A D
Sbjct: 90 DGARDGPAPAADGARDGPAPAADGARDGPAPAADGARDGPAPAADGARDGPAPAADGARD 149
Query: 341 QRITQVDGANDQRITQVDGANDQRITQVDGA 371
DGA D DGA D DGA
Sbjct: 150 GPAPAADGARDGPAPAADGARDGPAPPADGA 180
Score = 47.4 bits (111), Expect = 7e-04
Identities = 30/83 (36%), Positives = 31/83 (37%)
Query: 280 ADGANDQRITQVDGANDQRITQVDGANDQRITQVEEANDQRITQVDGANDQRITQVDEAN 339
ADGA D DGA D DGA D + A D DGA D D A
Sbjct: 100 ADGARDGPAPAADGARDGPAPAADGARDGPAPAADGARDGPAPAADGARDGPAPAADGAR 159
Query: 340 DQRITQVDGANDQRITQVDGAND 362
D DGA D DGA D
Sbjct: 160 DGPAPAADGARDGPAPPADGARD 182
>gi|24580579|ref|NP_722615.1| CG18497-PA [Drosophila melanogaster]
gi|46397733|sp|Q8SX83|SPEN_DROME Split ends protein
gi|10727421|gb|AAF51535.2| CG18497-PA [Drosophila melanogaster]
Length = 5560
Score = 48.9 bits (115), Expect = 2e-04
Identities = 45/172 (26%), Positives = 73/172 (42%), Gaps = 19/172 (11%)
Query: 205 VYANSMQGDKSQQPLVYHTMTA-AGQQAAMSLPNSVLYQRQQH-PNQQVYAPYQPGTST- 261
V+ N+ Q QQP V MTA QQ + + QRQQH QQ++ Q TS
Sbjct: 3807 VHLNAHQNQ--QQPQVIAKMTAHQHQQHMQQFMHQQMIQRQQHMQQQQLHGQSQQITSAP 3864
Query: 262 ---------NQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDGANDQRITQ 312
QQ +QH + + A Q+ Q +Q+I Q ++ Q
Sbjct: 3865 QHQMHQQHQAQQQQQHHNQQHLNQQLHAQQHPTQKQHQAQQQFNQQIQQHQSQQQHQVQQ 3924
Query: 313 VEEANDQRITQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDGANDQR 364
+A Q ++Q + Q++ Q +A Q++ Q+ Q++ Q+ G Q+
Sbjct: 3925 QNQAQQQHLSQQQHQSQQQLNQQHQAQQQQLQQI-----QKLQQMHGPQQQQ 3971
>gi|6979936|gb|AAF34661.1| split ends long isoform [Drosophila melanogaster]
Length = 5554
Score = 48.9 bits (115), Expect = 2e-04
Identities = 45/172 (26%), Positives = 73/172 (42%), Gaps = 19/172 (11%)
Query: 205 VYANSMQGDKSQQPLVYHTMTA-AGQQAAMSLPNSVLYQRQQH-PNQQVYAPYQPGTST- 261
V+ N+ Q QQP V MTA QQ + + QRQQH QQ++ Q TS
Sbjct: 3801 VHLNAHQNQ--QQPQVIAKMTAHQHQQHMQQFMHQQMIQRQQHMQQQQLHGQSQQITSAP 3858
Query: 262 ---------NQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDGANDQRITQ 312
QQ +QH + + A Q+ Q +Q+I Q ++ Q
Sbjct: 3859 QHQMHQQHQAQQQQQHHNQQHLNQQLHAQQHPTQKQHQAQQQFNQQIQQHQSQQQHQVQQ 3918
Query: 313 VEEANDQRITQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDGANDQR 364
+A Q ++Q + Q++ Q +A Q++ Q+ Q++ Q+ G Q+
Sbjct: 3919 QNQAQQQHLSQQQHQSQQQLNQQHQAQQQQLQQI-----QKLQQMHGPQQQQ 3965
>gi|24580583|ref|NP_722616.1| CG18497-PC [Drosophila melanogaster]
gi|6715140|gb|AAF26299.1| split ends [Drosophila melanogaster]
gi|22945598|gb|AAN10511.1| CG18497-PC [Drosophila melanogaster]
Length = 5476
Score = 48.9 bits (115), Expect = 2e-04
Identities = 45/172 (26%), Positives = 73/172 (42%), Gaps = 19/172 (11%)
Query: 205 VYANSMQGDKSQQPLVYHTMTA-AGQQAAMSLPNSVLYQRQQH-PNQQVYAPYQPGTST- 261
V+ N+ Q QQP V MTA QQ + + QRQQH QQ++ Q TS
Sbjct: 3750 VHLNAHQNQ--QQPQVIAKMTAHQHQQHMQQFMHQQMIQRQQHMQQQQLHGQSQQITSAP 3807
Query: 262 ---------NQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDGANDQRITQ 312
QQ +QH + + A Q+ Q +Q+I Q ++ Q
Sbjct: 3808 QHQMHQQHQAQQQQQHHNQQHLNQQLHAQQHPTQKQHQAQQQFNQQIQQHQSQQQHQVQQ 3867
Query: 313 VEEANDQRITQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDGANDQR 364
+A Q ++Q + Q++ Q +A Q++ Q+ Q++ Q+ G Q+
Sbjct: 3868 QNQAQQQHLSQQQHQSQQQLNQQHQAQQQQLQQI-----QKLQQMHGPQQQQ 3914
>gi|6467825|gb|AAF13218.1| Spen RNP motif protein long isoform [Drosophila melanogaster]
Length = 5533
Score = 48.9 bits (115), Expect = 2e-04
Identities = 45/172 (26%), Positives = 73/172 (42%), Gaps = 19/172 (11%)
Query: 205 VYANSMQGDKSQQPLVYHTMTA-AGQQAAMSLPNSVLYQRQQH-PNQQVYAPYQPGTST- 261
V+ N+ Q QQP V MTA QQ + + QRQQH QQ++ Q TS
Sbjct: 3807 VHLNAHQNQ--QQPQVIAKMTAHQHQQHMQQFMHQQMIQRQQHMQQQQLHGQSQQITSAP 3864
Query: 262 ---------NQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDGANDQRITQ 312
QQ +QH + + A Q+ Q +Q+I Q ++ Q
Sbjct: 3865 QHQMHQQHQAQQQQQHHNQQHLNQQLHAQQHPTQKQHQAQQQFNQQIQQHQSQQQHQVQQ 3924
Query: 313 VEEANDQRITQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDGANDQR 364
+A Q ++Q + Q++ Q +A Q++ Q+ Q++ Q+ G Q+
Sbjct: 3925 QNQAQQQHLSQQQHQSQQQLNQQHQAQQQQLQQI-----QKLQQMHGPQQQQ 3971
>gi|24580581|ref|NP_524718.2| CG18497-PB [Drosophila melanogaster]
gi|10727420|gb|AAF51534.2| CG18497-PB [Drosophila melanogaster]
Length = 5533
Score = 48.9 bits (115), Expect = 2e-04
Identities = 45/172 (26%), Positives = 73/172 (42%), Gaps = 19/172 (11%)
Query: 205 VYANSMQGDKSQQPLVYHTMTA-AGQQAAMSLPNSVLYQRQQH-PNQQVYAPYQPGTST- 261
V+ N+ Q QQP V MTA QQ + + QRQQH QQ++ Q TS
Sbjct: 3807 VHLNAHQNQ--QQPQVIAKMTAHQHQQHMQQFMHQQMIQRQQHMQQQQLHGQSQQITSAP 3864
Query: 262 ---------NQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDGANDQRITQ 312
QQ +QH + + A Q+ Q +Q+I Q ++ Q
Sbjct: 3865 QHQMHQQHQAQQQQQHHNQQHLNQQLHAQQHPTQKQHQAQQQFNQQIQQHQSQQQHQVQQ 3924
Query: 313 VEEANDQRITQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDGANDQR 364
+A Q ++Q + Q++ Q +A Q++ Q+ Q++ Q+ G Q+
Sbjct: 3925 QNQAQQQHLSQQQHQSQQQLNQQHQAQQQQLQQI-----QKLQQMHGPQQQQ 3971
>gi|26247178|ref|NP_753218.1| Minor curlin subunit precursor [Escherichia coli CFT073]
gi|26107579|gb|AAN79778.1| Minor curlin subunit precursor [Escherichia coli CFT073]
Length = 160
Score = 47.4 bits (111), Expect = 7e-04
Identities = 43/158 (27%), Positives = 61/158 (38%), Gaps = 10/158 (6%)
Query: 208 NSMQGDKSQQPLVYHTMT---------AAGQQAAMSLPNSVLYQRQQHPNQQVYAPYQPG 258
+ +QGD + L++ +T AAG A S N + + + Q Q G
Sbjct: 3 DQVQGDNMKNKLLFMMLTILGAPGIAAAAGYDLANSEYNFAVNELSKSSFNQAAIIGQAG 62
Query: 259 TSTNQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDGANDQRITQVEEAND 318
T+ + Q GG K +V+ Q +N +I Q N I Q AND I+Q N
Sbjct: 63 TNNSAQLRQGGSKLL-TVVAQEGSSNRAKIDQTGDYNLAYIDQAGSANDASISQGAYGNT 121
Query: 319 QRITQVDGANDQRITQVDEANDQRITQVDGANDQRITQ 356
I Q N ITQ + Q R+TQ
Sbjct: 122 AMIIQKGSGNKANITQYGTQKTAIVVQRQSQMAIRVTQ 159
>gi|24112441|ref|NP_706951.1| minor curlin subunit precursor, similar ro CsgA [Shigella flexneri
2a str. 301]
gi|24051320|gb|AAN42658.1| minor curlin subunit precursor, similar ro CsgA [Shigella flexneri
2a str. 301]
Length = 160
Score = 47.4 bits (111), Expect = 7e-04
Identities = 43/158 (27%), Positives = 61/158 (38%), Gaps = 10/158 (6%)
Query: 208 NSMQGDKSQQPLVYHTMT---------AAGQQAAMSLPNSVLYQRQQHPNQQVYAPYQPG 258
+ +QGD + L++ +T AAG A S N + + + Q Q G
Sbjct: 3 DQVQGDNMKNKLLFMMLTILGAPGIAAAAGYDLANSEYNFAVNELSKSSFNQAAIIGQAG 62
Query: 259 TSTNQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDGANDQRITQVEEAND 318
T+ + Q GG K +V+ Q +N +I Q N I Q AND I+Q N
Sbjct: 63 TNNSAQLRQGGSKLL-AVVAQEGSSNRAKIDQTGDYNLAYIDQAGSANDASISQGAYGNT 121
Query: 319 QRITQVDGANDQRITQVDEANDQRITQVDGANDQRITQ 356
I Q N ITQ + Q R+TQ
Sbjct: 122 AMIIQKGSGNKANITQYGTQKTAVVVQRQSQMAIRVTQ 159
>gi|46112495|ref|ZP_00180737.2| COG1322: Uncharacterized protein conserved in bacteria [Moorella
thermoacetica ATCC 39073]
Length = 190
Score = 47.4 bits (111), Expect = 7e-04
Identities = 28/85 (32%), Positives = 54/85 (63%)
Query: 283 ANDQRITQVDGANDQRITQVDGANDQRITQVEEANDQRITQVDGANDQRITQVDEANDQR 342
A +Q++TQ A +Q++TQ +Q++T+ E +Q++TQ A +Q++T+ E +Q+
Sbjct: 58 AVEQKLTQRIDAVEQKLTQRIDVVEQKLTRHIEEVEQKLTQRIDAVEQKLTEHIEEVEQK 117
Query: 343 ITQVDGANDQRITQVDGANDQRITQ 367
+TQ A ++ +TQ A DQ++T+
Sbjct: 118 LTQRIDAVEKTLTQGIDAVDQKLTR 142
>gi|20148954|gb|AAM12730.1| TFIIA alpha/beta-like protein ALF [Mus musculus]
Length = 41
Score = 47.4 bits (111), Expect = 7e-04
Identities = 20/37 (54%), Positives = 30/37 (81%)
Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQ 175
N V KLYQ+VIEDVI +R+ F ++G++E VL++LK+
Sbjct: 5 NLVPKLYQSVIEDVIEGVRDLFAEEGIEEQVLKDLKK 41
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
Posted date: May 11, 2004 12:59 AM
Number of letters in database: 593,787,773
Number of sequences in database: 1,798,171
Lambda K H
0.313 0.129 0.365
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 497,867,535
Number of Sequences: 1798171
Number of extensions: 19614691
Number of successful extensions: 66046
Number of sequences better than 1.0e-03: 91
Number of HSP's better than 0.0 without gapping: 77
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 65829
Number of HSP's gapped (non-prelim): 178
length of query: 462
length of database: 593,787,773
effective HSP length: 129
effective length of query: 333
effective length of database: 361,823,714
effective search space: 120487296762
effective search space used: 120487296762
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 110 (47.0 bits)