BLASTP 2.2.5 [Nov-16-2002]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= ci0100131919
         (462 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples 
           1,798,171 sequences; 593,787,773 total letters

Searching..................................................done


                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gi|28175808|gb|AAH43028.1|  TAF9 RNA polymerase II, TATA box...   170   5e-41
gi|18079237|ref|NP_081415.1|  TAF9 RNA polymerase II, TATA b...   170   5e-41
gi|4507351|ref|NP_003178.1|  TBP-associated factor 9; TAF9 R...   168   2e-40
gi|13097291|gb|AAH03400.1|  TBP-associated factor 9 [Homo sa...   168   2e-40
gi|33086648|gb|AAP92636.1|  Cb1-739 [Rattus norvegicus]           168   3e-40
gi|34576553|ref|NP_908937.1|  TAF9 RNA polymerase II, TATA b...   165   2e-39
gi|20070280|ref|NP_057059.2|  TBP-associated factor 9L; neur...   165   2e-39
gi|4689154|gb|AAD27786.1|  neuronal cell death-related prote...   165   2e-39
gi|47086725|ref|NP_997822.1|  general transcription factor I...   164   4e-39
gi|38086493|ref|XP_284729.2|  similar to TBP-associated fact...   163   8e-39
gi|30186181|gb|AAH51635.1|  Similar to TAF9-like RNA polymer...   163   8e-39
gi|19424332|ref|NP_598299.1|  TAF9-like RNA polymerase II, T...   162   2e-38
gi|42542429|gb|AAH66223.1|  Unknown (protein for MGC:76785) ...   162   2e-38
gi|26364308|dbj|BAB26216.2|  unnamed protein product [Mus mu...   159   2e-37
gi|7706735|ref|NP_056943.1|  TFIIA alpha, p55 isoform 1; gen...   156   8e-37
gi|31207633|ref|XP_312783.1|  ENSANGP00000020370 [Anopheles ...   156   1e-36
gi|34099896|gb|AAP44969.1|  transcription factor IIA large s...   154   3e-36
gi|11559984|ref|NP_071544.1|  general transcription factor I...   154   5e-36
gi|17136922|ref|NP_476996.1|  CG5930-PB [Drosophila melanoga...   149   1e-34
gi|17136920|ref|NP_476995.1|  CG5930-PA [Drosophila melanoga...   149   2e-34
gi|1711662|sp|P52654|T2AA_DROME  TRANSCRIPTION INITIATION FA...   147   5e-34
gi|34099900|gb|AAP44971.1|  transcription factor IIA large s...   145   1e-33
gi|33312518|gb|AAQ04072.1|  transcription factor IIA large s...   144   3e-33
gi|27819943|gb|AAL28886.2|  LD26511p [Drosophila melanogaster]    141   3e-32
gi|45549141|ref|NP_523391.3|  CG6474-PA [Drosophila melanoga...   141   3e-32
gi|17562510|ref|NP_504355.1|  prion-like Q/N-rich domain pro...   122   1e-26
gi|39583968|emb|CAE64058.1|  Hypothetical protein CBG08658 [...   122   2e-26
gi|15221048|ref|NP_175816.1|  transcription initiation facto...   119   1e-25
gi|42733663|gb|AAS38620.1|  similar to F55B11.3.p [Caenorhab...   118   2e-25
gi|13878225|ref|NP_113568.1|  general transcription factor I...   112   2e-23
gi|42476101|ref|NP_963889.1|  TFIIA alpha, p55 isoform 2; ge...   110   8e-23
gi|30141908|ref|NP_780544.1|  general transcription factor I...   107   4e-22
gi|19075146|ref|NP_586747.1|  TRANSCRIPTION INITIATION FACTO...   105   3e-21
gi|46102011|gb|EAK87244.1|  hypothetical protein UM06387.1 [...   104   5e-21
gi|19113805|ref|NP_592893.1|  hypothetical protein [Schizosa...   102   2e-20
gi|30017563|gb|AAP12985.1|  putative transcription initiatio...   100   5e-20
gi|24650582|ref|NP_733208.1|  CG5930-PC [Drosophila melanoga...   100   9e-20
gi|38492544|pdb|1NVP|C  Chain C, Human TfiiaTBPDNA COMPLEX        100   9e-20
gi|32394470|gb|AAM93933.1|  transcriptional activation facto...    97   1e-18
gi|34862156|ref|XP_237507.2|  similar to general transcripti...    96   1e-18
gi|26787968|ref|NP_006863.2|  TFIIA-alpha/beta-like factor i...    94   5e-18
gi|29553944|ref|NP_758515.1|  stoned B/TFIIA-alpha/beta-like...    94   5e-18
gi|33312516|gb|AAQ04071.1|  TFIIAa/b-like factor [Xenopus la...    94   5e-18
gi|46442061|gb|EAL01353.1|  hypothetical protein CaO19.10197...    94   6e-18
gi|12963759|ref|NP_076119.1|  general transcription factor I...    94   8e-18
gi|34098596|sp|Q8R4I4|T2AY_MOUSE  TFIIA-alpha and beta like ...    94   8e-18
gi|13124585|sp|Q9UNN4|T2AY_HUMAN  TFIIA-alpha and beta like ...    92   2e-17
gi|40555809|gb|AAH64585.1|  ALF protein [Homo sapiens]             92   2e-17
gi|5091669|gb|AAD39617.1|  SALF [Homo sapiens]                     92   2e-17
gi|1942936|pdb|1TAF|A  Chain A, Drosophila Tbp Associated Fa...    89   3e-16
gi|6324768|ref|NP_014837.1|  Transcription factor IIA, large...    86   2e-15
gi|46107550|ref|XP_380834.1|  hypothetical protein FG00658.1...    83   1e-14
gi|32422741|ref|XP_331814.1|  hypothetical protein [Neurospo...    82   3e-14
gi|18390773|ref|NP_563790.1|  transcription factor IIA large...    81   4e-14
gi|46097824|gb|EAK83057.1|  hypothetical protein UM05183.1 [...    80   1e-13
gi|30680148|ref|NP_850937.1|  transcription factor IIA large...    79   2e-13
gi|6323892|ref|NP_013963.1|  Subunit (17 kDa) of TFIID and S...    79   3e-13
gi|24059851|dbj|BAC21319.1|  hypothetical protein [Oryza sat...    77   1e-12
gi|45199045|ref|NP_986074.1|  AFR527Wp [Eremothecium gossypi...    75   3e-12
gi|25406976|pir||D86209  protein F22G5.18 [imported] - Arabi...    75   3e-12
gi|38492543|pdb|1NVP|B  Chain B, Human TfiiaTBPDNA COMPLEX         74   9e-12
gi|17555056|ref|NP_499814.1|  TBP-Associated transcription F...    73   1e-11
gi|46432465|gb|EAK91945.1|  hypothetical protein CaO19.8708 ...    72   2e-11
gi|46432443|gb|EAK91924.1|  hypothetical protein CaO19.1111 ...    72   2e-11
gi|40746468|gb|EAA65624.1|  hypothetical protein AN0794.2 [A...    72   3e-11
gi|39591770|emb|CAE71348.1|  Hypothetical protein CBG18251 [...    70   7e-11
gi|19112462|ref|NP_595670.1|  putative transcription initiat...    70   1e-10
gi|32420261|ref|XP_330574.1|  hypothetical protein [Neurospo...    68   5e-10
gi|26787966|ref|NP_751946.1|  TFIIA-alpha/beta-like factor i...    64   9e-09
gi|46107964|ref|XP_381040.1|  hypothetical protein FG00864.1...    61   6e-08
gi|37526068|ref|NP_929412.1|  hypothetical protein [Photorha...    61   6e-08
gi|38109765|gb|EAA55584.1|  hypothetical protein MG01235.4 [...    60   8e-08
gi|40746490|gb|EAA65646.1|  hypothetical protein AN0816.2 [A...    60   1e-07
gi|45201324|ref|NP_986894.1|  AGR228Cp [Eremothecium gossypi...    59   2e-07
gi|38109853|gb|EAA55659.1|  hypothetical protein MG01310.4 [...    59   2e-07
gi|17065294|gb|AAL32801.1|  similar to TFIIA [Arabidopsis th...    59   2e-07
gi|1633309|pdb|1YTF|C  Chain C, Yeast TfiiaTBPDNA COMPLEX >g...    59   2e-07
gi|1429226|emb|CAA67368.1|  TFIIA [Arabidopsis thaliana]           58   4e-07
gi|15237841|ref|NP_200731.1|  transcription factor-related [...    56   2e-06
gi|1633308|pdb|1YTF|B  Chain B, Yeast TfiiaTBPDNA COMPLEX >g...    50   8e-05
gi|22994332|ref|ZP_00038840.1|  hypothetical protein [Xylell...    49   2e-04
gi|29893524|gb|AAN16518.1|  circumsporozoite protein [Plasmo...    49   2e-04
gi|24580579|ref|NP_722615.1|  CG18497-PA [Drosophila melanog...    49   2e-04
gi|6979936|gb|AAF34661.1|  split ends long isoform [Drosophi...    49   2e-04
gi|24580583|ref|NP_722616.1|  CG18497-PC [Drosophila melanog...    49   2e-04
gi|6467825|gb|AAF13218.1|  Spen RNP motif protein long isofo...    49   2e-04
gi|24580581|ref|NP_524718.2|  CG18497-PB [Drosophila melanog...    49   2e-04
gi|26247178|ref|NP_753218.1|  Minor curlin subunit precursor...    47   7e-04
gi|24112441|ref|NP_706951.1|  minor curlin subunit precursor...    47   7e-04
gi|46112495|ref|ZP_00180737.2|  COG1322: Uncharacterized pro...    47   7e-04
gi|20148954|gb|AAM12730.1|  TFIIA alpha/beta-like protein AL...    47   7e-04
>gi|28175808|gb|AAH43028.1| TAF9 RNA polymerase II, TATA box binding protein (TBP)-associated
           factor [Mus musculus]
          Length = 264

 Score =  170 bits (431), Expect = 5e-41
 Identities = 77/144 (53%), Positives = 112/144 (77%), Gaps = 5/144 (3%)

Query: 1   KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
           K++PKDA+ +  +L+DMG+TEYEP+V++Q+LEF +RY++ +LD+AK+Y++HA ++ ++ D
Sbjct: 10  KSMPKDAQMMAQILKDMGITEYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKATVDAD 69

Query: 61  DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
           D +LA+  + D SFT+PPPRDFL+D+ARQ+N   LPL+K YS  RLPPDRYCL + NYRL
Sbjct: 70  DVRLAIQCRADQSFTSPPPRDFLLDIARQRNQTPLPLIKPYSGPRLPPDRYCLTAPNYRL 129

Query: 121 KGSADKKVSSMAKAPGGQNSVAKL 144
           K S  KK    A AP G+ +V +L
Sbjct: 130 K-SLQKK----APAPAGRITVPRL 148
>gi|18079237|ref|NP_081415.1| TAF9 RNA polymerase II, TATA box binding protein (TBP)-associated
           factor; TAF9 RNA polymerase II, TATA box binding protein
           (TBP)-associated factor, 32 kDa [Mus musculus]
 gi|34222930|sp|Q8VI33|TAF9_MOUSE Transcription initiation factor TFIID subunit 9 (Transcription
           initiation factor TFIID 31 kDa subunit) (TAFII-31)
           (TAFII-32) (TAFII32)
 gi|17901798|gb|AAL47693.1| RNA polymerase II TATA box binding protein-associated factor G [Mus
           musculus]
 gi|27503789|gb|AAH42723.1| TAF9 RNA polymerase II, TATA box binding protein (TBP)-associated
           factor [Mus musculus]
          Length = 264

 Score =  170 bits (431), Expect = 5e-41
 Identities = 77/144 (53%), Positives = 112/144 (77%), Gaps = 5/144 (3%)

Query: 1   KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
           K++PKDA+ +  +L+DMG+TEYEP+V++Q+LEF +RY++ +LD+AK+Y++HA ++ ++ D
Sbjct: 10  KSMPKDAQMMAQILKDMGITEYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKATVDAD 69

Query: 61  DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
           D +LA+  + D SFT+PPPRDFL+D+ARQ+N   LPL+K YS  RLPPDRYCL + NYRL
Sbjct: 70  DVRLAIQCRADQSFTSPPPRDFLLDIARQRNQTPLPLIKPYSGPRLPPDRYCLTAPNYRL 129

Query: 121 KGSADKKVSSMAKAPGGQNSVAKL 144
           K S  KK    A AP G+ +V +L
Sbjct: 130 K-SLQKK----APAPAGRITVPRL 148
>gi|4507351|ref|NP_003178.1| TBP-associated factor 9; TAF9 RNA polymerase II, TATA box binding
           protein (TBP)-associated factor, 32 kD; TATA box binding
           protein (TBP)-associated factor, RNA polymerase II, G,
           32kD; transcription initiation factor TFIID 31 kD
           subunit; adrenal gland protein AD-004 [Homo sapiens]
 gi|2498981|sp|Q16594|TAF9_HUMAN Transcription initiation factor TFIID subunit 9 (Transcription
           initiation factor TFIID 31 kDa subunit) (TAFII-31)
           (TAFII-32) (TAFII32) (STAF31/32)
 gi|2136305|pir||I39141 transcription factor TFIID 32K chain TAFII32 - human
 gi|841308|gb|AAC50153.1| TAFII32 precursor
 gi|882393|gb|AAA91318.1| TBP-associated factor TAFII31
 gi|940151|gb|AAA84389.1| TAFII31
 gi|26892203|gb|AAN84793.1| TAF9 RNA polymerase II, TATA box binding protein (TBP)-associated
           factor, 32kDa [Homo sapiens]
 gi|1098347|prf||2115400A transcription factor IID:SUBUNIT=32kD
          Length = 264

 Score =  168 bits (426), Expect = 2e-40
 Identities = 73/132 (55%), Positives = 106/132 (80%), Gaps = 1/132 (0%)

Query: 1   KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
           K++PKDA+ +  +L+DMG+TEYEP+V++Q+LEF +RY++ +LD+AK+Y++HA ++ ++ D
Sbjct: 10  KSMPKDAQMMAQILKDMGITEYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKATVDAD 69

Query: 61  DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
           D +LA+  + D SFT+PPPRDFL+D+ARQ+N   LPL+K YS  RLPPDRYCL + NYRL
Sbjct: 70  DVRLAIQCRADQSFTSPPPRDFLLDIARQRNQTPLPLIKPYSGPRLPPDRYCLTAPNYRL 129

Query: 121 KGSADKKVSSMA 132
           K S  KK S+ A
Sbjct: 130 K-SLQKKASTSA 140
>gi|13097291|gb|AAH03400.1| TBP-associated factor 9 [Homo sapiens]
          Length = 264

 Score =  168 bits (426), Expect = 2e-40
 Identities = 73/132 (55%), Positives = 106/132 (80%), Gaps = 1/132 (0%)

Query: 1   KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
           K++PKDA+ +  +L+DMG+TEYEP+V++Q+LEF +RY++ +LD+AK+Y++HA ++ ++ D
Sbjct: 10  KSMPKDAQMMAQILKDMGITEYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKATVDAD 69

Query: 61  DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
           D +LA+  + D SFT+PPPRDFL+D+ARQ+N   LPL+K YS  RLPPDRYCL + NYRL
Sbjct: 70  DVRLAIQCRADQSFTSPPPRDFLLDIARQRNQTPLPLIKPYSGPRLPPDRYCLTAPNYRL 129

Query: 121 KGSADKKVSSMA 132
           K S  KK S+ A
Sbjct: 130 K-SLQKKASTSA 140
>gi|33086648|gb|AAP92636.1| Cb1-739 [Rattus norvegicus]
          Length = 259

 Score =  168 bits (425), Expect = 3e-40
 Identities = 76/144 (52%), Positives = 110/144 (76%), Gaps = 5/144 (3%)

Query: 1   KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
           K++PKDA+ +  +L+DMG+TEYEP+V++Q+LEF +RY++ +LD+AK+Y++HA +  ++ D
Sbjct: 5   KSMPKDAQMMAQILKDMGITEYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKPTVDAD 64

Query: 61  DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
           D +LA+  + D SFT+PPPRDFL+D+ARQ+N   LPL+K YS  RLPPDRYCL + NYRL
Sbjct: 65  DVRLAIQCRADQSFTSPPPRDFLLDIARQRNQTPLPLIKPYSGPRLPPDRYCLTAPNYRL 124

Query: 121 KGSADKKVSSMAKAPGGQNSVAKL 144
           K S  KK    A  P G+ +V +L
Sbjct: 125 K-SLQKK----APTPAGRITVPRL 143
>gi|34576553|ref|NP_908937.1| TAF9 RNA polymerase II, TATA box binding protein (TBP)-associated
           factor [Rattus norvegicus]
 gi|34068042|gb|AAQ56726.1| liver regeneration related protein; LRRG210 [Rattus norvegicus]
          Length = 259

 Score =  165 bits (417), Expect = 2e-39
 Identities = 75/144 (52%), Positives = 109/144 (75%), Gaps = 5/144 (3%)

Query: 1   KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
           K++PKDA+ +  +L+DMG+TEYEP+V++Q+LEF  RY++ +LD+AK+Y++HA +  ++ D
Sbjct: 5   KSMPKDAQMMAQILKDMGITEYEPRVINQMLEFALRYVTTILDDAKIYSSHAKKPTVDAD 64

Query: 61  DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
           D +L +  + D+SFT+PPPRDFL+D+ARQ+N   LPL+K YS  RLPPDRYCL + NYRL
Sbjct: 65  DVRLPIQCRADHSFTSPPPRDFLLDIARQRNQTPLPLIKPYSGPRLPPDRYCLTAPNYRL 124

Query: 121 KGSADKKVSSMAKAPGGQNSVAKL 144
           K S  KK    A  P G+ +V +L
Sbjct: 125 K-SLQKK----APTPAGRITVPRL 143
>gi|20070280|ref|NP_057059.2| TBP-associated factor 9L; neuronal cell death-related protein;
           TAF9-like RNA polymerase II, TATA box binding protein
           (TBP)-associated factor, 31 kD; neuronal cell
           death-related gene in neuron-7; transcription associated
           factor TAFII31L; transcription initiation factor IID,
           31kD subunit [Homo sapiens]
 gi|9963821|gb|AAG09711.1| transcription associated factor TAFII31L [Homo sapiens]
 gi|14714449|gb|AAH10350.1| TBP-associated factor 9L [Homo sapiens]
 gi|16306983|gb|AAH09566.1| TBP-associated factor 9L [Homo sapiens]
          Length = 251

 Score =  165 bits (417), Expect = 2e-39
 Identities = 70/126 (55%), Positives = 99/126 (78%)

Query: 1   KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
           KN P+DA  +  +L+DMG+TEYEP+V++Q+LEF +RY++ +LD+AK+Y++HA + N++ D
Sbjct: 10  KNAPRDALVMAQILKDMGITEYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKPNVDAD 69

Query: 61  DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
           D +LA+  + D SFT+PPPRDFL+D+ARQKN   LPL+K Y+  RLPPDRYCL + NYRL
Sbjct: 70  DVRLAIQCRADQSFTSPPPRDFLLDIARQKNQTPLPLIKPYAGPRLPPDRYCLTAPNYRL 129

Query: 121 KGSADK 126
           K    K
Sbjct: 130 KSLIKK 135
>gi|4689154|gb|AAD27786.1| neuronal cell death-related protein [Homo sapiens]
          Length = 251

 Score =  165 bits (417), Expect = 2e-39
 Identities = 70/126 (55%), Positives = 99/126 (78%)

Query: 1   KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
           KN P+DA  +  +L+DMG+TEYEP+V++Q+LEF +RY++ +LD+AK+Y++HA + N++ D
Sbjct: 10  KNAPRDALVMAQILKDMGITEYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKPNVDAD 69

Query: 61  DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
           D +LA+  + D SFT+PPPRDFL+D+ARQKN   LPL+K Y+  RLPPDRYCL + NYRL
Sbjct: 70  DVRLAIQCRADQSFTSPPPRDFLLDIARQKNQTPLPLIKPYAGPRLPPDRYCLTAPNYRL 129

Query: 121 KGSADK 126
           K    K
Sbjct: 130 KSLIKK 135
>gi|47086725|ref|NP_997822.1| general transcription factor IIA, 1, 19/37kDa [Danio rerio]
 gi|29124431|gb|AAH48894.1| Wu:fc15h02 protein [Danio rerio]
          Length = 369

 Score =  164 bits (415), Expect = 4e-39
 Identities = 136/397 (34%), Positives = 173/397 (43%), Gaps = 108/397 (27%)

Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNE---LR 195
           N V KLY++V+EDVI  +RE FLD+GVDE VL ELK  WE KL QSKAV+    E   + 
Sbjct: 8   NQVPKLYRSVMEDVIAEVRELFLDEGVDEQVLMELKTLWESKLMQSKAVDGFHTEEQQVL 67

Query: 196 HAQS-----------VMPLYVYANSMQGDKSQQPLVYHT----------MTAAGQQAAMS 234
           HAQ              P  V     Q    QQ +V             M+AA   A ++
Sbjct: 68  HAQQQQAQAQQTQQQTQPQQVLLPPAQTTPQQQVIVQDAKIMQHMSATGMSAAATAATLA 127

Query: 235 LPNSVLY---------------------------QRQQHPNQQVYAPYQPG---TSTNQQ 264
           LP  V                             Q+Q    QQV    QPG       QQ
Sbjct: 128 LPTGVAPFQQLITPQGQILQVVRAANGAQYIIQPQQQMLLQQQVLPQMQPGGVQAPVIQQ 187

Query: 265 STS---GGGKQHPSVIIQAD-----GANDQRITQV---------DGANDQRITQVDGAND 307
             +   GG  Q   VIIQ       G    + TQV          G  D ++     A  
Sbjct: 188 VLTPLQGGLPQQTGVIIQPQQIVLTGNKVPQNTQVMQAAAMAVQSGPGDAQVQTQPQAQQ 247

Query: 308 QRITQVEEANDQ--RITQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDGANDQRI 365
           Q+  Q ++A+ Q   + QVDGA D    + +E  D+     D   ++     DG  D ++
Sbjct: 248 QQQQQQQQASQQPPMMLQVDGAGDTSSEEDEEEEDEYDEDEDEDKEK-----DGGEDGQV 302

Query: 366 TQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDDPVELFDTDNVVVCQY 425
            +                               PL S DD SD++  ELFDT+NVVVCQY
Sbjct: 303 EE------------------------------EPLNSGDDVSDEEDQELFDTENVVVCQY 332

Query: 426 DKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
           DKIHR+K +WKF+LK GIMNLNG+D VF KA G+AEW
Sbjct: 333 DKIHRSKNKWKFHLKDGIMNLNGRDFVFSKAIGDAEW 369
>gi|38086493|ref|XP_284729.2| similar to TBP-associated factor 9L; neuronal cell death-related
           protein; TAF9-like RNA polymerase II, TATA box binding
           protein (TBP)-associated factor, 31 kD; neuronal cell
           death-related gene in neuron-7; transcription associated
           factor TAFII31L; ... [Mus musculus]
          Length = 210

 Score =  163 bits (412), Expect = 8e-39
 Identities = 69/126 (54%), Positives = 98/126 (77%)

Query: 1   KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
           KN P+DA  +  +L+DMG+TEYEP+V++Q+LEF +RY++ +LD+AK+Y++HA +  ++ D
Sbjct: 10  KNAPRDALVMAQILKDMGITEYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKPTVDAD 69

Query: 61  DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
           D +LA+  + D SFT+PPPRDFL+D+ARQKN   LPL+K Y+  RLPPDRYCL + NYRL
Sbjct: 70  DVRLAIQCRADQSFTSPPPRDFLLDIARQKNQTPLPLIKPYAGPRLPPDRYCLTAPNYRL 129

Query: 121 KGSADK 126
           K    K
Sbjct: 130 KSLVKK 135
>gi|30186181|gb|AAH51635.1| Similar to TAF9-like RNA polymerase II, TATA box binding protein
           (TBP)-associated factor, 31 kD [Mus musculus]
          Length = 252

 Score =  163 bits (412), Expect = 8e-39
 Identities = 69/126 (54%), Positives = 98/126 (77%)

Query: 1   KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
           KN P+DA  +  +L+DMG+TEYEP+V++Q+LEF +RY++ +LD+AK+Y++HA +  ++ D
Sbjct: 13  KNAPRDALVMAQILKDMGITEYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKPTVDAD 72

Query: 61  DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
           D +LA+  + D SFT+PPPRDFL+D+ARQKN   LPL+K Y+  RLPPDRYCL + NYRL
Sbjct: 73  DVRLAIQCRADQSFTSPPPRDFLLDIARQKNQTPLPLIKPYAGPRLPPDRYCLTAPNYRL 132

Query: 121 KGSADK 126
           K    K
Sbjct: 133 KSLVKK 138
>gi|19424332|ref|NP_598299.1| TAF9-like RNA polymerase II, TATA box binding protein
           (TBP)-associated factor, 31 kD; neuronal cell
           death-related gene in neuron-7 [Rattus norvegicus]
 gi|2498982|sp|Q62880|TAF9_RAT Transcription initiation factor TFIID subunit 9 (Transcription
           initiation factor TFIID 31 kDa subunit) (TAFII-31)
           (TAFII-32) (TAFII32) (Neuronal cell death related gene
           in neuron 7) (DN-7)
 gi|7514096|pir||JC5511 TATA-binding protein-associated factor II 31 - rat
 gi|1103900|gb|AAC53201.1| induced upon programmed cell death in neuronally differentiated
           PC12 cells
          Length = 253

 Score =  162 bits (409), Expect = 2e-38
 Identities = 68/126 (53%), Positives = 98/126 (77%)

Query: 1   KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
           KN P+DA  +  +L+DMG+T+YEP+V++Q+LEF +RY++ +LD+AK+Y++HA +  ++ D
Sbjct: 5   KNAPRDALVMAQILKDMGITDYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKPTVDAD 64

Query: 61  DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
           D +LA+  + D SFT+PPPRDFL+D+ARQKN   LPL+K Y+  RLPPDRYCL + NYRL
Sbjct: 65  DVRLAIQCRADQSFTSPPPRDFLLDIARQKNQTPLPLIKPYAGPRLPPDRYCLTAPNYRL 124

Query: 121 KGSADK 126
           K    K
Sbjct: 125 KSLVKK 130
>gi|42542429|gb|AAH66223.1| Unknown (protein for MGC:76785) [Mus musculus]
          Length = 249

 Score =  162 bits (409), Expect = 2e-38
 Identities = 69/126 (54%), Positives = 97/126 (76%)

Query: 1   KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
           KN P+DA  +  +L+DMG TEYEP+V++Q+LEF +RY++ +LD+AK+Y++HA +  ++ D
Sbjct: 10  KNAPRDALVMAQILKDMGTTEYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKPTVDAD 69

Query: 61  DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
           D +LA+  + D SFT+PPPRDFL+D+ARQKN   LPL+K Y+  RLPPDRYCL + NYRL
Sbjct: 70  DVRLAIQCRADQSFTSPPPRDFLLDIARQKNQTPLPLIKPYAGPRLPPDRYCLTAPNYRL 129

Query: 121 KGSADK 126
           K    K
Sbjct: 130 KSLVKK 135
>gi|26364308|dbj|BAB26216.2| unnamed protein product [Mus musculus]
          Length = 247

 Score =  159 bits (401), Expect = 2e-37
 Identities = 72/132 (54%), Positives = 103/132 (78%), Gaps = 5/132 (3%)

Query: 13  VLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVDDTKLAVMSQLDN 72
           +L+DMG+TEYEP+V++Q+LEF +RY++ +LD+AK+Y++HA ++ ++ DD +LA+  + D 
Sbjct: 5   ILKDMGITEYEPRVINQMLEFAFRYVTTILDDAKIYSSHAKKATVDADDVRLAIQCRADQ 64

Query: 73  SFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRLKGSADKKVSSMA 132
           SFT+PPPRDFL+D+ARQ+N   LPL+K YS  RLPPDRYCL + NYRLK S  KK    A
Sbjct: 65  SFTSPPPRDFLLDIARQRNQTPLPLIKPYSGPRLPPDRYCLTAPNYRLK-SLQKK----A 119

Query: 133 KAPGGQNSVAKL 144
            AP G+ +V +L
Sbjct: 120 PAPAGRITVPRL 131
>gi|7706735|ref|NP_056943.1| TFIIA alpha, p55 isoform 1; general transcription factor IIA, 1
           (37kD and 19kD subunits); TFIIA alpha, p55 [Homo
           sapiens]
 gi|1711663|sp|P52655|T2AA_HUMAN Transcription initiation factor IIA alpha and beta chains (TFIIA
           p35 and p19 subunits) (TFIIA-42) (TFIIAL)
 gi|627655|pir||A49077 transcription initiation factor IIA alpha chain precursor - human
 gi|433500|emb|CAA53151.1| TFIIA [Homo sapiens]
 gi|452272|emb|CAA54442.1| TFIIA/alpha, p55 [Homo sapiens]
 gi|727197|dbj|BAA03604.1| 'TFIIA-42' [Homo sapiens]
 gi|6721137|gb|AAF26776.1| TFIIA-42 [Homo sapiens]
          Length = 376

 Score =  156 bits (395), Expect = 8e-37
 Identities = 135/405 (33%), Positives = 174/405 (42%), Gaps = 117/405 (28%)

Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNELR--- 195
           N+V KLY++VIEDVIN++R+ FLDDGVDE VL ELK  WE KL QS+AV+   +E +   
Sbjct: 8   NTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFHSEEQQLL 67

Query: 196 ------------------HAQSVMPLYVYANSMQGDK-----SQQP-----------LVY 221
                             H Q   P        Q  +     SQQ            L+ 
Sbjct: 68  LQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLIQ 127

Query: 222 H----TMTAAGQQAAMSLPNSV-------------------------LYQRQQHP--NQQ 250
           H     M+AA   A ++LP  V                         ++Q QQ     QQ
Sbjct: 128 HMNASNMSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQSVVLQQQ 187

Query: 251 VYAPYQPG---TSTNQQSTS---GGGKQHPSVIIQAD-----GANDQRITQVDGANDQRI 299
           V    QPG       QQ  +   GG      VIIQ       G   Q I     A     
Sbjct: 188 VIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVAAPTPAQ 247

Query: 300 TQVDGANDQRITQVEEANDQR--ITQVDGANDQRITQVDEANDQRITQVDGANDQRITQV 357
            Q+     Q+  Q + A  Q   + QVDG  D   T  +E  D+     D   + +  + 
Sbjct: 248 AQITATGQQQ-PQAQPAQTQAPLVLQVDGTGD---TSSEEDEDEEEDYDDDEEEDK--EK 301

Query: 358 DGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDDPVELFDT 417
           DGA D ++ +                               PL S DD SD++  ELFDT
Sbjct: 302 DGAEDGQVEE------------------------------EPLNSEDDVSDEEGQELFDT 331

Query: 418 DNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
           +NVVVCQYDKIHR+K +WKF+LK GIMNLNG+D +F KA G+AEW
Sbjct: 332 ENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 376
>gi|31207633|ref|XP_312783.1| ENSANGP00000020370 [Anopheles gambiae]
 gi|21296316|gb|EAA08461.1| ENSANGP00000020370 [Anopheles gambiae str. PEST]
          Length = 266

 Score =  156 bits (394), Expect = 1e-36
 Identities = 67/136 (49%), Positives = 101/136 (74%)

Query: 1   KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
           K++PKDA+ ++++L+++G T+YEP+V++QLLEFTYRY++ +LD+AK+YANHA +  I +D
Sbjct: 19  KHIPKDAQVVMSILKELGATDYEPRVINQLLEFTYRYVTCILDDAKIYANHARKKVIELD 78

Query: 61  DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
           D KLA    LD +FT PPPRD L++++R +N   LPL+K +   RLPPDRYCL + NY+L
Sbjct: 79  DVKLATQMILDKAFTRPPPRDVLLEISRVRNATPLPLIKTHCGLRLPPDRYCLSACNYKL 138

Query: 121 KGSADKKVSSMAKAPG 136
           + +   K  S +   G
Sbjct: 139 RAAQQPKRMSKSALEG 154
>gi|34099896|gb|AAP44969.1| transcription factor IIA large subunit-2 [Xenopus laevis]
          Length = 367

 Score =  154 bits (390), Expect = 3e-36
 Identities = 134/401 (33%), Positives = 172/401 (42%), Gaps = 108/401 (26%)

Query: 134 APGGQNSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNE 193
           A    N V KLY++VIEDVIN++RE FLD+GVDE VL ELK  WE KL QSKAV+   +E
Sbjct: 3   ASANANPVPKLYRSVIEDVINDVRELFLDEGVDEQVLMELKTLWESKLMQSKAVDGFHSE 62

Query: 194 -----------LRHAQS--------------------VMPLYVYANSMQGDKSQQPLVYH 222
                       +HAQ                     ++P    A   Q    +  L+ H
Sbjct: 63  EQQLLLQAQQQQQHAQHHHVQQHPQQQQAASHQTQQVLIPATHQAPQQQVLVQESKLIQH 122

Query: 223 T----MTAAGQQAAMSLPNSV-------------------------LYQRQQHP--NQQV 251
                M+AA   A ++LP  V                         L Q QQ     QQV
Sbjct: 123 MTPQGMSAAATAATLALPAGVTPVQQLITNSGQLLQVVRAANGAQYLIQPQQSMVLQQQV 182

Query: 252 YAPYQPG---TSTNQQSTS---GGGKQHPSVIIQADG----ANDQRITQVDGANDQRITQ 301
               QPG       QQ  +   GG  Q   VIIQ        N  ++     +  Q  TQ
Sbjct: 183 IPQMQPGGVQAPVIQQVLAPIPGGLSQQTGVIIQPQQILFTGNKTQVIPTTMSAPQAPTQ 242

Query: 302 VDGANDQRITQVEEANDQRITQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDGAN 361
                  +  Q  +     + QVDGA D   ++ DE  D+     +  + ++    DG +
Sbjct: 243 GQLTVTSQQQQQGQQQTSIVLQVDGAGDTS-SEEDEDEDEDYDDEEEEDKEK----DGED 297

Query: 362 DQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDDPVELFDTDNVV 421
            Q                                   PL S DD SD++  ELFDT+NVV
Sbjct: 298 GQ-------------------------------VEEEPLNSEDDVSDEEGQELFDTENVV 326

Query: 422 VCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
           VCQYDKIHR+K +WKF+LK GIMNLNG+D VF KA G+AEW
Sbjct: 327 VCQYDKIHRSKNKWKFHLKDGIMNLNGRDFVFSKAIGDAEW 367
>gi|11559984|ref|NP_071544.1| general transcription factor IIa precursor; general transcription
           factor IIa, 1 (37kD and 19kD subunits); TFIIA large
           subunit; general transcription factor 2a 1; general
           transcription factor IIa 1 (37kD and 19kD subunits)
           [Rattus norvegicus]
 gi|2149996|gb|AAB58716.1| TFIIA large subunit [Rattus norvegicus]
          Length = 377

 Score =  154 bits (388), Expect = 5e-36
 Identities = 134/406 (33%), Positives = 172/406 (42%), Gaps = 118/406 (29%)

Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAV----------- 187
           N+V KLY++VIEDVIN++R+ FLDDGVDE VL ELK  WE KL QS+AV           
Sbjct: 8   NTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFHSEEQQLL 67

Query: 188 ----------ENSSNELRHAQSVMPLYVYANSMQGDK-----SQQP-----------LVY 221
                     +   +   H Q   P        Q  +     SQQ            L+ 
Sbjct: 68  LQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLIQ 127

Query: 222 HT-----MTAAGQQAAMSLPNSV-------------------------LYQRQQHP--NQ 249
           H       +AA   A ++LP  V                         ++Q QQ     Q
Sbjct: 128 HMNASSITSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQSVVLQQ 187

Query: 250 QVYAPYQPG---TSTNQQSTS---GGGKQHPSVIIQAD-----GANDQRITQVDGANDQR 298
           QV    QPG       QQ  +   GG      VIIQ       G   Q I     A    
Sbjct: 188 QVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVAAPTPA 247

Query: 299 ITQVDGANDQRITQVEEANDQR--ITQVDGANDQRITQVDEANDQRITQVDGANDQRITQ 356
             Q+  A  Q+  Q + A  Q   + QVDG  D   T  +E  D+     D   + +  +
Sbjct: 248 QAQIPAAGQQQ-PQAQPAQQQAPLVLQVDGTGD---TSSEEDEDEEEDYDDDEEEDK--E 301

Query: 357 VDGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDDPVELFD 416
            DGA D ++ +                               PL S DD SD++  ELFD
Sbjct: 302 KDGAEDGQVEE------------------------------EPLNSEDDVSDEEGQELFD 331

Query: 417 TDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
           T+NVVVCQYDKIHR+K +WKF+LK GIMNLNG+D +F KA G+AEW
Sbjct: 332 TENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 377
>gi|17136922|ref|NP_476996.1| CG5930-PB [Drosophila melanogaster]
 gi|23172421|gb|AAN14105.1| CG5930-PB [Drosophila melanogaster]
          Length = 363

 Score =  149 bits (377), Expect = 1e-34
 Identities = 112/368 (30%), Positives = 168/368 (45%), Gaps = 52/368 (14%)

Query: 138 QNSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNE---- 193
           Q SV K+Y  VIEDVI N+R+AFLD+GVDE VLQE+KQ W  KL  SKAVE S +     
Sbjct: 5   QTSVLKVYHAVIEDVITNVRDAFLDEGVDEQVLQEMKQVWRNKLLASKAVELSPDSGDGS 64

Query: 194 --------------LRHAQSVMPLYVYANSMQGDKSQQPLVYHTMTAAGQQAAMSLPNSV 239
                          +  ++     V ++   G  S    +    ++AG  A   + N +
Sbjct: 65  HPPPIVANNPKAANAKAKKAAAATAVTSHQHIGGNSSMSSLVGLKSSAGMAAGSGIRNGL 124

Query: 240 LYQRQQHPNQQVYAPYQPGTSTN----QQSTSGGGKQHPSVIIQADGANDQRITQVD--- 292
           +  +Q+  N Q   P  P ++ +    QQ  +  G+    ++   D     RI  V+   
Sbjct: 125 VPIKQE-VNSQNPPPLHPTSAASMMQKQQQAASSGQGSIPIVATLD---PNRIMPVNITL 180

Query: 293 ------GANDQRITQVD----GANDQRITQVEEANDQRITQVDGANDQRITQVDEANDQR 342
                  +++ R+  +        + ++TQ+  A+      +        T       Q 
Sbjct: 181 PSPAGSASSESRVLTIQVPASALQENQLTQILTAH-----LISSIMSLPTTLASSVLQQH 235

Query: 343 ITQ-VDGANDQRIT----QVDGA---NDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXX 394
           +   +  AN Q+      Q+DGA   +D+  ++    N                      
Sbjct: 236 VNAALSSANHQKTLAAAKQLDGALDSSDEDESEESDDNIDNDDDDDLDKDDDEDAEHEDA 295

Query: 395 XXXXPLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQ 454
               PL S DD +D+D  E+FDTDNV+VCQYDKI R++ +WKF LK GIMN+ GKD VFQ
Sbjct: 296 AEEEPLNSEDDVTDEDSAEMFDTDNVIVCQYDKITRSRNKWKFYLKDGIMNMRGKDYVFQ 355

Query: 455 KATGEAEW 462
           K+ G+AEW
Sbjct: 356 KSNGDAEW 363
>gi|17136920|ref|NP_476995.1| CG5930-PA [Drosophila melanogaster]
 gi|16769308|gb|AAL28873.1| LD24213p [Drosophila melanogaster]
 gi|23172420|gb|AAF56687.2| CG5930-PA [Drosophila melanogaster]
          Length = 366

 Score =  149 bits (375), Expect = 2e-34
 Identities = 114/371 (30%), Positives = 169/371 (45%), Gaps = 55/371 (14%)

Query: 138 QNSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENS------- 190
           Q SV K+Y  VIEDVI N+R+AFLD+GVDE VLQE+KQ W  KL  SKAVE S       
Sbjct: 5   QTSVLKVYHAVIEDVITNVRDAFLDEGVDEQVLQEMKQVWRNKLLASKAVELSPDSGDGS 64

Query: 191 -------SNELRHA-------QSVMPLYVYANSMQGDKSQQPLVYHTMTAAGQQAAMSLP 236
                  +N   H        ++     V ++   G  S    +    ++AG  A   + 
Sbjct: 65  HPPPIVANNPKSHKAANAKAKKAAAATAVTSHQHIGGNSSMSSLVGLKSSAGMAAGSGIR 124

Query: 237 NSVLYQRQQHPNQQVYAPYQPGTSTN----QQSTSGGGKQHPSVIIQADGANDQRITQVD 292
           N ++  +Q+  N Q   P  P ++ +    QQ  +  G+    ++   D     RI  V+
Sbjct: 125 NGLVPIKQE-VNSQNPPPLHPTSAASMMQKQQQAASSGQGSIPIVATLD---PNRIMPVN 180

Query: 293 ---------GANDQRITQVD----GANDQRITQVEEANDQRITQVDGANDQRITQVDEAN 339
                     +++ R+  +        + ++TQ+  A+      +        T      
Sbjct: 181 ITLPSPAGSASSESRVLTIQVPASALQENQLTQILTAH-----LISSIMSLPTTLASSVL 235

Query: 340 DQRITQ-VDGANDQRIT----QVDGA---NDQRITQVDGANXXXXXXXXXXXXXXXXXXX 391
            Q +   +  AN Q+      Q+DGA   +D+  ++    N                   
Sbjct: 236 QQHVNAALSSANHQKTLAAAKQLDGALDSSDEDESEESDDNIDNDDDDDLDKDDDEDAEH 295

Query: 392 XXXXXXXPLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDL 451
                  PL S DD +D+D  E+FDTDNV+VCQYDKI R++ +WKF LK GIMN+ GKD 
Sbjct: 296 EDAAEEEPLNSEDDVTDEDSAEMFDTDNVIVCQYDKITRSRNKWKFYLKDGIMNMRGKDY 355

Query: 452 VFQKATGEAEW 462
           VFQK+ G+AEW
Sbjct: 356 VFQKSNGDAEW 366
>gi|1711662|sp|P52654|T2AA_DROME TRANSCRIPTION INITIATION FACTOR IIA ALPHA AND BETA CHAINS (TFIIA
           P30 AND P20 SUBUNITS) (DTFIIA-L)
 gi|542595|pir||A49076 transcription factor TFIIA-L - fruit fly (Drosophila melanogaster)
 gi|452955|gb|AAB28821.1| TFIIA-L [Drosophila sp.]
          Length = 366

 Score =  147 bits (371), Expect = 5e-34
 Identities = 115/369 (31%), Positives = 166/369 (44%), Gaps = 51/369 (13%)

Query: 138 QNSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENS------- 190
           Q SV K+Y  VIEDVI N+R+ FLD+GVDE VLQE+KQ W  KL  SKAVE S       
Sbjct: 5   QTSVLKVYHAVIEDVITNVRDGFLDEGVDEQVLQEMKQVWRNKLLASKAVELSPDSGDGS 64

Query: 191 -------SNELRHA-------QSVMPLYVYANSMQGDKSQQPLVYHTMTAAGQQAAMSLP 236
                  +N   H        ++     V ++   G  S    +    ++AG  A   + 
Sbjct: 65  HPPPIVANNPKSHKAANAKAKKAAAATAVTSHQHIGGNSSMSSLVGLKSSAGMAAGSGIR 124

Query: 237 NSVLYQRQQHPNQQVYAPYQP--GTSTNQQSTSGGGKQHPSVIIQADGANDQRITQVD-- 292
           N ++  +Q+  N Q   P  P  G S  Q+          S+ I A   +  RI  V+  
Sbjct: 125 NGLVPIKQE-VNSQNPPPLHPTSGASMMQKQQQAASSGQGSIPIVAT-LDPNRIMPVNIT 182

Query: 293 -------GANDQRITQVD----GANDQRITQVEEANDQRITQVDGANDQRITQVDEANDQ 341
                   +++ R+  +        + ++TQ+  A+      +        T       Q
Sbjct: 183 LPSPAGSASSESRVLTIQVPASALQENQLTQILTAH-----LISSIMSLPTTLASSVLQQ 237

Query: 342 RITQ-VDGANDQRIT----QVDGA---NDQRITQVDGANXXXXXXXXXXXXXXXXXXXXX 393
            +   +  AN Q+      Q+DGA   +D+  ++    N                     
Sbjct: 238 HVNAALSSANHQKTLAAAKQLDGALDSSDEDESEESDDNIDNDDDDDLDKDDDEDAEHED 297

Query: 394 XXXXXPLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVF 453
                PL S DD +D+D  E+FDTDNV+VCQYDKI R++ +WKF LK GIMN+ GKD VF
Sbjct: 298 AAEEEPLNSEDDVTDEDSAEMFDTDNVIVCQYDKITRSRNKWKFYLKDGIMNMRGKDYVF 357

Query: 454 QKATGEAEW 462
           QK+ G+AEW
Sbjct: 358 QKSNGDAEW 366
>gi|34099900|gb|AAP44971.1| transcription factor IIA large subunit-1 [Xenopus laevis]
          Length = 370

 Score =  145 bits (367), Expect = 1e-33
 Identities = 123/373 (32%), Positives = 156/373 (41%), Gaps = 51/373 (13%)

Query: 134 APGGQNSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNE 193
           A    N V KLY++VI DVIN++RE FLD+GV E VL ELK  WE KL QSKAV+    E
Sbjct: 5   ASANANPVPKLYRSVIGDVINDVREIFLDEGVGEQVLMELKTLWESKLMQSKAVDGFHTE 64

Query: 194 LR-------------------HAQSVMPLYVYANSMQGDKSQQPLVYHTMTAAGQQAAMS 234
            +                   H     P    A S Q   +QQ L+  T  A  QQ  + 
Sbjct: 65  EQQLLLQAQQQQQQQQQHSQHHRVQQHPQQQQAASHQ---TQQVLIPATHQAQQQQVLVQ 121

Query: 235 LPNSVLYQRQQHPNQQVYAPYQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRITQVDGA 294
               + +   Q  +         G +  QQ  +  G Q   V+  A+GA      Q    
Sbjct: 122 ESKMIHHMTPQGMSAATTLALPAGVTPVQQLITNSG-QLLQVVRAANGAQYLIQPQQSMV 180

Query: 295 NDQRIT---QVDGANDQRITQVEEANDQRITQVDGA------------------NDQRIT 333
             Q++    Q  G     I QV       ++Q  G                   N     
Sbjct: 181 LQQQVIPQMQQGGVQAPVIQQVLTPMPGGLSQQTGVIIQPQQILFTGNKTQVIPNTMSAP 240

Query: 334 QVDEANDQRITQVDGANDQR----ITQVDGANDQRITQVDGANXXXXXXXXXXXXXXXXX 389
           Q        +T       Q+    + QVDGA D   T  +                    
Sbjct: 241 QAPIQGQLTVTSQQQQQGQQQAPLVLQVDGAGD---TSSEEDEDEDEDYDDEEEEDKEKD 297

Query: 390 XXXXXXXXXPLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGK 449
                    PL S DD SD++  ELFDT+NVVVCQYDKIHR+K +WKF+LK GIMNLNG+
Sbjct: 298 GEDGQVEEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGR 357

Query: 450 DLVFQKATGEAEW 462
           D VF KA G+AEW
Sbjct: 358 DFVFSKAIGDAEW 370
>gi|33312518|gb|AAQ04072.1| transcription factor IIA large subunit [Xenopus laevis]
          Length = 370

 Score =  144 bits (364), Expect = 3e-33
 Identities = 122/376 (32%), Positives = 159/376 (42%), Gaps = 57/376 (15%)

Query: 134 APGGQNSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNE 193
           A    N V KLY++VI DVIN++RE FLD+GV E VL ELK  WE KL QSKAV+    E
Sbjct: 5   ASANANPVPKLYRSVIGDVINDVREIFLDEGVGEQVLMELKTLWESKLMQSKAVDGFHTE 64

Query: 194 LRHAQSVMPLYVYANSMQGDKSQQPLVYHTMTAAGQQAAMSLPNSVLYQRQQHPNQQVYA 253
            +       L + A   Q  + Q    +       QQ A S     +     H  QQ   
Sbjct: 65  EQQ------LLLQAQQQQQQQQQHSQHHRVQQHPQQQQAASHQTQQVLIPATHQAQQQQV 118

Query: 254 PYQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDGAN------- 306
             Q     +   T  G     ++ + A     Q++    G   Q +   +GA        
Sbjct: 119 LVQESKMIHHM-TPXGMSAATTLALPAGVTPVQQLITNSGQLLQVVRAANGAQYLIQPQQ 177

Query: 307 -----DQRITQVEEAN------DQRITQVDGANDQRI---------------TQV----- 335
                 Q I Q+++         Q +T + G   Q+                TQV     
Sbjct: 178 SMVLQQQVIPQMQQGGVQAPVIQQVLTPMPGGLSQQTGVIIQPQQILFTGNKTQVIPNTM 237

Query: 336 --DEANDQRITQVDGANDQR-------ITQVDGANDQRITQVDGANXXXXXXXXXXXXXX 386
              +A  Q    V     Q+       + QVDGA D   T  +                 
Sbjct: 238 SAPQAPIQGQLTVTSQQQQQGQQQAPLVLQVDGAGD---TSSEEDEDEDEDYDDEEEEDK 294

Query: 387 XXXXXXXXXXXXPLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNL 446
                       PL S DD SD++  ELFDT+NVVVCQYDKIHR+K +WKF+LK GIMNL
Sbjct: 295 EKDGEDGQVEEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNL 354

Query: 447 NGKDLVFQKATGEAEW 462
           NG+D VF KA G+AEW
Sbjct: 355 NGRDFVFSKAIGDAEW 370
>gi|27819943|gb|AAL28886.2| LD26511p [Drosophila melanogaster]
          Length = 274

 Score =  141 bits (355), Expect = 3e-32
 Identities = 66/136 (48%), Positives = 96/136 (70%)

Query: 1   KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
           K++PKDA+ I+++L+++ V EYEP+VV+QLLEFT+RY++ +LD+AKVYANHA +  I++D
Sbjct: 12  KHVPKDAQVIMSILKELNVQEYEPRVVNQLLEFTFRYVTCILDDAKVYANHARKKTIDLD 71

Query: 61  DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
           D +LA    LD SFT P  R  L  VA  +N++ LP +K +   RLPPDRYCL  VNY+L
Sbjct: 72  DVRLATEVTLDKSFTGPLERHVLAKVADVRNSMPLPPIKPHCGLRLPPDRYCLTGVNYKL 131

Query: 121 KGSADKKVSSMAKAPG 136
           + +   K  + +   G
Sbjct: 132 RATNQPKKMTKSAVEG 147
>gi|45549141|ref|NP_523391.3| CG6474-PA [Drosophila melanogaster]
 gi|2498980|sp|Q27272|TAF9_DROME Transcription initiation factor TFIID subunit 9 (Transcription
           initiation factor TFIID 42 kDa subunit) (TAFII-42)
           (TAFII40) (p42) (Enhancer of yellow 1 protein)
 gi|1079142|pir||A49067 transcription initiation factor IID chain p42 - fruit fly
           (Drosophila melanogaster)
 gi|458680|gb|AAC47347.1| transcription initiation factor TFIID 42 kDa subunit
 gi|463049|gb|AAA28488.1| TAFII40
 gi|45447037|gb|AAF48767.3| CG6474-PA [Drosophila melanogaster]
          Length = 278

 Score =  141 bits (355), Expect = 3e-32
 Identities = 66/136 (48%), Positives = 96/136 (70%)

Query: 1   KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
           K++PKDA+ I+++L+++ V EYEP+VV+QLLEFT+RY++ +LD+AKVYANHA +  I++D
Sbjct: 16  KHVPKDAQVIMSILKELNVQEYEPRVVNQLLEFTFRYVTCILDDAKVYANHARKKTIDLD 75

Query: 61  DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
           D +LA    LD SFT P  R  L  VA  +N++ LP +K +   RLPPDRYCL  VNY+L
Sbjct: 76  DVRLATEVTLDKSFTGPLERHVLAKVADVRNSMPLPPIKPHCGLRLPPDRYCLTGVNYKL 135

Query: 121 KGSADKKVSSMAKAPG 136
           + +   K  + +   G
Sbjct: 136 RATNQPKKMTKSAVEG 151
>gi|17562510|ref|NP_504355.1| prion-like Q/N-rich domain protein PQN-51, Prion-like Q/N-rich
           domain protein (pqn-51) [Caenorhabditis elegans]
 gi|7505765|pir||T32660 hypothetical protein K11D12.2 - Caenorhabditis elegans
 gi|2736452|gb|AAB94230.1| Prion-like-(q/n-rich)-domain-bearing protein protein 51
           [Caenorhabditis elegans]
          Length = 354

 Score =  122 bits (307), Expect = 1e-26
 Identities = 102/356 (28%), Positives = 164/356 (46%), Gaps = 38/356 (10%)

Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNEL---- 194
           N +A+LY++V+ DVI N++EAFLD+ +D  VL +L++ WE K+  S  V+  SN      
Sbjct: 5   NGIAELYKSVMADVIANMKEAFLDENIDVDVLSQLRKEWEDKVNSSGCVDLESNAPPPAP 64

Query: 195 RHAQSVMPLYVYANSM--QGDKSQQPLVYHTMTAAGQQAAMSLPNSVLYQRQQHPNQ-QV 251
           R    V P  V  N M  Q   +QQP+   +   A       +  +   Q  QHP+Q ++
Sbjct: 65  RQQHHVPPSAVRPNPMPPQRPVAQQPVRALSALHAAHIGDAPIRMAYTGQPTQHPSQVRM 124

Query: 252 YAP-------YQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDG 304
           + P       +QPG     Q  +G   Q+  + I  +     RI  +     Q+  Q   
Sbjct: 125 FNPQFQGGIHFQPGQVFVVQQPNG---QNIPMSIMPNQIPQHRI--IHQGQPQQAQQQQP 179

Query: 305 ANDQRITQVEEANDQRITQVDGANDQRITQVDEANDQRITQV--DGANDQRITQVDGAND 362
               ++T + + +    ++ DG       ++     +   +V   GA++++  +V G+  
Sbjct: 180 QQGNQLTHMNQMDGNVGSESDGEGPSGPVKLAPKKTKCSLRVRGSGASEKKAMKVLGSLL 239

Query: 363 QRITQVDGANXXXXXXXXXXX---------------XXXXXXXXXXXXXXXPLCSADDGS 407
           +   Q+DG                                           PL S DD S
Sbjct: 240 KDF-QLDGGGGGMSDSSSEDEPDDDDDPLRRIADRMGNGEVEDGDQVAEEEPLNSEDDQS 298

Query: 408 DDDPVE-LFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
           DD+ +  LF+ DNVV+CQ++K++RA+ +WKF LK GIM+++ KD  FQK TGEAEW
Sbjct: 299 DDEDLTMLFEADNVVMCQFEKVNRARTKWKFQLKDGIMHIDKKDYCFQKCTGEAEW 354
>gi|39583968|emb|CAE64058.1| Hypothetical protein CBG08658 [Caenorhabditis briggsae]
          Length = 353

 Score =  122 bits (305), Expect = 2e-26
 Identities = 107/358 (29%), Positives = 152/358 (42%), Gaps = 43/358 (12%)

Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNELRHAQ 198
           N +A +Y++VI DVI+N+REAFLD+ +D  VL +LK+ WE K+  S  V+  SN    A 
Sbjct: 5   NGIADIYKSVIADVISNMREAFLDENIDVDVLAQLKKEWEDKVNSSGCVDLESNAAPPAP 64

Query: 199 SVMPLYVYANSMQGD----------KSQQPLVYHTMTA----------AGQQAAMSLPNS 238
                +V  N ++ +          + QQP    T+ A          A  Q     P  
Sbjct: 65  RQQQPHVAPNPVRPNPIPQRVNPVQQQQQPRT--TLAALHMGDTPIRMAYTQPGQHQPQQ 122

Query: 239 VLYQRQQHPNQ------QVYAPYQPG-----TSTNQ-QSTSGGGKQHPSVIIQADGANDQ 286
           V    QQ  NQ      Q     QPG        NQ Q       Q   +  Q       
Sbjct: 123 VRMFPQQFQNQLQFPAGQFVVVQQPGGVPMSLMPNQLQHRLMPQVQQQQLQQQPQQQQSN 182

Query: 287 RITQVDGANDQRITQVDGANDQRITQVEEANDQRITQVDGANDQRITQVDEANDQ-RITQ 345
           ++T ++  +    ++ DG       +V  A  +  T+V G    +   +       R  Q
Sbjct: 183 QLTHMNQMDGNGGSESDGEGCSEPLKVRRARAKASTKVRGTGASKKEAMKVLGSMLRDIQ 242

Query: 346 VDGANDQRITQVDGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADD 405
           +DG         D ++D  +   D                             PL S DD
Sbjct: 243 LDGGGG---GMSDSSSDDEVEDDDDP----LRRIADRMGNGEVEDGDQVAEEEPLNSEDD 295

Query: 406 GSDD-DPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
            SDD D   LFD DNVV+CQ++K++RA+ +WKF LK GIM+++ KD  FQK TGEAEW
Sbjct: 296 QSDDEDLTMLFDADNVVMCQFEKVNRARTKWKFQLKDGIMHIDKKDYCFQKCTGEAEW 353
>gi|15221048|ref|NP_175816.1| transcription initiation factor IID (TFIID) 31 kDa subunit
           (TAFII-31) family protein [Arabidopsis thaliana]
 gi|25405699|pir||E96582 hypothetical protein F15I1.24 [imported] - Arabidopsis thaliana
 gi|4587557|gb|AAD25788.1| Similar to gb|U21858 transcription initiation factor TFIID 31KD
           subunit (TAFII32) from Homo sapiens. [Arabidopsis
           thaliana]
 gi|7638157|gb|AAF65406.1| putative TATA binding protein associated factor 21kDa subunit
           [Arabidopsis thaliana]
 gi|21593471|gb|AAM65438.1| transcriptional activation factor TAFII32, putative [Arabidopsis
           thaliana]
 gi|39545926|gb|AAR28026.1| TAF9 [Arabidopsis thaliana]
          Length = 183

 Score =  119 bits (299), Expect = 1e-25
 Identities = 55/120 (45%), Positives = 86/120 (71%)

Query: 1   KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVD 60
           +++P+DA+ + ++L+ MGV +YEP+V+HQ LE  YRY+ EVL +A+VY+ HA + NI+ D
Sbjct: 7   EDVPRDAKIVKSLLKSMGVEDYEPRVIHQFLELWYRYVVEVLTDAQVYSEHASKPNIDCD 66

Query: 61  DTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRL 120
           D KLA+ S+++ SF+ PPPR+ L+++A  +N I LP         LPP++  LLS NY+L
Sbjct: 67  DVKLAIQSKVNFSFSQPPPREVLLELAASRNKIPLPKSIAGPGVPLPPEQDTLLSPNYQL 126
>gi|42733663|gb|AAS38620.1| similar to F55B11.3.p [Caenorhabditis elegans] [Dictyostelium
           discoideum]
          Length = 619

 Score =  118 bits (296), Expect = 2e-25
 Identities = 49/116 (42%), Positives = 81/116 (69%)

Query: 4   PKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVDDTK 63
           P+D+  I ++L+ MG+  +EP+VV+QLLEF Y+Y+ EVL +A +Y  H+G+++I+V D +
Sbjct: 213 PRDSRVIKSILKTMGIQAHEPRVVNQLLEFMYKYVFEVLQDAMIYGEHSGKTDIDVSDVR 272

Query: 64  LAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYR 119
           L++ S+++   + PPPR+ L+ +A +KN   LP + N     LPPD YCL + NY+
Sbjct: 273 LSIQSRVNYQISTPPPRELLLSIAEEKNKTPLPPIPNRYGVYLPPDEYCLTNPNYQ 328
>gi|13878225|ref|NP_113568.1| general transcription factor II A, 1 isoform 1; general
           transcription factor IIa, 1 (37kD and 19kD subunits);
           TFIIA alpha/beta [Mus musculus]
 gi|12313735|gb|AAG50431.1| TFIIA alpha/beta [Mus musculus]
          Length = 378

 Score =  112 bits (280), Expect = 2e-23
 Identities = 117/407 (28%), Positives = 159/407 (39%), Gaps = 119/407 (29%)

Query: 139 NSVAKLYQTVIEDVINNIREAFLD-------------------------DGVDEHVLQEL 173
           N+V KLY++VIEDVIN++R+ FLD                         DG      Q L
Sbjct: 8   NTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFHSEEQQLL 67

Query: 174 KQTWEQKLAQSKAVENSSNELRHAQS-------------VMPLYVYANSMQGDKSQQPLV 220
            Q  +Q   Q +   +  ++ + AQ              ++P    A + Q       L+
Sbjct: 68  LQVQQQHQPQQQQHHHHHHQHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIGPDSKLL 127

Query: 221 YHT-----MTAAGQQAAMSLPNSVLYQRQQHPN--------------------------- 248
            H       +AA   A ++LP  V   +Q   N                           
Sbjct: 128 QHMNASSITSAAATAATLALPAGVTPVQQLLTNSGQLLQVVRAANGAQYILQPQQSVVLQ 187

Query: 249 QQVYAPYQPG---TSTNQQSTS---GGGKQHPSVIIQAD-----GANDQRITQVDGANDQ 297
           QQV    QPG       QQ  +   GG      VIIQ       G   Q I     A   
Sbjct: 188 QQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVAAPAP 247

Query: 298 RITQVDGANDQRITQVEEANDQR--ITQVDGANDQRITQVDEANDQRITQVDGANDQRIT 355
               +  A  Q+  Q + A  Q   + QVDG  D   T  +E  D+     D   + +  
Sbjct: 248 AQAPMPAAGQQQ-PQAQPAQQQAPLVLQVDGTGD---TSSEEDEDEEEDYDDDEEEDK-- 301

Query: 356 QVDGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDDPVELF 415
           + DGA D ++ +                               PL S DD S ++  ELF
Sbjct: 302 EKDGAEDGQVEE------------------------------EPLNSEDDVSGEEGQELF 331

Query: 416 DTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
           D +NVVVCQYDKIHR+K +WKF+LK GIMNLNG+D +F KA G+AEW
Sbjct: 332 DAENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 378
>gi|42476101|ref|NP_963889.1| TFIIA alpha, p55 isoform 2; general transcription factor IIA, 1
           (37kD and 19kD subunits); TFIIA alpha, p55 [Homo
           sapiens]
 gi|433501|emb|CAA53152.1| TFIIA [Homo sapiens]
 gi|727195|dbj|BAA03603.1| TFIIA-37 [Homo sapiens]
          Length = 337

 Score =  110 bits (274), Expect = 8e-23
 Identities = 94/292 (32%), Positives = 120/292 (41%), Gaps = 44/292 (15%)

Query: 178 EQKLAQSKAVENSSNELRHAQSVMPLYVYANSMQGDKSQQPLVYHTMTAAGQQAAMSLPN 237
           + KL Q     N S     A   +P  V         S Q L+     A G Q       
Sbjct: 83  DSKLIQHMNASNMSAAATAATLALPAGVTPVQQILTNSGQ-LLQVVRAANGAQYIFQPQQ 141

Query: 238 SVLYQRQQHPNQQVYAPYQPGTSTNQQSTSGGGKQHPSVIIQAD-----GANDQRITQVD 292
           SV+ Q+Q  P  Q      P          GG      VIIQ       G   Q I    
Sbjct: 142 SVVLQQQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTV 201

Query: 293 GANDQRITQVDGANDQRITQVEEANDQR--ITQVDGANDQRITQVDEANDQRITQVDGAN 350
            A      Q+     Q+  Q + A  Q   + QVDG  D   T  +E  D+     D   
Sbjct: 202 AAPTPAQAQITATGQQQ-PQAQPAQTQAPLVLQVDGTGD---TSSEEDEDEEEDYDDDEE 257

Query: 351 DQRITQVDGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDD 410
           + +  + DGA D ++ +                               PL S DD SD++
Sbjct: 258 EDK--EKDGAEDGQVEE------------------------------EPLNSEDDVSDEE 285

Query: 411 PVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
             ELFDT+NVVVCQYDKIHR+K +WKF+LK GIMNLNG+D +F KA G+AEW
Sbjct: 286 GQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 337
>gi|30141908|ref|NP_780544.1| general transcription factor II A, 1 isoform 2; general
           transcription factor IIa, 1 (37kD and 19kD subunits);
           TFIIA alpha/beta [Mus musculus]
 gi|26339518|dbj|BAC33430.1| unnamed protein product [Mus musculus]
          Length = 339

 Score =  107 bits (268), Expect = 4e-22
 Identities = 83/244 (34%), Positives = 107/244 (43%), Gaps = 43/244 (17%)

Query: 226 AAGQQAAMSLPNSVLYQRQQHPNQQVYAPYQPGTSTNQQSTSGGGKQHPSVIIQAD---- 281
           A G Q  +    SV+ Q+Q  P  Q      P          GG      VIIQ      
Sbjct: 132 ANGAQYILQPQQSVVLQQQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILF 191

Query: 282 -GANDQRITQVDGANDQRITQVDGANDQRITQVEEANDQR--ITQVDGANDQRITQVDEA 338
            G   Q I     A       +  A  Q+  Q + A  Q   + QVDG  D   T  +E 
Sbjct: 192 TGNKTQVIPTTVAAPAPAQAPMPAAGQQQ-PQAQPAQQQAPLVLQVDGTGD---TSSEED 247

Query: 339 NDQRITQVDGANDQRITQVDGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXX 398
            D+     D   + +  + DGA D ++ +                               
Sbjct: 248 EDEEEDYDDDEEEDK--EKDGAEDGQVEE------------------------------E 275

Query: 399 PLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATG 458
           PL S DD SD++  ELFDT+NVVVCQYDKIHR+K +WKF+LK GIMNLNG+D +F KA G
Sbjct: 276 PLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIG 335

Query: 459 EAEW 462
           +AEW
Sbjct: 336 DAEW 339
>gi|19075146|ref|NP_586747.1| TRANSCRIPTION INITIATION FACTOR OF THE TFIID FAMILY
           [Encephalitozoon cuniculi]
 gi|19068538|emb|CAD25006.1| TRANSCRIPTION INITIATION FACTOR OF THE TFIID FAMILY
           [Encephalitozoon cuniculi]
          Length = 137

 Score =  105 bits (261), Expect = 3e-21
 Identities = 49/125 (39%), Positives = 81/125 (64%), Gaps = 1/125 (0%)

Query: 4   PKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVDDTK 63
           P+DA+ I  +L+ +G+ E EPKV+ QLLEF YRY ++VL++A ++A H GR++I   D K
Sbjct: 9   PRDAKVISVILRSLGIEECEPKVIIQLLEFAYRYTTDVLEDALLFAKHTGRTHITTSDVK 68

Query: 64  LAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYR-LKG 122
           LA+ +++   F  PPPR +L +++   N+  L +    +  R+PP    LL+++Y  L+ 
Sbjct: 69  LALQTKVGRHFVPPPPRQYLSEISTMVNSKPLTIPDGENLIRVPPSSSALLNLDYEVLRK 128

Query: 123 SADKK 127
            +DKK
Sbjct: 129 DSDKK 133
>gi|46102011|gb|EAK87244.1| hypothetical protein UM06387.1 [Ustilago maydis 521]
          Length = 273

 Score =  104 bits (259), Expect = 5e-21
 Identities = 51/122 (41%), Positives = 79/122 (64%), Gaps = 4/122 (3%)

Query: 3   LPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGR----SNIN 58
           +P+DA  I  +L  MGV++ EP V+ QLLEF +RY  +VL +A VY++HA      SN++
Sbjct: 25  VPRDARLIALILSSMGVSDVEPAVLLQLLEFAHRYTYDVLSDALVYSDHANSRQAGSNLS 84

Query: 59  VDDTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNY 118
           +DD  LA+ S+++ SFT PP +D L+ +A   N I LP + +    RLP  ++CL +VN+
Sbjct: 85  LDDVNLAIQSRVNYSFTKPPEKDMLLALASTVNAIPLPPISDRHGVRLPAAQHCLTNVNF 144

Query: 119 RL 120
            +
Sbjct: 145 SI 146
>gi|19113805|ref|NP_592893.1| hypothetical protein [Schizosaccharomyces pombe]
 gi|1351629|sp|Q09869|YAG5_SCHPO Hypothetical protein C12G12.05c in chromosome I
 gi|2130252|pir||S62536 hypothetical protein SPAC12G12.05c - fission yeast
           (Schizosaccharomyces pombe)
 gi|1052523|emb|CAA91500.1| SPAC12G12.05c [Schizosaccharomyces pombe]
          Length = 163

 Score =  102 bits (254), Expect = 2e-20
 Identities = 48/117 (41%), Positives = 76/117 (64%), Gaps = 2/117 (1%)

Query: 4   PKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSN--INVDD 61
           PKD   I  +L  +GV  Y   V  QLL F +RY  +++ +++VYA H+   N  I+V+D
Sbjct: 14  PKDVRLIHLILSSLGVPSYSQTVPLQLLTFAHRYTQQLIQDSQVYAEHSRGQNAPISVED 73

Query: 62  TKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNY 118
            +LAV SQ+++SFT PPP++FL+++A ++N   LP ++     RLPP++YCL   N+
Sbjct: 74  VRLAVASQINHSFTGPPPKEFLLELAMERNRKPLPQIQPSYGFRLPPEKYCLTQPNW 130
>gi|30017563|gb|AAP12985.1| putative transcription initiation factor [Oryza sativa (japonica
           cultivar-group)]
          Length = 250

 Score =  100 bits (250), Expect = 5e-20
 Identities = 55/146 (37%), Positives = 86/146 (58%), Gaps = 29/146 (19%)

Query: 4   PKDAETIIAVLQDMGVTE--YEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVDD 61
           P+DA  +  +L+ MG++E  YEP+VVHQ L+  YRY+ +VL +A+VYA+HAG+  ++ DD
Sbjct: 34  PRDARVVRELLRSMGLSEGEYEPRVVHQFLDLAYRYVGDVLGDAQVYADHAGKPQLDADD 93

Query: 62  TKLAVMSQLDNSFTNPPPRD--------------------------FLVDVARQKNTIAL 95
            +LA+ S+++ SF+ PPPR+                           L++VAR +N I L
Sbjct: 94  VRLAIQSKVNFSFSQPPPRECSEFFHSDQDFRSRSLPSDNPLFFSMVLLEVARNRNKIPL 153

Query: 96  P-LVKNYSAARLPPDRYCLLSVNYRL 120
           P  +    +  LPP++  LLS NY+L
Sbjct: 154 PKSIAPPGSIPLPPEQDTLLSQNYQL 179
>gi|24650582|ref|NP_733208.1| CG5930-PC [Drosophila melanogaster]
 gi|23172422|gb|AAN14106.1| CG5930-PC [Drosophila melanogaster]
          Length = 324

 Score =  100 bits (248), Expect = 9e-20
 Identities = 87/333 (26%), Positives = 139/333 (41%), Gaps = 52/333 (15%)

Query: 173 LKQTWEQKLAQSKAVENSSNE------------------LRHAQSVMPLYVYANSMQGDK 214
           +KQ W  KL  SKAVE S +                    +  ++     V ++   G  
Sbjct: 1   MKQVWRNKLLASKAVELSPDSGDGSHPPPIVANNPKAANAKAKKAAAATAVTSHQHIGGN 60

Query: 215 SQQPLVYHTMTAAGQQAAMSLPNSVLYQRQQHPNQQVYAPYQPGTSTN----QQSTSGGG 270
           S    +    ++AG  A   + N ++  +Q+  N Q   P  P ++ +    QQ  +  G
Sbjct: 61  SSMSSLVGLKSSAGMAAGSGIRNGLVPIKQE-VNSQNPPPLHPTSAASMMQKQQQAASSG 119

Query: 271 KQHPSVIIQADGANDQRITQVD---------GANDQRITQVD----GANDQRITQVEEAN 317
           +    ++   D     RI  V+          +++ R+  +        + ++TQ+  A+
Sbjct: 120 QGSIPIVATLD---PNRIMPVNITLPSPAGSASSESRVLTIQVPASALQENQLTQILTAH 176

Query: 318 DQRITQVDGANDQRITQVDEANDQRITQ-VDGANDQRIT----QVDGA---NDQRITQVD 369
                 +        T       Q +   +  AN Q+      Q+DGA   +D+  ++  
Sbjct: 177 -----LISSIMSLPTTLASSVLQQHVNAALSSANHQKTLAAAKQLDGALDSSDEDESEES 231

Query: 370 GANXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDDPVELFDTDNVVVCQYDKIH 429
             N                          PL S DD +D+D  E+FDTDNV+VCQYDKI 
Sbjct: 232 DDNIDNDDDDDLDKDDDEDAEHEDAAEEEPLNSEDDVTDEDSAEMFDTDNVIVCQYDKIT 291

Query: 430 RAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
           R++ +WKF LK GIMN+ GKD VFQK+ G+AEW
Sbjct: 292 RSRNKWKFYLKDGIMNMRGKDYVFQKSNGDAEW 324
>gi|38492544|pdb|1NVP|C Chain C, Human TfiiaTBPDNA COMPLEX
          Length = 76

 Score =  100 bits (248), Expect = 9e-20
 Identities = 45/64 (70%), Positives = 54/64 (84%)

Query: 399 PLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATG 458
           PL S DD SD++  ELFDT+NVVVCQYDKIHR+K +WKF+LK GIMNLNG+D +F KA G
Sbjct: 13  PLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIG 72

Query: 459 EAEW 462
           +AEW
Sbjct: 73  DAEW 76
>gi|32394470|gb|AAM93933.1| transcriptional activation factor TAFII32 [Griffithsia japonica]
          Length = 180

 Score = 96.7 bits (239), Expect = 1e-18
 Identities = 50/128 (39%), Positives = 79/128 (61%), Gaps = 2/128 (1%)

Query: 17  MGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVDDTKLAVMSQLDNSFTN 76
           MGV  Y+P+V HQLLE  YR++S VL +A+V  +HA +  I+ DD KLA+ S+   +FT 
Sbjct: 27  MGVDSYDPRVTHQLLELLYRHVSTVLLDARVCCDHADKPAIDADDVKLALRSRNTFTFTQ 86

Query: 77  PPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRLKGSADK--KVSSMAKA 134
           PPPRD  + +A ++N + LP +   +   LPP+   L   NYR+   +++  K S ++++
Sbjct: 87  PPPRDVTMRLAAERNAVPLPPIDARAGVALPPELGQLTRQNYRVMTQSNRPAKRSRLSQS 146

Query: 135 PGGQNSVA 142
           P  + S A
Sbjct: 147 PAARASYA 154
>gi|34862156|ref|XP_237507.2| similar to general transcription factor II A, 1-like factor;
           TfIIa-alpha/beta-like factor; general transcription
           factor IIA, 1-like factor [Rattus norvegicus]
          Length = 791

 Score = 96.3 bits (238), Expect = 1e-18
 Identities = 58/184 (31%), Positives = 93/184 (50%), Gaps = 22/184 (11%)

Query: 286 QRITQVDGANDQRITQVDGANDQRIT-QVEEANDQRITQVDGANDQRITQ----VDEAND 340
           +++ +++  +  R+++    +++ +  QV + +   I Q+DG  D    +    + +A++
Sbjct: 623 KQLRKIEEPSSLRVSEKSCTSERDLNIQVTDDDINEIIQIDGTGDNSSAEEGGSIRDADE 682

Query: 341 QRITQVDGANDQRITQ--VDGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXX 398
                +  A D ++ +  VD  +++  T     N                          
Sbjct: 683 NEFPGIIEAGDLKVLEEEVDSVSNEDSTANSSDNEDHQIDALEED--------------- 727

Query: 399 PLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATG 458
           PL S DD S+ D  +LFDTDNV+VCQYDKIHR+K RWKF LK G+M   G+D VF KA G
Sbjct: 728 PLNSGDDVSEQDAPDLFDTDNVIVCQYDKIHRSKNRWKFYLKDGVMCFGGRDYVFAKAIG 787

Query: 459 EAEW 462
           EAEW
Sbjct: 788 EAEW 791

 Score = 59.7 bits (143), Expect = 1e-07
 Identities = 25/47 (53%), Positives = 38/47 (80%)

Query: 143 KLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVEN 189
           KLY++VIEDVI  +R+ F ++G++E VL++LK+ WE K+ QSKA E+
Sbjct: 50  KLYRSVIEDVIEGVRDLFAEEGIEEQVLKDLKKLWETKVLQSKATED 96
>gi|26787968|ref|NP_006863.2| TFIIA-alpha/beta-like factor isoform 1; TFIIA large subunit isoform
           ALF; GTF2A1-like factor [Homo sapiens]
 gi|19684124|gb|AAH25991.1| TFIIA-alpha/beta-like factor, isoform 1 [Homo sapiens]
          Length = 478

 Score = 94.4 bits (233), Expect = 5e-18
 Identities = 69/239 (28%), Positives = 107/239 (44%), Gaps = 35/239 (14%)

Query: 251 VYAPYQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRI-TQVDGANDQRITQV------- 302
           V++P    T++N +S   G          A   +D+ + T   GA  Q +T +       
Sbjct: 248 VFSPQVSQTNSNVESVLSGSAS------MAQNLHDESLSTSPHGALHQHVTDIQLHILKN 301

Query: 303 -----DGANDQRITQVEEANDQRITQVDGANDQRIT-QVDEANDQRITQVDGANDQRITQ 356
                D     R   +EE ++  +++ D  +   ++ +V + +   I QVDG+ D    +
Sbjct: 302 RMYGCDSVKQPR--NIEEPSNIPVSEKDSNSQVDLSIRVTDDDIGEIIQVDGSGDTSSNE 359

Query: 357 VDGAN-----DQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXX--------XPLCSA 403
             G+      ++ +  +DG +                                  PL S 
Sbjct: 360 EIGSTRDADENEFLGNIDGGDLKVPEEEADSISNEDSATNSSDNEDPQVNIVEEDPLNSG 419

Query: 404 DDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
           DD S+ D  +LFDTDNV+VCQYDKIHR+K +WKF LK G+M   G+D VF KA G+AEW
Sbjct: 420 DDVSEQDVPDLFDTDNVIVCQYDKIHRSKNKWKFYLKDGVMCFGGRDYVFAKAIGDAEW 478

 Score = 63.5 bits (153), Expect = 9e-09
 Identities = 28/51 (54%), Positives = 39/51 (76%)

Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVEN 189
           N V KLY++VIEDVI  +R  F ++G++E VL++LKQ WE K+ QSKA E+
Sbjct: 5   NPVPKLYRSVIEDVIEGVRNLFAEEGIEEQVLKDLKQLWETKVLQSKATED 55
>gi|29553944|ref|NP_758515.1| stoned B/TFIIA-alpha/beta-like factor [Homo sapiens]
          Length = 1182

 Score = 94.4 bits (233), Expect = 5e-18
 Identities = 69/239 (28%), Positives = 107/239 (44%), Gaps = 35/239 (14%)

Query: 251  VYAPYQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRI-TQVDGANDQRITQV------- 302
            V++P    T++N +S   G          A   +D+ + T   GA  Q +T +       
Sbjct: 952  VFSPQVSQTNSNVESVLSGSAS------MAQNLHDESLSTSPHGALHQHVTDIQLHILKN 1005

Query: 303  -----DGANDQRITQVEEANDQRITQVDGANDQRIT-QVDEANDQRITQVDGANDQRITQ 356
                 D     R   +EE ++  +++ D  +   ++ +V + +   I QVDG+ D    +
Sbjct: 1006 RMYGCDSVKQPR--NIEEPSNIPVSEKDSNSQVDLSIRVTDDDIGEIIQVDGSGDTSSNE 1063

Query: 357  VDGAN-----DQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXX--------XPLCSA 403
              G+      ++ +  +DG +                                  PL S 
Sbjct: 1064 EIGSTRDADENEFLGNIDGGDLKVPEEEADSISNEDSATNSSDNEDPQVNIVEEDPLNSG 1123

Query: 404  DDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
            DD S+ D  +LFDTDNV+VCQYDKIHR+K +WKF LK G+M   G+D VF KA G+AEW
Sbjct: 1124 DDVSEQDVPDLFDTDNVIVCQYDKIHRSKNKWKFYLKDGVMCFGGRDYVFAKAIGDAEW 1182

 Score = 60.8 bits (146), Expect = 6e-08
 Identities = 27/51 (52%), Positives = 38/51 (74%)

Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVEN 189
           N   KLY++VIEDVI  +R  F ++G++E VL++LKQ WE K+ QSKA E+
Sbjct: 709 NIQPKLYRSVIEDVIEGVRNLFAEEGIEEQVLKDLKQLWETKVLQSKATED 759
>gi|33312516|gb|AAQ04071.1| TFIIAa/b-like factor [Xenopus laevis]
 gi|34099894|gb|AAP44968.1| transcription factor ALF [Xenopus laevis]
          Length = 472

 Score = 94.4 bits (233), Expect = 5e-18
 Identities = 57/166 (34%), Positives = 80/166 (48%), Gaps = 36/166 (21%)

Query: 299 ITQVDGANDQRITQVEEANDQRITQVDGANDQRITQVDEANDQRITQVD--GANDQRITQ 356
           I QVDGA D        ++D+ +  V   ++     + +  D +  + D  G+N+   + 
Sbjct: 341 IIQVDGAGDT-------SSDEDLGNVRDLDENEFLGIIDTEDLKALEEDDTGSNEDSTSN 393

Query: 357 VDGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDDPVELFD 416
                D  I  ++                             PL S DD S+ +  +LFD
Sbjct: 394 SSDDEDPEIDVIE---------------------------EDPLNSGDDVSEQEVPDLFD 426

Query: 417 TDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
           TDNV+VCQYDKIHR+K +WKF LK G+M+  GKD VF KA GEAEW
Sbjct: 427 TDNVIVCQYDKIHRSKNKWKFYLKDGVMSFGGKDYVFSKAIGEAEW 472

 Score = 77.0 bits (188), Expect = 8e-13
 Identities = 42/107 (39%), Positives = 63/107 (58%), Gaps = 4/107 (3%)

Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNELRHAQ 198
           N V KLY+++IEDV+ ++RE F+++GVDE VL+ELKQ WE KL QSKA E    +    Q
Sbjct: 8   NPVPKLYKSIIEDVMESVREIFVEEGVDEQVLKELKQLWETKLLQSKATEGFFRDNSTPQ 67

Query: 199 SVMP----LYVYANSMQGDKSQQPLVYHTMTAAGQQAAMSLPNSVLY 241
            V+     L+   ++  G+++        M  +G  +A SLP  + Y
Sbjct: 68  FVLQLPQNLHHSLHTSTGNRNVTHFATGEMGTSGSNSAFSLPAGITY 114
>gi|46442061|gb|EAL01353.1| hypothetical protein CaO19.10197 [Candida albicans SC5314]
 gi|46442302|gb|EAL01592.1| hypothetical protein CaO19.2682 [Candida albicans SC5314]
          Length = 275

 Score = 94.0 bits (232), Expect = 6e-18
 Identities = 88/325 (27%), Positives = 138/325 (42%), Gaps = 60/325 (18%)

Query: 142 AKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNELRHAQSVM 201
           +KLY++VIEDVIN+ R+ F ++G+DE  LQEL++ W +KL QS   + S +E    +  +
Sbjct: 7   SKLYESVIEDVINDSRQDFENNGIDESTLQELRRIWCEKLTQSGVAKFSWDEEEEEEEPV 66

Query: 202 P---LYVYANSMQGDKSQQPLVYHTMTAAGQQAAMSLPNSVLYQRQQHPNQQVYAPYQPG 258
               + V         ++QP      T A +    +   + + +     N  +     P 
Sbjct: 67  EQVDVDVEVGQSAQQAAEQPQAAINDTGATESNTSATTETTVKKEATASNSGL-----PD 121

Query: 259 TSTNQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDGANDQRITQVEEAND 318
              +  +T   G + PS  I  + AN    T  +G  D       G N   I        
Sbjct: 122 AQLSYNTTDSLGIEIPS--ISREKANGTEGTDKNGDGD------GGDNHGLIVP------ 167

Query: 319 QRITQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDG-ANDQRITQVDGANXXXXX 377
            +I Q DG       Q    ++++++QVDG ++    + DG  +D   +++D        
Sbjct: 168 -KIEQADGPIFFDNKQFK--SNKKLSQVDGESEFDEFESDGDLSDDINSELDSD------ 218

Query: 378 XXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKF 437
                                   S  +  DDD  E     N  +C +DK+ R K +WK 
Sbjct: 219 ------------------------SDKEEEDDDEEET----NFALCLFDKVQRIKNKWKS 250

Query: 438 NLKVGIMNLNGKDLVFQKATGEAEW 462
            L  GI N+NGKD VF KA GE+EW
Sbjct: 251 TLVAGIANINGKDYVFHKANGESEW 275
>gi|12963759|ref|NP_076119.1| general transcription factor II A, 1-like factor;
           TfIIa-alpha/beta-like factor; general transcription
           factor IIA, 1-like factor [Mus musculus]
 gi|12313737|gb|AAG50432.1| TFIIA-alpha/beta-like factor [Mus musculus]
          Length = 468

 Score = 93.6 bits (231), Expect = 8e-18
 Identities = 57/183 (31%), Positives = 93/183 (50%), Gaps = 21/183 (11%)

Query: 286 QRITQVDGANDQRITQVDGANDQRIT-QVEEANDQRITQVDGANDQRITQ----VDEAND 340
           +++ + +  +  R+++ +  +++ +  +V + +   I Q+DG  D   T+    + +A++
Sbjct: 301 KQLRKAEEPSSLRVSEKNCTSERDLNIRVTDDDINEIIQIDGTGDNSSTEEMGSIRDADE 360

Query: 341 QRITQVDGANDQRITQ-VDGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXP 399
                +  A D  + + VD  +++  T     N                          P
Sbjct: 361 NEFPGIIDAGDLNVLEEVDSVSNEDSTANSSDNEDHQINAPEED---------------P 405

Query: 400 LCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGE 459
           L S DD S+ D  +LFDT+NV+VCQYDKIHR+K RWKF LK G+M   G+D VF KA GE
Sbjct: 406 LNSGDDVSEQDVPDLFDTENVIVCQYDKIHRSKNRWKFYLKDGVMCFGGRDYVFAKAIGE 465

Query: 460 AEW 462
           AEW
Sbjct: 466 AEW 468

 Score = 63.9 bits (154), Expect = 7e-09
 Identities = 28/51 (54%), Positives = 40/51 (78%)

Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVEN 189
           N V KLYQ+VIEDVI  +R+ F ++G++E VL++LK+ WE K+ QSKA E+
Sbjct: 5   NLVPKLYQSVIEDVIEGVRDLFAEEGIEEQVLKDLKKLWETKVLQSKATED 55
>gi|34098596|sp|Q8R4I4|T2AY_MOUSE TFIIA-alpha and beta like factor (General transcription factor II
           A, 1-like factor)
 gi|12838693|dbj|BAB24296.1| unnamed protein product [Mus musculus]
 gi|30047818|gb|AAH50756.1| General transcription factor II A, 1-like factor [Mus musculus]
          Length = 468

 Score = 93.6 bits (231), Expect = 8e-18
 Identities = 57/183 (31%), Positives = 93/183 (50%), Gaps = 21/183 (11%)

Query: 286 QRITQVDGANDQRITQVDGANDQRIT-QVEEANDQRITQVDGANDQRITQ----VDEAND 340
           +++ + +  +  R+++ +  +++ +  +V + +   I Q+DG  D   T+    + +A++
Sbjct: 301 KQLRKAEEPSSLRVSEKNCTSERDLNIRVTDDDINEIIQIDGTGDNSSTEEMGSIRDADE 360

Query: 341 QRITQVDGANDQRITQ-VDGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXP 399
                +  A D  + + VD  +++  T     N                          P
Sbjct: 361 NEFPGIIDAGDLNVLEEVDSVSNEDSTANSSDNEDHQINAPEED---------------P 405

Query: 400 LCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGE 459
           L S DD S+ D  +LFDT+NV+VCQYDKIHR+K RWKF LK G+M   G+D VF KA GE
Sbjct: 406 LNSGDDVSEQDVPDLFDTENVIVCQYDKIHRSKNRWKFYLKDGVMCFGGRDYVFAKAIGE 465

Query: 460 AEW 462
           AEW
Sbjct: 466 AEW 468

 Score = 63.9 bits (154), Expect = 7e-09
 Identities = 28/51 (54%), Positives = 40/51 (78%)

Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVEN 189
           N V KLYQ+VIEDVI  +R+ F ++G++E VL++LK+ WE K+ QSKA E+
Sbjct: 5   NLVPKLYQSVIEDVIEGVRDLFAEEGIEEQVLKDLKKLWETKVLQSKATED 55
>gi|13124585|sp|Q9UNN4|T2AY_HUMAN TFIIA-alpha and beta like factor (General transcription factor II
           A, 1-like factor)
 gi|5091688|gb|AAD39634.1| TFIIA large subunit isoform ALF [Homo sapiens]
          Length = 478

 Score = 92.4 bits (228), Expect = 2e-17
 Identities = 68/239 (28%), Positives = 107/239 (44%), Gaps = 35/239 (14%)

Query: 251 VYAPYQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRI-TQVDGANDQRITQV------- 302
           V++P    T+++ +S   G          A   +D+ + T   GA  Q +T +       
Sbjct: 248 VFSPQVSQTNSDVESVLSGSAS------MAQNLHDESLSTSPHGALHQHVTDIQLHILKN 301

Query: 303 -----DGANDQRITQVEEANDQRITQVDGANDQRIT-QVDEANDQRITQVDGANDQRITQ 356
                D     R   +EE ++  +++ D  +   ++ +V + +   I QVDG+ D    +
Sbjct: 302 RMYGCDSVKQPR--NIEEPSNIPVSEKDSNSQVDLSIRVTDDDIGEIIQVDGSGDTSSNE 359

Query: 357 VDGAN-----DQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXX--------XPLCSA 403
             G+      ++ +  +DG +                                  PL S 
Sbjct: 360 EIGSTRDADENEFLGNIDGGDLKVPEEEADSISNEDSATNSSDNEDPQVNIVEEDPLNSG 419

Query: 404 DDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
           DD S+ D  +LFDTDNV+VCQYDKIHR+K +WKF LK G+M   G+D VF KA G+AEW
Sbjct: 420 DDVSEQDVPDLFDTDNVIVCQYDKIHRSKNKWKFYLKDGVMCFGGRDYVFAKAIGDAEW 478

 Score = 63.5 bits (153), Expect = 9e-09
 Identities = 28/51 (54%), Positives = 39/51 (76%)

Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVEN 189
           N V KLY++VIEDVI  +R  F ++G++E VL++LKQ WE K+ QSKA E+
Sbjct: 5   NPVPKLYRSVIEDVIEGVRNLFAEEGIEEQVLKDLKQLWETKVLQSKATED 55
>gi|40555809|gb|AAH64585.1| ALF protein [Homo sapiens]
          Length = 477

 Score = 92.4 bits (228), Expect = 2e-17
 Identities = 68/239 (28%), Positives = 107/239 (44%), Gaps = 35/239 (14%)

Query: 251 VYAPYQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRI-TQVDGANDQRITQV------- 302
           V++P    T+++ +S   G          A   +D+ + T   GA  Q +T +       
Sbjct: 247 VFSPQVSQTNSDVESVLSGSAS------MAQNLHDESLSTSPHGALHQHVTDIQLHILKN 300

Query: 303 -----DGANDQRITQVEEANDQRITQVDGANDQRIT-QVDEANDQRITQVDGANDQRITQ 356
                D     R   +EE ++  +++ D  +   ++ +V + +   I QVDG+ D    +
Sbjct: 301 RMYGCDSVKQPR--NIEEPSNIPVSEKDSNSQVDLSIRVTDDDIGEIIQVDGSGDTSSNE 358

Query: 357 VDGAN-----DQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXX--------XPLCSA 403
             G+      ++ +  +DG +                                  PL S 
Sbjct: 359 EIGSTRDADENEFLGNIDGGDLKVPEEEADSISNEDSATNSSDNEDPQVNIVEEDPLNSG 418

Query: 404 DDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
           DD S+ D  +LFDTDNV+VCQYDKIHR+K +WKF LK G+M   G+D VF KA G+AEW
Sbjct: 419 DDVSEQDVPDLFDTDNVIVCQYDKIHRSKNKWKFYLKDGVMCFGGRDYVFAKAIGDAEW 477

 Score = 63.5 bits (153), Expect = 9e-09
 Identities = 28/51 (54%), Positives = 39/51 (76%)

Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVEN 189
           N V KLY++VIEDVI  +R  F ++G++E VL++LKQ WE K+ QSKA E+
Sbjct: 4   NPVPKLYRSVIEDVIEGVRNLFAEEGIEEQVLKDLKQLWETKVLQSKATED 54
>gi|5091669|gb|AAD39617.1| SALF [Homo sapiens]
          Length = 1182

 Score = 92.4 bits (228), Expect = 2e-17
 Identities = 68/239 (28%), Positives = 107/239 (44%), Gaps = 35/239 (14%)

Query: 251  VYAPYQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRI-TQVDGANDQRITQV------- 302
            V++P    T+++ +S   G          A   +D+ + T   GA  Q +T +       
Sbjct: 952  VFSPQVSQTNSDVESVLSGSAS------MAQNLHDESLSTSPHGALHQHVTDIQLHILKN 1005

Query: 303  -----DGANDQRITQVEEANDQRITQVDGANDQRIT-QVDEANDQRITQVDGANDQRITQ 356
                 D     R   +EE ++  +++ D  +   ++ +V + +   I QVDG+ D    +
Sbjct: 1006 RMYGCDSVKQPR--NIEEPSNIPVSEKDSNSQVDLSIRVTDDDIGEIIQVDGSGDTSSNE 1063

Query: 357  VDGAN-----DQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXX--------XPLCSA 403
              G+      ++ +  +DG +                                  PL S 
Sbjct: 1064 EIGSTRDADENEFLGNIDGGDLKVPEEEADSISNEDSATNSSDNEDPQVNIVEEDPLNSG 1123

Query: 404  DDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
            DD S+ D  +LFDTDNV+VCQYDKIHR+K +WKF LK G+M   G+D VF KA G+AEW
Sbjct: 1124 DDVSEQDVPDLFDTDNVIVCQYDKIHRSKNKWKFYLKDGVMCFGGRDYVFAKAIGDAEW 1182

 Score = 60.8 bits (146), Expect = 6e-08
 Identities = 27/51 (52%), Positives = 38/51 (74%)

Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVEN 189
           N   KLY++VIEDVI  +R  F ++G++E VL++LKQ WE K+ QSKA E+
Sbjct: 709 NIQPKLYRSVIEDVIEGVRNLFAEEGIEEQVLKDLKQLWETKVLQSKATED 759
>gi|1942936|pdb|1TAF|A Chain A, Drosophila Tbp Associated Factors Dtafii42DTAFII62
          Heterotetramer
          Length = 68

 Score = 88.6 bits (218), Expect = 3e-16
 Identities = 38/68 (55%), Positives = 56/68 (82%)

Query: 4  PKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVDDTK 63
          PKDA+ I+++L+++ V EYEP+VV+QLLEFT+RY++ +LD+AKVYANHA +  I++DD +
Sbjct: 1  PKDAQVIMSILKELNVQEYEPRVVNQLLEFTFRYVTSILDDAKVYANHARKKTIDLDDVR 60

Query: 64 LAVMSQLD 71
          LA    LD
Sbjct: 61 LATEVTLD 68
>gi|6324768|ref|NP_014837.1| Transcription factor IIA, large chain; Toa1p [Saccharomyces
           cerevisiae]
 gi|418108|sp|P32773|TOA1_YEAST Transcription initiation factor IIA large chain (TFIIA 32 kDa
           subunit)
 gi|285446|pir||B41810 transcription factor IIA large chain - yeast (Saccharomyces
           cerevisiae)
 gi|172893|gb|AAA19654.1| transcription factor IIA
 gi|1420463|emb|CAA99407.1| TOA1 [Saccharomyces cerevisiae]
          Length = 286

 Score = 85.5 bits (210), Expect = 2e-15
 Identities = 73/326 (22%), Positives = 140/326 (42%), Gaps = 52/326 (15%)

Query: 142 AKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNELRHAQSVM 201
           +++Y+ ++E V+N +RE F + G+DE  LQ+LK  W++KL ++K    S +   +  ++ 
Sbjct: 7   SRVYEIIVESVVNEVREDFENAGIDEQTLQDLKNIWQKKLTETKVTTFSWDNQFNEGNI- 65

Query: 202 PLYVYANSMQGDKSQQPLVYHTMTAAGQQAAMSLPNSVLYQRQQHPNQQVYAPYQPGTST 261
                 N +Q D +         ++       +  N  L     + N  +  P+   T+ 
Sbjct: 66  ------NGVQNDLNFNLATPGVNSSEFNIKEENTGNEGLILPNINSNNNI--PHSGETNI 117

Query: 262 NQQSTSGGGKQHPSVIIQADGANDQRIT---QVDGANDQRITQVDGANDQRITQVEEAND 318
           N  +         ++     G  +  +T   +++   +  +T     N+  IT VE  +D
Sbjct: 118 NTNTVEATNNSGATLNTNTSGNTNADVTSQPKIEVKPEIELT----INNANITTVENIDD 173

Query: 319 QRITQVDGANDQRITQVDEANDQRITQV--DGANDQRITQVDGANDQRITQVDGANXXXX 376
           +   + D   ++ + +  +  +Q I QV      ++R   +D   D+  +++D ++    
Sbjct: 174 ESEKKDDEEKEEDVEKTRKEKEQ-IEQVKLQAKKEKRSALLD--TDEVGSELDDSDDDYL 230

Query: 377 XXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWK 436
                                       +G +D P E     N+++C YDK+ R K RWK
Sbjct: 231 IS--------------------------EGEEDGPDE-----NLMLCLYDKVTRTKARWK 259

Query: 437 FNLKVGIMNLNGKDLVFQKATGEAEW 462
            +LK G++ +N  D  FQKA  EAEW
Sbjct: 260 CSLKDGVVTINRNDYTFQKAQVEAEW 285
>gi|46107550|ref|XP_380834.1| hypothetical protein FG00658.1 [Gibberella zeae PH-1]
 gi|42545545|gb|EAA68388.1| hypothetical protein FG00658.1 [Gibberella zeae PH-1]
          Length = 413

 Score = 82.8 bits (203), Expect = 1e-14
 Identities = 94/411 (22%), Positives = 143/411 (34%), Gaps = 90/411 (21%)

Query: 140 SVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAV------------ 187
           +V  +Y+ +I++V+N+ R  F + GV+E VL+EL+Q W+QKL Q                
Sbjct: 5   AVGGVYKAIIDEVVNSSRVDFEESGVEESVLEELRQGWQQKLTQLDVAGFPWDPKPDPPP 64

Query: 188 -----ENSSNELRHAQS------------VMPLYVYANSMQGDKSQQPLVYHTMTAAGQQ 230
                   +    HAQ+             +P      +  G K +   V        +Q
Sbjct: 65  AQAPPPAMAAAQTHAQAQFSPHPGNPTGLSLPGMAQPQAQNGVKPEGSYVKTESPVPIKQ 124

Query: 231 AAMSLPNSVLYQRQQHPN---------------QQVYAPYQPGTSTNQQSTSGGGKQHPS 275
                 N + YQ+  +PN               QQ+ A Y    + +  +     + HP 
Sbjct: 125 EPGLQGNPMAYQQYANPNINMNDKNNVAANRAAQQLQAQYGQRAAGSINALHQQQQPHPQ 184

Query: 276 VIIQADGANDQRITQVDGANDQRITQVDGANDQRITQVEEANDQRITQVDGANDQRITQV 335
              Q        + Q    + Q+  Q  G   Q +TQ ++   Q   Q   A      Q 
Sbjct: 185 Q-GQQQQQQQPALPQQQQTHHQQQQQQPG-QQQNLTQQQQQQQQMYRQQMAAATAHQQQQ 242

Query: 336 DEANDQ---RITQVDGAND---------QRITQVDGANDQRI--------------TQVD 369
             +N Q   +  Q DGA D         QR    D     R+                ++
Sbjct: 243 HPSNGQPHIQNGQTDGAGDVDDYEGVLMQRAASGDFCELGRVEIDRMLHEQILAKAKSME 302

Query: 370 GANXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDD----------DPVELFDTD- 418
           G                                 DD  +D          DP +  D D 
Sbjct: 303 GGGLMVPLKEATRHSSASTSRKAKGKQPAAFDGGDDDEEDDDDAINSDLDDPEDEHDDDD 362

Query: 419 -------NVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
                  N+++C YDK+ R K +WK  LK G++ +NGK+ VF KATGE EW
Sbjct: 363 VDDDGLGNIMLCMYDKVQRVKNKWKCTLKDGVLTVNGKEYVFHKATGEYEW 413
>gi|32422741|ref|XP_331814.1| hypothetical protein [Neurospora crassa]
 gi|28926818|gb|EAA35782.1| hypothetical protein [Neurospora crassa]
 gi|38567155|emb|CAE76449.1| related to transcription factor TFIIA-L [Neurospora crassa]
          Length = 421

 Score = 81.6 bits (200), Expect = 3e-14
 Identities = 98/422 (23%), Positives = 156/422 (36%), Gaps = 104/422 (24%)

Query: 140 SVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQ---------------- 183
           +V  +Y+ +I +VIN +R  F ++GVD+ VL+ELK+ W+ KL+Q                
Sbjct: 5   AVGTVYEHIINEVINAVRVDFEENGVDDSVLEELKKGWQHKLSQLNIAQFPWDPKPEAPP 64

Query: 184 -SKAVENSSNELRHAQSVMPLYVYANSMQGDKSQQPLVYHTMTAAGQ--------QAAMS 234
            ++A +N+S+ + +AQ+       AN  Q   S Q          GQ        ++   
Sbjct: 65  PAQATQNTSSAV-NAQAAATQQPTANYTQSTLSPQTAAQSPSLPGGQPNGNGVAIKSEPG 123

Query: 235 LPNSVLYQRQQHPNQQVYAPYQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRITQVDGA 294
           + N    +++    Q +  P  PG   N  +  G   Q  +  I A+    Q    ++  
Sbjct: 124 MANEPTIKQEPGTMQPMMHPAYPG---NNAARGGMAAQRAAQAI-ANRYGSQAQASINAI 179

Query: 295 NDQ-RITQVDG---------ANDQRITQVEEANDQRITQVDGANDQRITQVDEAN-DQRI 343
           + Q +  Q+ G            Q+  Q ++   Q+  Q   +  Q+  Q   A   QRI
Sbjct: 180 HQQTQQPQIPGQAPPPPLQQQQQQQQQQQQQQQQQQQQQQQFSPQQQYQQTVAAQLQQRI 239

Query: 344 TQVDG----------ANDQRITQVDGAND------------------------------- 362
            Q  G           N     QVDG++D                               
Sbjct: 240 PQQPGQPGQPQPPNQQNGLGGAQVDGSSDGFDGVMMRRDADGNPVEMGRVEIDNLLHAQL 299

Query: 363 -QRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXP--LCSADDGSD----------- 408
            +R  Q +G                            P  L   D+  D           
Sbjct: 300 LERAKQREGGGLMLPLKEATKQRSNATKSRRAAAAGGPNQLDGNDEDDDEFDEDAINSDL 359

Query: 409 DDPVELFDTDN--------VVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEA 460
           DDP +  D D+        +++C YDK+ R K +WK  LK G++ +NGK+ VF KA GE 
Sbjct: 360 DDPEDERDDDDEEDEASGWIMLCLYDKVQRVKNKWKCTLKDGVLTVNGKEYVFHKANGEY 419

Query: 461 EW 462
           EW
Sbjct: 420 EW 421
>gi|18390773|ref|NP_563790.1| transcription factor IIA large subunit, putative / TFIIA large
           subunit, putative [Arabidopsis thaliana]
 gi|14532726|gb|AAK64164.1| putative transcription factor IIA large subunit [Arabidopsis
           thaliana]
 gi|15146226|gb|AAK83596.1| At1g07470/F22G5_13 [Arabidopsis thaliana]
 gi|15983370|gb|AAL11553.1| At1g07470/F22G5_13 [Arabidopsis thaliana]
 gi|21554033|gb|AAM63114.1| transcription factor IIA large subunit [Arabidopsis thaliana]
 gi|22136784|gb|AAM91736.1| putative transcription factor IIA large subunit [Arabidopsis
           thaliana]
 gi|39545872|gb|AAR27999.1| TFIIA-L2 [Arabidopsis thaliana]
          Length = 375

 Score = 81.3 bits (199), Expect = 4e-14
 Identities = 98/384 (25%), Positives = 151/384 (39%), Gaps = 67/384 (17%)

Query: 136 GGQNSVAKLYQTVIEDVINNIREAFLDDG-VDEHVLQELKQTWEQKLAQSKAVENSSNEL 194
           G   + + +Y  VIEDV+N +RE F+++G   E VL EL+  WE K+ Q+  V N   E 
Sbjct: 2   GTTTTTSAVYIHVIEDVVNKVREEFINNGGPGESVLSELQGIWETKMMQA-GVLNGPIER 60

Query: 195 RHAQSVMPLYVYANSMQ-----GDKSQQPLVYHTMTAAGQQAAMSLP------NSVLYQR 243
             AQ   P     + +       ++ + P           Q  +  P      NS +Y  
Sbjct: 61  SSAQKPTPGGPLTHDLNVPYEGTEEYETPTAEMLFPPTPLQTPLPTPLPGTADNSSMYNI 120

Query: 244 QQHPNQQVYAPYQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRITQVDGAND------- 296
               +       + G + + ++        PS     +   D  +  VDG ++       
Sbjct: 121 PTGSSDYPTPGTENGVNIDVKARPSPYMPPPSP--WTNPRLDVNVAYVDGRDEPERGNSN 178

Query: 297 QRITQ-----VDGANDQRITQVEEANDQRITQVDGANDQRITQVDEANDQ----RITQVD 347
           Q+ TQ       G   +  +     N   I Q DGA+D     + EAN +    RIT V 
Sbjct: 179 QQFTQDLFVPSSGKRKRDDSSAHYQNGGSIPQQDGASDA----IPEANFECAALRITYV- 233

Query: 348 GANDQRITQVDGANDQRITQVDGA------------NXXXXXXXXXXXXXXXXXXXXXXX 395
              D++I +    +  +I QVDG             N                       
Sbjct: 234 --GDRKIPRDLIGSSSKIPQVDGPMPDPYDEMLSTPNIYSYQGPNEEFNEARTPAPNEIQ 291

Query: 396 XXXPLCSADD---------GSDDDPVELFD--------TDNVVVCQYDKIHRAKCRWKFN 438
              P+   +D           DDD  EL D        T ++V+ Q+DK+ R K RWK +
Sbjct: 292 TSTPVAVPNDIIEDDEELLNEDDDDDELDDLESGEDMNTQHLVLAQFDKVTRTKSRWKCS 351

Query: 439 LKVGIMNLNGKDLVFQKATGEAEW 462
           LK GIM++N KD++F KATGE ++
Sbjct: 352 LKDGIMHINDKDILFNKATGEFDF 375
>gi|46097824|gb|EAK83057.1| hypothetical protein UM05183.1 [Ustilago maydis 521]
          Length = 214

 Score = 79.7 bits (195), Expect = 1e-13
 Identities = 75/322 (23%), Positives = 131/322 (40%), Gaps = 113/322 (35%)

Query: 141 VAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVENSSNELRHAQSV 200
           V++LY+++I+DV+NN+R+ F + G+++ VL+EL+++WE     +K V     E   A ++
Sbjct: 6   VSQLYRSIIDDVVNNVRQDFEEMGIEKEVLEELQRSWE-----AKIVATQVAEFEGAPAL 60

Query: 201 MPLYVYANSMQGDKSQQPLVYHTMTAAGQQAAMSLPNSVLYQRQQHPNQQVYAPYQPGTS 260
            P           KS+                         ++Q+  N+      +   +
Sbjct: 61  PPA----------KSRSS-----------------------KKQESKNE---VKTEDDDA 84

Query: 261 TNQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDGANDQRITQVEEANDQR 320
             Q S S G  QH + + + DG+ D      DGA        DG     +    +A D  
Sbjct: 85  NGQNSRSNGADQHRAKVARGDGSTDV----ADGA-------ADGKGQDGVAGSVKAEDD- 132

Query: 321 ITQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDGANDQRITQVDGANXXXXXXXX 380
               D A  Q+  ++ + +D+  + +D ++D+    +DG        VDG          
Sbjct: 133 ----DAAAGQKRKRISD-DDEIGSDLDDSDDE---DLDGG-------VDG---------- 167

Query: 381 XXXXXXXXXXXXXXXXXXPLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLK 440
                                              D D++V+C YDK+ R K +WK  LK
Sbjct: 168 -----------------------------------DNDDMVLCLYDKVQRVKNKWKCVLK 192

Query: 441 VGIMNLNGKDLVFQKATGEAEW 462
            G+ +++G+D +F K  GE EW
Sbjct: 193 DGVASIDGRDYLFAKCNGEFEW 214
>gi|30680148|ref|NP_850937.1| transcription factor IIA large subunit / TFIIA large subunit
           (TFIIA-L) [Arabidopsis thaliana]
 gi|30680153|ref|NP_172228.3| transcription factor IIA large subunit / TFIIA large subunit
           (TFIIA-L) [Arabidopsis thaliana]
 gi|11358887|pir||T51333 transcription factor IIA large chain [validated] - Arabidopsis
           thaliana
 gi|2826884|emb|CAA11525.1| transcription factor IIA large subunit [Arabidopsis thaliana]
 gi|39545932|gb|AAR28029.1| TFIIA-L1 [Arabidopsis thaliana]
          Length = 375

 Score = 79.3 bits (194), Expect = 2e-13
 Identities = 93/380 (24%), Positives = 147/380 (38%), Gaps = 59/380 (15%)

Query: 136 GGQNSVAKLYQTVIEDVINNIREAFLDDG-VDEHVLQELKQTWEQKLAQSKAVENSSNEL 194
           G   + + +Y  VIEDV+N +RE F+++G   E VL EL+  WE K+ Q+  V N   E 
Sbjct: 2   GTTTTTSAVYIHVIEDVVNKVREEFINNGGPGESVLSELQGIWETKMMQA-GVLNGPIER 60

Query: 195 RHAQSVMPLYVYANSMQ-----GDKSQQPLVYHTMTAAGQQAAMSLP------NSVLYQR 243
             AQ   P     + +       ++ + P           Q  +  P      NS +Y  
Sbjct: 61  SSAQKPTPGGPLTHDLNVPYEGTEEYETPTAEMLFPPTPLQTPLPTPLPGTADNSSMYNI 120

Query: 244 QQHPNQQVYAPYQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRITQVDGAND------- 296
               +       + G + + ++        PS     +   D  +  VDG ++       
Sbjct: 121 PTGSSDYPTPGTENGVNIDVKARPSPYMPPPSP--WTNPRLDVNVAYVDGRDEPERGNSN 178

Query: 297 QRITQ-----VDGANDQRITQVEEANDQRITQVDGANDQRITQVDEANDQRITQVDGAND 351
           Q+ TQ       G   +  +     N   I Q DGA D       E +  RIT +    D
Sbjct: 179 QQFTQDLFVPSSGKRKRDDSSGHYQNGGSIPQQDGAGDAIPEANFECDAFRITSI---GD 235

Query: 352 QRITQVDGANDQRITQVDGA------------NXXXXXXXXXXXXXXXXXXXXXXXXXXP 399
           +++ +   ++  +I QVDG             N                          P
Sbjct: 236 RKVPRDFFSSSSKIPQVDGPMPDPYDEMLSTPNIYSYQGPSEEFNEARTPAPNEIQTSTP 295

Query: 400 LCSADD---------GSDDDPVELFD--------TDNVVVCQYDKIHRAKCRWKFNLKVG 442
           +   +D           DDD  EL D        T ++V+ Q+DK+ R K RWK +LK G
Sbjct: 296 VAVQNDIIEDDEELLNEDDDDDELDDLESGEDMNTQHLVLAQFDKVTRTKSRWKCSLKDG 355

Query: 443 IMNLNGKDLVFQKATGEAEW 462
           IM++N KD++F KA GE ++
Sbjct: 356 IMHINDKDILFNKAAGEFDF 375
>gi|6323892|ref|NP_013963.1| Subunit (17 kDa) of TFIID and SAGA complexes, involved in RNA
           polymerase II transcription initiation and in chromatin
           modification, similar to histone H3; Taf9p
           [Saccharomyces cerevisiae]
 gi|2497197|sp|Q05027|TAF9_YEAST Transcription initiation factor TFIID subunit 9 (TBP-associated
           factor 9) (TBP-associated factor 17 kDa) (TAFII-17)
 gi|1362400|pir||S57603 hypothetical protein YMR236w - yeast (Saccharomyces cerevisiae)
 gi|887617|emb|CAA90207.1| unknown [Saccharomyces cerevisiae]
          Length = 157

 Score = 78.6 bits (192), Expect = 3e-13
 Identities = 39/123 (31%), Positives = 69/123 (56%), Gaps = 5/123 (4%)

Query: 4   PKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSN-----IN 58
           P+D   +  +L    + +YE +V  QL++F +RY   VL +A VY ++AG  N     + 
Sbjct: 30  PRDVRLLHLLLASQSIHQYEDQVPLQLMDFAHRYTQGVLKDALVYNDYAGSGNSAGSGLG 89

Query: 59  VDDTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNY 118
           V+D +LA+ ++    F    P++ ++ +A ++N  ALP V      RLPP++YCL +  +
Sbjct: 90  VEDIRLAIAARTQYQFKPTAPKELMLQLAAERNKKALPQVMGTWGVRLPPEKYCLTAKEW 149

Query: 119 RLK 121
            L+
Sbjct: 150 DLE 152
>gi|24059851|dbj|BAC21319.1| hypothetical protein [Oryza sativa (japonica cultivar-group)]
          Length = 425

 Score = 76.6 bits (187), Expect = 1e-12
 Identities = 38/95 (40%), Positives = 58/95 (61%), Gaps = 3/95 (3%)

Query: 4   PKDAETIIAVLQDMGVTE--YEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVDD 61
           P+D   +  +L  +G+ E  YE   VH+LL F +RY  +VL EAK YA HAGR ++  DD
Sbjct: 27  PRDLRVVREILHSLGLREGDYEEAAVHKLLLFAHRYAGDVLGEAKAYAGHAGRESLQADD 86

Query: 62  TKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALP 96
            +LA+ ++   S   PP R+ ++D+A + N I +P
Sbjct: 87  VRLAIQAR-GMSSAAPPSREEMLDIAHKCNEIPIP 120
>gi|45199045|ref|NP_986074.1| AFR527Wp [Eremothecium gossypii]
 gi|44985120|gb|AAS53898.1| AFR527Wp [Eremothecium gossypii]
          Length = 137

 Score = 75.1 bits (183), Expect = 3e-12
 Identities = 37/124 (29%), Positives = 70/124 (56%), Gaps = 4/124 (3%)

Query: 1   KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGR----SN 56
           ++ P+D   +  +L    +  YE +V  QL++F +RY   VL +A +Y  +A      ++
Sbjct: 5   EDTPRDVRLLHLLLASQSIHAYEDQVPLQLMDFAHRYTRGVLKDAMMYNEYANNGSAPAD 64

Query: 57  INVDDTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSV 116
           + V+D +LA+ S+    F    P++ L+ +A+++N   LP V +    RLPP++YCL + 
Sbjct: 65  LTVEDVRLAIGSRTQYQFKPTAPKELLLQLAQERNKKPLPQVMSMWGVRLPPEKYCLTAK 124

Query: 117 NYRL 120
            ++L
Sbjct: 125 EWQL 128
>gi|25406976|pir||D86209 protein F22G5.18 [imported] - Arabidopsis thaliana
 gi|8778565|gb|AAF79573.1| F22G5.18 [Arabidopsis thaliana]
          Length = 475

 Score = 75.1 bits (183), Expect = 3e-12
 Identities = 101/402 (25%), Positives = 155/402 (38%), Gaps = 84/402 (20%)

Query: 136 GGQNSVAKLYQTVIEDVINNIREAFLDDG-VDEHVLQELKQTWEQKLAQSKAVENSSNEL 194
           G   + + +Y  VIEDV+N +RE F+++G   E VL EL+  WE K+ Q+  V N   E 
Sbjct: 2   GTTTTTSAVYIHVIEDVVNKVREEFINNGGPGESVLSELQGIWETKMMQA-GVLNGPIER 60

Query: 195 RHAQSVMPLYVYANSMQ-----GDKSQQPLVYHTMTAAGQQAAMSLP------NSVLYQ- 242
             AQ   P     + +       ++ + P           Q  +  P      NS +Y  
Sbjct: 61  SSAQKPTPGGPLTHDLNVPYEGTEEYETPTAEMLFPPTPLQTPLPTPLPGTADNSSMYNI 120

Query: 243 -----------RQQHPNQQVYAPYQPGTSTNQQSTSGG-GKQHPSVIIQADGAN------ 284
                       +   N  V A   P    ++ +  GG  K   S ++Q   +       
Sbjct: 121 PTGSSDYPTPGTENGVNIDVKARPSPYMLCSETAYLGGLQKLRLSTLLQPPPSPWTNPRL 180

Query: 285 DQRITQVDGAND-------QRITQ-----VDGANDQRITQVEEANDQRITQVDGANDQRI 332
           D  +  VDG ++       Q+ TQ       G   +  +     N   I Q DGA+D   
Sbjct: 181 DVNVAYVDGRDEPERGNSNQQFTQDLFVPSSGKRKRDDSSAHYQNGGSIPQQDGASDA-- 238

Query: 333 TQVDEANDQ----RITQVDGANDQRITQVDGANDQRITQVDGA------------NXXXX 376
             + EAN +    RIT V    D++I +    +  +I QVDG             N    
Sbjct: 239 --IPEANFECAALRITYV---GDRKIPRDLIGSSSKIPQVDGPMPDPYDEMLSTPNIYSY 293

Query: 377 XXXXXXXXXXXXXXXXXXXXXXPLCSADD---------GSDDDPVELFD--------TDN 419
                                 P+   +D           DDD  EL D        T +
Sbjct: 294 QGPNEEFNEARTPAPNEIQTSTPVAVPNDIIEDDEELLNEDDDDDELDDLESGEDMNTQH 353

Query: 420 VVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAE 461
           +V+ Q+DK+ R K RWK +LK GIM++N KD++F KA+  ++
Sbjct: 354 LVLAQFDKVTRTKSRWKCSLKDGIMHINDKDILFNKASSTSD 395
>gi|38492543|pdb|1NVP|B Chain B, Human TfiiaTBPDNA COMPLEX
          Length = 57

 Score = 73.6 bits (179), Expect = 9e-12
 Identities = 34/50 (68%), Positives = 42/50 (84%)

Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVE 188
           N+V KLY++VIEDVIN++R+ FLDDGVDE VL ELK  WE KL QS+AV+
Sbjct: 7   NTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVD 56
>gi|17555056|ref|NP_499814.1| TBP-Associated transcription Factor family member (20.6 kD) (taf-9)
           [Caenorhabditis elegans]
 gi|7507733|pir||T24863 hypothetical protein T12D8.7 - Caenorhabditis elegans
 gi|3879796|emb|CAB03347.1| C. elegans TAF-9 protein (corresponding sequence T12D8.7)
           [Caenorhabditis elegans]
          Length = 183

 Score = 72.8 bits (177), Expect = 1e-11
 Identities = 44/163 (26%), Positives = 80/163 (49%), Gaps = 2/163 (1%)

Query: 5   KDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVDDTKL 64
           K+A  +I++L + G+ E++P+VV  L++  Y   S++L  +   + HA ++ I  +D + 
Sbjct: 21  KEALAVISMLHECGIQEFDPRVVSMLMDVQYAVTSKILQMSSGLSRHAEKAQIEAEDVQT 80

Query: 65  AVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRLKGSA 124
           A    L    TN P R+ ++ +A  KN   LP +++    +LP DR+C L  N+  K   
Sbjct: 81  AA-DMLGVLTTNAPDREKILQLANDKNQQPLPQIRHNYGLKLPNDRFCQLQQNFVYKADD 139

Query: 125 DKKVSSMAKAPGGQNSVAKLYQTV-IEDVINNIREAFLDDGVD 166
             +   M +     + +    Q +  E V N ++    DD  D
Sbjct: 140 SYQPMEMQQVTHQPHIIEPPTQVLRPEHVQNMLKRRAPDDDFD 182
>gi|46432465|gb|EAK91945.1| hypothetical protein CaO19.8708 [Candida albicans SC5314]
          Length = 229

 Score = 72.4 bits (176), Expect = 2e-11
 Identities = 38/130 (29%), Positives = 69/130 (53%), Gaps = 12/130 (9%)

Query: 1   KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSN---- 56
           K +P+D   +  +     V  Y+  V  QL++F +RY   VL +A VY +H+   N    
Sbjct: 75  KIIPRDVRLLHLIFATHAVHNYQDHVPLQLMDFAHRYTKSVLKDALVYNDHSKPINANTN 134

Query: 57  --------INVDDTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPP 108
                   +N DD +LA+ ++ +  F   PP++ L+++A+++N+  LP V       LPP
Sbjct: 135 LNSNTNATVNTDDIRLAIAARSNYQFKPVPPKELLLELAQERNSRPLPPVIPRWGLNLPP 194

Query: 109 DRYCLLSVNY 118
           ++YCL + ++
Sbjct: 195 EKYCLTAKDW 204
>gi|46432443|gb|EAK91924.1| hypothetical protein CaO19.1111 [Candida albicans SC5314]
          Length = 232

 Score = 72.4 bits (176), Expect = 2e-11
 Identities = 38/130 (29%), Positives = 69/130 (53%), Gaps = 12/130 (9%)

Query: 1   KNLPKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSN---- 56
           K +P+D   +  +     V  Y+  V  QL++F +RY   VL +A VY +H+   N    
Sbjct: 77  KIIPRDVRLLHLIFATHAVHNYQDHVPLQLMDFAHRYTKSVLKDALVYNDHSKPINANTN 136

Query: 57  --------INVDDTKLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPP 108
                   +N DD +LA+ ++ +  F   PP++ L+++A+++N+  LP V       LPP
Sbjct: 137 LNSNTNATVNTDDIRLAIAARSNYQFKPVPPKELLLELAQERNSRPLPPVIPRWGLNLPP 196

Query: 109 DRYCLLSVNY 118
           ++YCL + ++
Sbjct: 197 EKYCLTAKDW 206
>gi|40746468|gb|EAA65624.1| hypothetical protein AN0794.2 [Aspergillus nidulans FGSC A4]
          Length = 287

 Score = 72.0 bits (175), Expect = 3e-11
 Identities = 50/173 (28%), Positives = 78/173 (45%), Gaps = 36/173 (20%)

Query: 4   PKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVDDT- 62
           P+D   I  +L   GVT Y+ +V  QLL+F YRY S VL +A V+    G +    D T 
Sbjct: 73  PRDVRLIHMLLAAQGVTAYQERVPLQLLDFAYRYTSGVLQDA-VHLTTEGYAGTGADTTT 131

Query: 63  ------------------KLAVMSQLDNSFTNPPPRDFLVDVARQKNTIALP-LVKNYSA 103
                             +L++ S+L   F    P++FL+DVA ++N +ALP   + + A
Sbjct: 132 SSGNKGPPEVNSVTLPALRLSIASRLHYQFQTGLPKEFLMDVAAERNRVALPGATRGFDA 191

Query: 104 A---------------RLPPDRYCLLSVNYRLKGSADKKVSSMAKAPGGQNSV 141
                           RLPP+R+CL    +++    + +      AP  +N V
Sbjct: 192 GAGKPAASNSVLMAGMRLPPERFCLTGTGWKMSDEWESEGEEDIAAPEEKNDV 244
>gi|39591770|emb|CAE71348.1| Hypothetical protein CBG18251 [Caenorhabditis briggsae]
          Length = 184

 Score = 70.5 bits (171), Expect = 7e-11
 Identities = 43/164 (26%), Positives = 80/164 (48%), Gaps = 3/164 (1%)

Query: 5   KDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEAKVYANHAGRSNINVDDTKL 64
           K+A  I+++L + GV E++P+VV  L++  Y   S++L  +   + HA +  I+ +D + 
Sbjct: 21  KEAMAIMSLLGECGVEEFDPRVVSMLMDVQYAVTSKILQVSSGLSRHADKQKIDSEDVQT 80

Query: 65  AVMSQLDNSFTNPPPRDFLVDVARQKNTIALPLVKNYSAARLPPDRYCLLSVNYRLKGSA 124
           A    L    +N P R+ ++ +A  KN   LP +++    +LP DR+C L  N+  K   
Sbjct: 81  AA-DMLGVLSSNAPDREKILQMANDKNQQPLPQIRHNYGLKLPNDRFCQLQQNFVYKADD 139

Query: 125 DKKVSSMAKAPGGQNSVAKLYQTVI--EDVINNIREAFLDDGVD 166
           + +     +       +     +V+  E V N ++    DD  D
Sbjct: 140 NSQQMETQQVAHTPRIIEPPQSSVLRPEQVQNMLKRRAPDDDFD 183
>gi|19112462|ref|NP_595670.1| putative transcription initiation factor IIA large subunit
           [Schizosaccharomyces pombe]
 gi|7493058|pir||T40052 probable transcription initiation factor IIA large subunit -
           fission yeast (Schizosaccharomyces pombe)
 gi|6018693|emb|CAB57938.1| SPBC28F2.09 [Schizosaccharomyces pombe]
          Length = 369

 Score = 70.1 bits (170), Expect = 1e-10
 Identities = 91/391 (23%), Positives = 138/391 (35%), Gaps = 97/391 (24%)

Query: 141 VAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSK--------------- 185
           V ++Y  VI DVI N R  F ++GVD+  L+EL+  W+ KL  +                
Sbjct: 6   VGEVYHHVILDVIANSRSDFEENGVDDATLRELQNLWQSKLVATDVATFPWAQAPVGTFP 65

Query: 186 ---------------------AVENSS--NELRHAQSVMPLYVYANSMQGDKSQQPLVYH 222
                                AV NS   N +   ++V  +  +A          P    
Sbjct: 66  IGQLFDPVSGLRTDSLDVTAPAVANSPILNNIAAIRAVQQMDTFAQQHGNSNYYSP---P 122

Query: 223 TMTAAGQQAAMSLPNSVLYQRQQHPNQQVYAPYQPGTS----TNQQSTSGGGKQHPSVII 278
           T +       +S  +S +   Q +PN     P     S    TNQ + S     H +  +
Sbjct: 123 TPSLPQSATNISFDSSAIPNVQSNPNNTAPFPSYSSNSLQLPTNQTADSPIINDHSTANV 182

Query: 279 QADG---ANDQRITQVDGA------NDQRITQVDGANDQRITQVEEANDQRITQVDGA-- 327
            + G   A D   T   G       N  + +++        T     ND  + Q DGA  
Sbjct: 183 TSTGQEHAPDSSSTNSFGGLLLPNQNSPKKSELGETESSNTTPANSRND--VPQTDGAIH 240

Query: 328 --NDQRITQVDEANDQRITQVDGAN------DQRITQVDGA----NDQRITQVDGANXXX 375
             +D       E+N   I Q   A         RI Q+DG      D++   VD  +   
Sbjct: 241 DLDDAGSPSNFESNRFAIAQKADAEIYEVLKKNRILQIDGTIEDNEDEKKPPVDTPSDEA 300

Query: 376 XXXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDDPVELFDTDNV----VVCQYDKIHRA 431
                                       DD   D+  E  +  ++    V+C YDK++  
Sbjct: 301 INS-----------------------DLDDPDSDEAPETEEGSDIGQAIVLCLYDKVNHH 337

Query: 432 KCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
           K +WK   + G++ +NGKD +F KA GE EW
Sbjct: 338 KNKWKCVFRDGVVGVNGKDYLFFKANGEFEW 368
>gi|32420261|ref|XP_330574.1| hypothetical protein [Neurospora crassa]
 gi|28925958|gb|EAA34951.1| hypothetical protein [Neurospora crassa]
 gi|38567266|emb|CAE76556.1| related to TFIID and SAGA subunit [Neurospora crassa]
          Length = 320

 Score = 67.8 bits (164), Expect = 5e-10
 Identities = 50/148 (33%), Positives = 77/148 (52%), Gaps = 29/148 (19%)

Query: 4   PKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDE-----AKVYANHA------ 52
           P+DA TI  +L   GVT +E +V   LL+F YR+ S VL +     A  Y +HA      
Sbjct: 96  PRDARTIELLLTAQGVTAFEQRVPLLLLDFAYRHTSAVLSDALHLSADPYTSHAGARPSA 155

Query: 53  --GRSNINVDD-------TKLAVMSQLDNSFTNPP--------PRDFLVDVARQKNTIAL 95
             G + +NV D        +LA+ S+L+  F             +++++++AR+KN +AL
Sbjct: 156 SSGAAPVNVGDATITSNAVQLAIASRLNFQFRGGSGGGSGGGVSKEWMMEMAREKNKVAL 215

Query: 96  PLVK-NYSAARLPPDRYCLLSVNYRLKG 122
           P V  N    RLP +R+ L  V++ LKG
Sbjct: 216 PKVSVNEWGVRLPSERFVLNGVSWGLKG 243
>gi|26787966|ref|NP_751946.1| TFIIA-alpha/beta-like factor isoform 2; TFIIA large subunit isoform
           ALF; GTF2A1-like factor [Homo sapiens]
          Length = 453

 Score = 63.5 bits (153), Expect = 9e-09
 Identities = 28/51 (54%), Positives = 39/51 (76%)

Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSKAVEN 189
           N V KLY++VIEDVI  +R  F ++G++E VL++LKQ WE K+ QSKA E+
Sbjct: 5   NPVPKLYRSVIEDVIEGVRNLFAEEGIEEQVLKDLKQLWETKVLQSKATED 55

 Score = 49.3 bits (116), Expect = 2e-04
 Identities = 48/204 (23%), Positives = 81/204 (39%), Gaps = 35/204 (17%)

Query: 251 VYAPYQPGTSTNQQSTSGGGKQHPSVIIQADGANDQRI-TQVDGANDQRITQV------- 302
           V++P    T++N +S   G          A   +D+ + T   GA  Q +T +       
Sbjct: 248 VFSPQVSQTNSNVESVLSGSAS------MAQNLHDESLSTSPHGALHQHVTDIQLHILKN 301

Query: 303 -----DGANDQRITQVEEANDQRITQVDGANDQRIT-QVDEANDQRITQVDGANDQRITQ 356
                D     R   +EE ++  +++ D  +   ++ +V + +   I QVDG+ D    +
Sbjct: 302 RMYGCDSVKQPR--NIEEPSNIPVSEKDSNSQVDLSIRVTDDDIGEIIQVDGSGDTSSNE 359

Query: 357 VDGAN-----DQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXX--------XPLCSA 403
             G+      ++ +  +DG +                                  PL S 
Sbjct: 360 EIGSTRDADENEFLGNIDGGDLKVPEEEADSISNEDSATNSSDNEDPQVNIVEEDPLNSG 419

Query: 404 DDGSDDDPVELFDTDNVVVCQYDK 427
           DD S+ D  +LFDTDNV+VCQYDK
Sbjct: 420 DDVSEQDVPDLFDTDNVIVCQYDK 443
>gi|46107964|ref|XP_381040.1| hypothetical protein FG00864.1 [Gibberella zeae PH-1]
 gi|42547614|gb|EAA70457.1| hypothetical protein FG00864.1 [Gibberella zeae PH-1]
          Length = 256

 Score = 60.8 bits (146), Expect = 6e-08
 Identities = 54/201 (26%), Positives = 94/201 (46%), Gaps = 33/201 (16%)

Query: 4   PKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEA--------------KVYA 49
           P+D+  I  +L   GVT YEP+V   LL+F YR+ S VL +A              K  A
Sbjct: 56  PRDSRLIELLLTSQGVTAYEPRVPLLLLDFAYRHTSSVLSDALHLSGDPYVTQAGSKPSA 115

Query: 50  N------HAGRSNINVDDTKLAVMSQLDNSFTNPP-----PRDFLVDVARQKNTIALP-L 97
           N       AG + +  +  KLA+ ++L              ++++ ++AR++N +ALP +
Sbjct: 116 NAGAISAPAGDAAVTANAVKLAIAARLSYQLRGGNAGGGISKEYMQELARERNKVALPKI 175

Query: 98  VKNYSAARLPPDRYCLLSVNYRLKGSADKKVSSMAKAPGGQNSVAKLYQTVIEDVINN-- 155
           V +    RLP +R+ L   ++ LK   D+      +   G +++  +     EDV  +  
Sbjct: 176 VPSEWGVRLPSERFVLSGTSWGLKDVWDEDDEEDDEMDQGGDAMEGIEGPEPEDVGGDGV 235

Query: 156 ----IREAFLDDGVDEHVLQE 172
               + + F +D VDE + +E
Sbjct: 236 EGGTVEDMFGED-VDEEMAEE 255
>gi|37526068|ref|NP_929412.1| hypothetical protein [Photorhabdus luminescens subsp. laumondii
           TTO1]
 gi|36785498|emb|CAE14445.1| unnamed protein product [Photorhabdus luminescens subsp. laumondii
           TTO1]
          Length = 247

 Score = 60.8 bits (146), Expect = 6e-08
 Identities = 27/83 (32%), Positives = 51/83 (61%)

Query: 287 RITQVDGANDQRITQVDGANDQRITQVEEANDQRITQVDGANDQRITQVDEANDQRITQV 346
           R+T+ +   D + T+V+G  D R+T+ E   D + T+V+G  D R+T+ +   D + T+V
Sbjct: 104 RLTKSENRFDSKFTEVEGKFDARLTKSENRFDSKFTEVEGKFDARLTKSENRFDSKFTEV 163

Query: 347 DGANDQRITQVDGANDQRITQVD 369
           +G  D R+T+ +   D ++T+V+
Sbjct: 164 EGKFDARLTKSENRFDSKLTEVE 186

 Score = 52.8 bits (125), Expect = 2e-05
 Identities = 26/104 (25%), Positives = 55/104 (52%)

Query: 262 NQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDGANDQRITQVEEANDQRI 321
           N  S     +Q  + + +++   D + T+V+G  D R+T+ +   D + T+VE   D R+
Sbjct: 90  NTLSKKPSEEQLNARLTKSENRFDSKFTEVEGKFDARLTKSENRFDSKFTEVEGKFDARL 149

Query: 322 TQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDGANDQRI 365
           T+ +   D + T+V+   D R+T+ +   D ++T+V+    + +
Sbjct: 150 TKSENRFDSKFTEVEGKFDARLTKSENRFDSKLTEVESKQTESV 193

 Score = 48.1 bits (113), Expect = 4e-04
 Identities = 21/62 (33%), Positives = 38/62 (61%)

Query: 309 RITQVEEANDQRITQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDGANDQRITQV 368
           R+T+ E   D + T+V+G  D R+T+ +   D + T+V+G  D R+T+ +   D + T+V
Sbjct: 104 RLTKSENRFDSKFTEVEGKFDARLTKSENRFDSKFTEVEGKFDARLTKSENRFDSKFTEV 163

Query: 369 DG 370
           +G
Sbjct: 164 EG 165
>gi|38109765|gb|EAA55584.1| hypothetical protein MG01235.4 [Magnaporthe grisea 70-15]
          Length = 407

 Score = 60.5 bits (145), Expect = 8e-08
 Identities = 29/58 (50%), Positives = 38/58 (65%), Gaps = 3/58 (5%)

Query: 405 DGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
           D SD+D  EL      ++C YDK+ R K +WK  LK G++ +NGK+ VF KATGE EW
Sbjct: 353 DESDEDGDEL---QQQMLCLYDKVQRVKNKWKCTLKDGVLCVNGKEYVFHKATGEYEW 407
>gi|40746490|gb|EAA65646.1| hypothetical protein AN0816.2 [Aspergillus nidulans FGSC A4]
          Length = 419

 Score = 60.1 bits (144), Expect = 1e-07
 Identities = 62/244 (25%), Positives = 87/244 (35%), Gaps = 37/244 (15%)

Query: 227 AGQQAAMSLPNSVLYQRQQHPNQQVYAPYQPGTSTNQ--QSTSGGGKQHPSVI-IQADGA 283
           A  Q+ M LP     Q  Q   Q       P     Q  Q T      HPSV   Q DG 
Sbjct: 205 AQSQSPMGLPRPSSMQHMQQMQQMQQMQQMPNGQNPQIKQETGYHPVSHPSVSNAQTDGT 264

Query: 284 NDQRITQVDG-ANDQRITQVDGANDQRITQVEEANDQRITQVDGANDQRITQVDEANDQ- 341
               +++       +R    DG  D+ +    +   Q++ +++G     +  +DE   Q 
Sbjct: 265 ASDALSEWKAEVTRRREAAQDGEGDRALRNHLK---QQMLRLEGGG--LMVPLDERESQP 319

Query: 342 ---RITQVDGANDQRITQVDGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXX 398
              R    D +  +   Q DG       + D  +                          
Sbjct: 320 KAPRFASADSSGSK--AQFDGPGGDDTKEEDDEDAINSDLDDPDDLV------------- 364

Query: 399 PLCSADDGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATG 458
               A D  DDD V       V++C YDK+ R K +WK  LK GI+   GK+ VF K  G
Sbjct: 365 ----AQDDEDDDAV-----GQVMLCTYDKVQRVKNKWKCTLKDGILTTGGKEYVFHKGQG 415

Query: 459 EAEW 462
           E EW
Sbjct: 416 EFEW 419
>gi|45201324|ref|NP_986894.1| AGR228Cp [Eremothecium gossypii]
 gi|44986178|gb|AAS54718.1| AGR228Cp [Eremothecium gossypii]
          Length = 200

 Score = 59.3 bits (142), Expect = 2e-07
 Identities = 26/63 (41%), Positives = 40/63 (63%), Gaps = 5/63 (7%)

Query: 405 DGSDDDPVELFDTDN-----VVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGE 459
           D +DD+ V   + +N     +++C Y+K+ R K +WK NLK G+  +N KD  FQK+ GE
Sbjct: 138 DDTDDEYVNFGEEENGSDVNIMLCLYEKVLRVKNKWKCNLKDGVATINNKDYAFQKSQGE 197

Query: 460 AEW 462
           +EW
Sbjct: 198 SEW 200
>gi|38109853|gb|EAA55659.1| hypothetical protein MG01310.4 [Magnaporthe grisea 70-15]
          Length = 278

 Score = 59.3 bits (142), Expect = 2e-07
 Identities = 41/135 (30%), Positives = 67/135 (49%), Gaps = 25/135 (18%)

Query: 4   PKDAETIIAVLQDMGVTEYEPKVVHQLLEFTYRYISEVLDEA-----KVYANHAGR---- 54
           P+DA TI  +L   GVT +EP+V   LL+F YR+ + VL +A       Y  HAG     
Sbjct: 67  PRDARTIELLLTAQGVTAFEPRVPLLLLDFAYRHTAAVLSDALHLAGDPYTTHAGAKPSA 126

Query: 55  -------------SNINVDDTKLAVMSQLDNSFT--NPPPRDFLVDVARQKNTIALPLVK 99
                        ++++    ++A+ S+L   F       ++FL++ AR++N  ALP + 
Sbjct: 127 AQNATASAPSGGDASVSASAIQVAIASRLAFQFRGGGSTSKEFLLETARERNKAALPRIL 186

Query: 100 NYS-AARLPPDRYCL 113
            +    RLP +R+ L
Sbjct: 187 PHEWGVRLPSERFVL 201
>gi|17065294|gb|AAL32801.1| similar to TFIIA [Arabidopsis thaliana]
 gi|30023680|gb|AAP13373.1| At1g07480 [Arabidopsis thaliana]
          Length = 218

 Score = 58.9 bits (141), Expect = 2e-07
 Identities = 49/175 (28%), Positives = 72/175 (41%), Gaps = 32/175 (18%)

Query: 317 NDQRITQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDGANDQRITQVDGA----- 371
           N   I Q DGA D       E +  RIT +    D+++ +   ++  +I QVDG      
Sbjct: 47  NGGSIPQQDGAGDAIPEANFECDAFRITSI---GDRKVPRDFFSSSSKIPQVDGPMPDPY 103

Query: 372 -------NXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADD---------GSDDDPVELF 415
                  N                          P+   +D           DDD  EL 
Sbjct: 104 DEMLSTPNIYSYQGPSEEFNEARTPAPNEIQTSTPVAVQNDIIEDDEELLNEDDDDDELD 163

Query: 416 D--------TDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
           D        T ++V+ Q+DK+ R K RWK +LK GIM++N KD++F KA GE ++
Sbjct: 164 DLESGEDMNTQHLVLAQFDKVTRTKSRWKCSLKDGIMHINDKDILFNKAAGEFDF 218
>gi|1633309|pdb|1YTF|C Chain C, Yeast TfiiaTBPDNA COMPLEX
 gi|38492532|pdb|1NH2|C Chain C, Crystal Structure Of A Yeast TfiiaTBPDNA COMPLEX
          Length = 79

 Score = 58.9 bits (141), Expect = 2e-07
 Identities = 27/58 (46%), Positives = 37/58 (63%), Gaps = 5/58 (8%)

Query: 405 DGSDDDPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
           +G +D P E     N+++C YDK+ R K RWK +LK G++ +N  D  FQKA  EAEW
Sbjct: 26  EGEEDGPDE-----NLMLCLYDKVTRTKARWKCSLKDGVVTINRNDYTFQKAQVEAEW 78
>gi|1429226|emb|CAA67368.1| TFIIA [Arabidopsis thaliana]
          Length = 375

 Score = 58.2 bits (139), Expect = 4e-07
 Identities = 28/60 (46%), Positives = 42/60 (70%), Gaps = 1/60 (1%)

Query: 404 DDGSDD-DPVELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGEAEW 462
           DD  DD +  E  +T ++V+ Q+DK+ R K RWK +LK GIM++N KD++F KA GE ++
Sbjct: 316 DDELDDLESGEDMNTQHLVLAQFDKVTRTKSRWKCSLKDGIMHINDKDILFNKAAGEFDF 375
>gi|15237841|ref|NP_200731.1| transcription factor-related [Arabidopsis thaliana]
 gi|9759244|dbj|BAB09768.1| unnamed protein product [Arabidopsis thaliana]
 gi|39545874|gb|AAR28000.1| TFIIA-L3 [Arabidopsis thaliana]
          Length = 186

 Score = 55.8 bits (133), Expect = 2e-06
 Identities = 32/107 (29%), Positives = 51/107 (47%), Gaps = 26/107 (24%)

Query: 354 ITQVDGANDQRITQVDGANXXXXXXXXXXXXXXXXXXXXXXXXXXPLCSADDGSDDD-PV 412
           I Q DGA D+ I  VD                             PL   DD  +DD   
Sbjct: 102 IPQQDGARDEAIVDVD-------------------------ENEEPLNEDDDDEEDDIDD 136

Query: 413 ELFDTDNVVVCQYDKIHRAKCRWKFNLKVGIMNLNGKDLVFQKATGE 459
           +  +  ++V+CQ+DK+ R+K +W+     G+M +NGK+++F +ATG+
Sbjct: 137 DDMNIQHLVMCQFDKVKRSKNKWECKFNAGVMQINGKNVLFSQATGD 183
>gi|1633308|pdb|1YTF|B Chain B, Yeast TfiiaTBPDNA COMPLEX
 gi|38492531|pdb|1NH2|B Chain B, Crystal Structure Of A Yeast TfiiaTBPDNA COMPLEX
          Length = 53

 Score = 50.4 bits (119), Expect = 8e-05
 Identities = 18/44 (40%), Positives = 33/44 (75%)

Query: 142 AKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQTWEQKLAQSK 185
           +++Y+ ++E V+N +RE F + G+DE  LQ+LK  W++KL ++K
Sbjct: 6   SRVYEIIVESVVNEVREDFENAGIDEQTLQDLKNIWQKKLTETK 49
>gi|22994332|ref|ZP_00038840.1| hypothetical protein [Xylella fastidiosa Dixon]
          Length = 222

 Score = 49.3 bits (116), Expect = 2e-04
 Identities = 34/89 (38%), Positives = 49/89 (55%), Gaps = 8/89 (8%)

Query: 285 DQRITQVDGANDQRITQVDGANDQRITQVEEANDQRITQVDGANDQ---RITQVDEANDQ 341
           DQR  QVD    QR  ++D    ++I Q  E  DQR  QVD   +Q      Q+D+  DQ
Sbjct: 114 DQRFAQVD----QRFEKID-QRFEKIDQRFEKIDQRFAQVDQRFEQIAKDFAQLDKNMDQ 168

Query: 342 RITQVDGANDQRITQVDGANDQRITQVDG 370
           R+ Q+D A +QR  Q++ A +QR  + +G
Sbjct: 169 RLGQLDKALEQRFGQLEKALEQRFAKTEG 197
>gi|29893524|gb|AAN16518.1| circumsporozoite protein [Plasmodium coatneyi]
          Length = 341

 Score = 49.3 bits (116), Expect = 2e-04
 Identities = 32/91 (35%), Positives = 33/91 (36%)

Query: 281 DGANDQRITQVDGANDQRITQVDGANDQRITQVEEANDQRITQVDGANDQRITQVDEAND 340
           DGA D      DGA D      DGA D      + A D      DGA D      D A D
Sbjct: 90  DGARDGPAPAADGARDGPAPAADGARDGPAPAADGARDGPAPAADGARDGPAPAADGARD 149

Query: 341 QRITQVDGANDQRITQVDGANDQRITQVDGA 371
                 DGA D      DGA D      DGA
Sbjct: 150 GPAPAADGARDGPAPAADGARDGPAPPADGA 180

 Score = 47.4 bits (111), Expect = 7e-04
 Identities = 30/83 (36%), Positives = 31/83 (37%)

Query: 280 ADGANDQRITQVDGANDQRITQVDGANDQRITQVEEANDQRITQVDGANDQRITQVDEAN 339
           ADGA D      DGA D      DGA D      + A D      DGA D      D A 
Sbjct: 100 ADGARDGPAPAADGARDGPAPAADGARDGPAPAADGARDGPAPAADGARDGPAPAADGAR 159

Query: 340 DQRITQVDGANDQRITQVDGAND 362
           D      DGA D      DGA D
Sbjct: 160 DGPAPAADGARDGPAPPADGARD 182
>gi|24580579|ref|NP_722615.1| CG18497-PA [Drosophila melanogaster]
 gi|46397733|sp|Q8SX83|SPEN_DROME Split ends protein
 gi|10727421|gb|AAF51535.2| CG18497-PA [Drosophila melanogaster]
          Length = 5560

 Score = 48.9 bits (115), Expect = 2e-04
 Identities = 45/172 (26%), Positives = 73/172 (42%), Gaps = 19/172 (11%)

Query: 205  VYANSMQGDKSQQPLVYHTMTA-AGQQAAMSLPNSVLYQRQQH-PNQQVYAPYQPGTST- 261
            V+ N+ Q    QQP V   MTA   QQ      +  + QRQQH   QQ++   Q  TS  
Sbjct: 3807 VHLNAHQNQ--QQPQVIAKMTAHQHQQHMQQFMHQQMIQRQQHMQQQQLHGQSQQITSAP 3864

Query: 262  ---------NQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDGANDQRITQ 312
                      QQ      +QH +  + A     Q+  Q     +Q+I Q       ++ Q
Sbjct: 3865 QHQMHQQHQAQQQQQHHNQQHLNQQLHAQQHPTQKQHQAQQQFNQQIQQHQSQQQHQVQQ 3924

Query: 313  VEEANDQRITQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDGANDQR 364
              +A  Q ++Q    + Q++ Q  +A  Q++ Q+     Q++ Q+ G   Q+
Sbjct: 3925 QNQAQQQHLSQQQHQSQQQLNQQHQAQQQQLQQI-----QKLQQMHGPQQQQ 3971
>gi|6979936|gb|AAF34661.1| split ends long isoform [Drosophila melanogaster]
          Length = 5554

 Score = 48.9 bits (115), Expect = 2e-04
 Identities = 45/172 (26%), Positives = 73/172 (42%), Gaps = 19/172 (11%)

Query: 205  VYANSMQGDKSQQPLVYHTMTA-AGQQAAMSLPNSVLYQRQQH-PNQQVYAPYQPGTST- 261
            V+ N+ Q    QQP V   MTA   QQ      +  + QRQQH   QQ++   Q  TS  
Sbjct: 3801 VHLNAHQNQ--QQPQVIAKMTAHQHQQHMQQFMHQQMIQRQQHMQQQQLHGQSQQITSAP 3858

Query: 262  ---------NQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDGANDQRITQ 312
                      QQ      +QH +  + A     Q+  Q     +Q+I Q       ++ Q
Sbjct: 3859 QHQMHQQHQAQQQQQHHNQQHLNQQLHAQQHPTQKQHQAQQQFNQQIQQHQSQQQHQVQQ 3918

Query: 313  VEEANDQRITQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDGANDQR 364
              +A  Q ++Q    + Q++ Q  +A  Q++ Q+     Q++ Q+ G   Q+
Sbjct: 3919 QNQAQQQHLSQQQHQSQQQLNQQHQAQQQQLQQI-----QKLQQMHGPQQQQ 3965
>gi|24580583|ref|NP_722616.1| CG18497-PC [Drosophila melanogaster]
 gi|6715140|gb|AAF26299.1| split ends [Drosophila melanogaster]
 gi|22945598|gb|AAN10511.1| CG18497-PC [Drosophila melanogaster]
          Length = 5476

 Score = 48.9 bits (115), Expect = 2e-04
 Identities = 45/172 (26%), Positives = 73/172 (42%), Gaps = 19/172 (11%)

Query: 205  VYANSMQGDKSQQPLVYHTMTA-AGQQAAMSLPNSVLYQRQQH-PNQQVYAPYQPGTST- 261
            V+ N+ Q    QQP V   MTA   QQ      +  + QRQQH   QQ++   Q  TS  
Sbjct: 3750 VHLNAHQNQ--QQPQVIAKMTAHQHQQHMQQFMHQQMIQRQQHMQQQQLHGQSQQITSAP 3807

Query: 262  ---------NQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDGANDQRITQ 312
                      QQ      +QH +  + A     Q+  Q     +Q+I Q       ++ Q
Sbjct: 3808 QHQMHQQHQAQQQQQHHNQQHLNQQLHAQQHPTQKQHQAQQQFNQQIQQHQSQQQHQVQQ 3867

Query: 313  VEEANDQRITQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDGANDQR 364
              +A  Q ++Q    + Q++ Q  +A  Q++ Q+     Q++ Q+ G   Q+
Sbjct: 3868 QNQAQQQHLSQQQHQSQQQLNQQHQAQQQQLQQI-----QKLQQMHGPQQQQ 3914
>gi|6467825|gb|AAF13218.1| Spen RNP motif protein long isoform [Drosophila melanogaster]
          Length = 5533

 Score = 48.9 bits (115), Expect = 2e-04
 Identities = 45/172 (26%), Positives = 73/172 (42%), Gaps = 19/172 (11%)

Query: 205  VYANSMQGDKSQQPLVYHTMTA-AGQQAAMSLPNSVLYQRQQH-PNQQVYAPYQPGTST- 261
            V+ N+ Q    QQP V   MTA   QQ      +  + QRQQH   QQ++   Q  TS  
Sbjct: 3807 VHLNAHQNQ--QQPQVIAKMTAHQHQQHMQQFMHQQMIQRQQHMQQQQLHGQSQQITSAP 3864

Query: 262  ---------NQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDGANDQRITQ 312
                      QQ      +QH +  + A     Q+  Q     +Q+I Q       ++ Q
Sbjct: 3865 QHQMHQQHQAQQQQQHHNQQHLNQQLHAQQHPTQKQHQAQQQFNQQIQQHQSQQQHQVQQ 3924

Query: 313  VEEANDQRITQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDGANDQR 364
              +A  Q ++Q    + Q++ Q  +A  Q++ Q+     Q++ Q+ G   Q+
Sbjct: 3925 QNQAQQQHLSQQQHQSQQQLNQQHQAQQQQLQQI-----QKLQQMHGPQQQQ 3971
>gi|24580581|ref|NP_524718.2| CG18497-PB [Drosophila melanogaster]
 gi|10727420|gb|AAF51534.2| CG18497-PB [Drosophila melanogaster]
          Length = 5533

 Score = 48.9 bits (115), Expect = 2e-04
 Identities = 45/172 (26%), Positives = 73/172 (42%), Gaps = 19/172 (11%)

Query: 205  VYANSMQGDKSQQPLVYHTMTA-AGQQAAMSLPNSVLYQRQQH-PNQQVYAPYQPGTST- 261
            V+ N+ Q    QQP V   MTA   QQ      +  + QRQQH   QQ++   Q  TS  
Sbjct: 3807 VHLNAHQNQ--QQPQVIAKMTAHQHQQHMQQFMHQQMIQRQQHMQQQQLHGQSQQITSAP 3864

Query: 262  ---------NQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDGANDQRITQ 312
                      QQ      +QH +  + A     Q+  Q     +Q+I Q       ++ Q
Sbjct: 3865 QHQMHQQHQAQQQQQHHNQQHLNQQLHAQQHPTQKQHQAQQQFNQQIQQHQSQQQHQVQQ 3924

Query: 313  VEEANDQRITQVDGANDQRITQVDEANDQRITQVDGANDQRITQVDGANDQR 364
              +A  Q ++Q    + Q++ Q  +A  Q++ Q+     Q++ Q+ G   Q+
Sbjct: 3925 QNQAQQQHLSQQQHQSQQQLNQQHQAQQQQLQQI-----QKLQQMHGPQQQQ 3971
>gi|26247178|ref|NP_753218.1| Minor curlin subunit precursor [Escherichia coli CFT073]
 gi|26107579|gb|AAN79778.1| Minor curlin subunit precursor [Escherichia coli CFT073]
          Length = 160

 Score = 47.4 bits (111), Expect = 7e-04
 Identities = 43/158 (27%), Positives = 61/158 (38%), Gaps = 10/158 (6%)

Query: 208 NSMQGDKSQQPLVYHTMT---------AAGQQAAMSLPNSVLYQRQQHPNQQVYAPYQPG 258
           + +QGD  +  L++  +T         AAG   A S  N  + +  +    Q     Q G
Sbjct: 3   DQVQGDNMKNKLLFMMLTILGAPGIAAAAGYDLANSEYNFAVNELSKSSFNQAAIIGQAG 62

Query: 259 TSTNQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDGANDQRITQVEEAND 318
           T+ + Q   GG K   +V+ Q   +N  +I Q    N   I Q   AND  I+Q    N 
Sbjct: 63  TNNSAQLRQGGSKLL-TVVAQEGSSNRAKIDQTGDYNLAYIDQAGSANDASISQGAYGNT 121

Query: 319 QRITQVDGANDQRITQVDEANDQRITQVDGANDQRITQ 356
             I Q    N   ITQ        + Q       R+TQ
Sbjct: 122 AMIIQKGSGNKANITQYGTQKTAIVVQRQSQMAIRVTQ 159
>gi|24112441|ref|NP_706951.1| minor curlin subunit precursor, similar ro CsgA [Shigella flexneri
           2a str. 301]
 gi|24051320|gb|AAN42658.1| minor curlin subunit precursor, similar ro CsgA [Shigella flexneri
           2a str. 301]
          Length = 160

 Score = 47.4 bits (111), Expect = 7e-04
 Identities = 43/158 (27%), Positives = 61/158 (38%), Gaps = 10/158 (6%)

Query: 208 NSMQGDKSQQPLVYHTMT---------AAGQQAAMSLPNSVLYQRQQHPNQQVYAPYQPG 258
           + +QGD  +  L++  +T         AAG   A S  N  + +  +    Q     Q G
Sbjct: 3   DQVQGDNMKNKLLFMMLTILGAPGIAAAAGYDLANSEYNFAVNELSKSSFNQAAIIGQAG 62

Query: 259 TSTNQQSTSGGGKQHPSVIIQADGANDQRITQVDGANDQRITQVDGANDQRITQVEEAND 318
           T+ + Q   GG K   +V+ Q   +N  +I Q    N   I Q   AND  I+Q    N 
Sbjct: 63  TNNSAQLRQGGSKLL-AVVAQEGSSNRAKIDQTGDYNLAYIDQAGSANDASISQGAYGNT 121

Query: 319 QRITQVDGANDQRITQVDEANDQRITQVDGANDQRITQ 356
             I Q    N   ITQ        + Q       R+TQ
Sbjct: 122 AMIIQKGSGNKANITQYGTQKTAVVVQRQSQMAIRVTQ 159
>gi|46112495|ref|ZP_00180737.2| COG1322: Uncharacterized protein conserved in bacteria [Moorella
           thermoacetica ATCC 39073]
          Length = 190

 Score = 47.4 bits (111), Expect = 7e-04
 Identities = 28/85 (32%), Positives = 54/85 (63%)

Query: 283 ANDQRITQVDGANDQRITQVDGANDQRITQVEEANDQRITQVDGANDQRITQVDEANDQR 342
           A +Q++TQ   A +Q++TQ     +Q++T+  E  +Q++TQ   A +Q++T+  E  +Q+
Sbjct: 58  AVEQKLTQRIDAVEQKLTQRIDVVEQKLTRHIEEVEQKLTQRIDAVEQKLTEHIEEVEQK 117

Query: 343 ITQVDGANDQRITQVDGANDQRITQ 367
           +TQ   A ++ +TQ   A DQ++T+
Sbjct: 118 LTQRIDAVEKTLTQGIDAVDQKLTR 142
>gi|20148954|gb|AAM12730.1| TFIIA alpha/beta-like protein ALF [Mus musculus]
          Length = 41

 Score = 47.4 bits (111), Expect = 7e-04
 Identities = 20/37 (54%), Positives = 30/37 (81%)

Query: 139 NSVAKLYQTVIEDVINNIREAFLDDGVDEHVLQELKQ 175
           N V KLYQ+VIEDVI  +R+ F ++G++E VL++LK+
Sbjct: 5   NLVPKLYQSVIEDVIEGVRDLFAEEGIEEQVLKDLKK 41
  Database: All non-redundant GenBank CDS
  translations+PDB+SwissProt+PIR+PRF excluding environmental samples
    Posted date:  May 11, 2004 12:59 AM
  Number of letters in database: 593,787,773
  Number of sequences in database:  1,798,171
  
Lambda     K      H
   0.313    0.129    0.365 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 497,867,535
Number of Sequences: 1798171
Number of extensions: 19614691
Number of successful extensions: 66046
Number of sequences better than 1.0e-03: 91
Number of HSP's better than  0.0 without gapping: 77
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 65829
Number of HSP's gapped (non-prelim): 178
length of query: 462
length of database: 593,787,773
effective HSP length: 129
effective length of query: 333
effective length of database: 361,823,714
effective search space: 120487296762
effective search space used: 120487296762
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 110 (47.0 bits)