Branden and Tooze, 2nd ed. Chap. 2, esp. 29-32.
***Doolittle, R. F. (1995) The multiplicity of domains in proteins. Annu. Rev. Biochem. 64:287-314.
Hofmann, K., Bucher, P., Falquet, L., and Bairoch, A. (1999) The PROSITE database, its status in 1999. Nucleic Acids Res. 27:215-219. http://www.expasy.org/prosite/
Bateman, A., Birney, E., Durbin, R., Eddy, S. R., Finn, R. D., and Sonnhammer, E. L. L. (1999) Pfam 3.1: 1313 multiple alignments and profile HMMs match the majority of proteins. Nucleic Acids Res. 27:260-262. http://pfam.wustl.edu/
Henikoff, J. G., Henikoff, S., and Pietrokovski, S. (1999) New features of the Blocks Database servers. Nucleic Acids Res. 27:226-228. http://blocks.fhcrc.org/
Gilbert, W., de Souza, S. J., and Long, M. (1997) Origin of Genes Proc. Natl. Acad. Sci. USA 94:7698-703
Hurst, L. D. and McVean, G. T. (1996) A difficult phase for introns-early. Molecular evolution. Curr. Biol. 6:533-536.
Stoltzfus, A., Spencer, D. F., Zuker, M., Logsdon, J. M., and Doolittle, W. F. (1994) Testing the intron theory of genes: the evidence from protein structure. Science 265:202-207.
Cho, G. and Doolittle, R. F. (1997) Intron distribution in ancient paralogs supports random insertion and not random loss. J. Mol. Evol. 44:573-84.
Logsdon Jr., J. M. (1998) The recent origins of spliceosomal introns revisited. Curr. Opin. Genet. Dev. 8:637-48.
Heringa, J. and Taylor, W. R. (1997) Three-dimensional domain duplication, swapping and stealing. Curr Opin Struct Biol 7:416-421.
**Marcotte, E. M., Pellegrini, M., Ng, H. L., Rice, D. W., Yeates, T. O., and Eisenberg, D. (1999) Detecting protein function and protein-protein interactions from genome sequences. Science 285:751-753.
*** Lecture by Sean Eddy on Protein domains and Protein Domain Databases: http://www.people.Virginia.EDU/~wrp/cshl97/domain-lecture.html
(from Doolittle, 1995, Fig. 2)
| macrophage scavenger receptor | MSRE_HUMAN | Pfam | InterPro |
| Collagen VI(a3) | CO6A3_HUMAN | Pfam | InterPro |
| Collagen XII | CONA1_HUMAN | Pfam | InterPro |
| Enterokinase | ENTK_HUMAN | Pfam | InterPro |
| Factor XII | FA12_HUMAN | Pfam | InterPro |
| Complement C1r | C1R_HUMAN | Pfam | InterPro |
| Complement C6 | CO6_HUMAN | Pfam | InterPro |
| Name | Pfam(US)/Pfam(UK) | len | |
|---|---|---|---|
| VWFA (VA) | von Willebrand factor, type A | PF00092 / vwa | 174 |
| LAMG (LM) | laminin g-like (A-type module) | PF00054 / laminin_G | 134 |
| FA58C (FC) | coagulation factor V/VIII, type C | PF00754 / F5_F8_type_C | 147 |
| C1Q (CQ) | collagen/complement C1q | PF00386 / C1q | 118 |
| CADH | cadherin-like | PF00028 / cadherin | 94 |
| IGSF | immunoglobulin | PF00047 / ig | 65 |
| FN3 | fibronectin, type III | PF00041 / fn3 | 85 |
| HEMOP (HX) | hemopexin-like | PF00045 / hemopexin | 45 |
| LDLY (LY) | "YWTD" repeat, LDL-receptor | PF00045 / ldl_recept_b | 43 |
| LRP (LR) | leucine-rich (tolloid) | PF00560 / LRR | 23 |
(Doolittle, Table 3)
| Name | Pfam | len | |
|---|---|---|---|
| VWFB | von Willebrand factor, type B | ?? | |
| SOMAB | Somatomedin (vitronectin) B | PF01093 / vwc | 44 |
| LDLRA (LA) | LDL receptor, type A | PF00057 / ldl_recept_a | 40 |
| FN1 (F1) | Fibronectin, type I | PF00039 / fn1 | 37 |
| EGF (EG) | epidermal growth-factor like | PF00008 / EGF | 34 |
| FOLL1 (FS) | follistatin (ovomucoid) | ?? | |
| PDOM (PD) | P domain (trefoil) | ?? | |
| FN2 (F2) | fibronectin type II | PF00040 / fn2 | 41 |
| TSP1 (T1) | thrombospondin, type I | PF00090 / tsp_1 | 49 |
| CCP (CP) | complement control protein (sushi, SCR) | PF00084 / sushi | 57 |
![]() |
![]() |