Chapter 2. Review of the literature

Table of Contents
2.1. Fibril-forming collagens
2.2. Non-fibril forming collagens
2.3. Family of type XV and XVIII collagens
2.4. Collagen genes
2.5. Mutations in human collagen genes
2.6. Collagen mutations in animals
2.7. Collagens in development
2.8. Basement membranes – structure and function

The word “collagen” originates from the mid 19th century and it means a constituent of the connective tissue that produces glue when boiled. Although the early histologists knew about the presence of fibers in connective tissues in the 19th century, it was not until the 1950s that the building unit of the collagen fiber, a collagen molecule, was identified (for review, see van der Rest & Garrone, 1991). Thereafter, it was realized that a similar collagenous sequence was shared by a number of ECM proteins, and the existence of a protein family became evident. This chapter provides a brief overview of vertebrate collagen types, their genes and tissue locations, with special emphasis on the latest developments of the family of type XV and XVIII collagens. The role of collagens in the development and etiology of various diseases will be discussed, as well as the contribution of the use of transgenic technology in collagen research. A short discussion of basement membranes (BM) is included in view of its relevance to the results obtained in this research.

For a protein to be classified as collagen it should have at least one domain in the characteristic triple helical conformation and it should form supramolecular aggregates in the ECM. All collagens contain two distinct types of domains, the triple helical, collagenous (COL) domains and the globular, noncollagenous (NC) domains. The size, number and the distribution of the two domain types vary between individual collagen types. The characteristic triple-helical conformation is achieved as three identical (homotrimer) or dissimilar (heterotrimer) polypeptide chains, called α-chains, with repeated Gly-X-Y sequences wind around each other. The presence of glycine as every third amino acid is essential, since a larger amino acid would not fit into the center of the helix. The X and Y positions are frequently occupied by proline and hydroxyproline, respectively. In addition, some lysine residues are modified by hydroxylation and glycosylation. Collagen biosynthesis is a complex process involving a number of unique post-translational modifications and eight specific post-translational enzymes. (For reviews, see Burgeson & Nimni, 1992; Hulmes, 1992; Kivirikko, 1993; Prockop & Kivirikko, 1995; Myllyharju & Kivirikko, 2001)

The collagen family is composed of 23 collagen types and 38 genetically distinct α-chains, four of which are currently under characterization (for reviews, see Vuorio & de Crombrugghe, 1990; van der Rest & Garrone, 1991; Burgeson & Nimni, 1992; Hulmes, 1992; Kielty et al., 1993; Kivirikko, 1993; Mayne & Brewton, 1993; van der Rest & Bruckner, 1993; Pihlajaniemi & Rehn, 1995; Prockop & Kivirikko, 1995; Myllyharju & Kivirikko, 2001). The various collagen types, their constituent chains, chain compositions, and occurrence in tissues are summarized in Table 1. A brief discussion is provided below to provide a necessary basis for further discussions on the gene structures and the role of collagens in development and various human diseases. Collagens can be divided into two major groups based on similarities in the supramolecular assemblies, primary structures, and other characteristics - the fibril-forming collagens and the non-fibril forming collagens. The latter group can be further divided into several subgroups.

Table 1. Collagen types, their constituent chains, chain compositions, and tissue distributions.

TypeConstituentChain compositionOccurrence
Iα1(I)[α1(I)]2α2(I) commonMost connective tissues; abundant in dermis, bone, tendon, ligament, cornea
α2(I)[α1(I)]3 rare
IIα1(II)[α1(II)]3Cartilage, vitreous body
IIIα1(III)[α1(III)]3Tissues rich in collagen I, especially skin, blood vessels, and inner organs; not in bone or cartilage
IVα1(IV)[α1(IV)]2α2(IV)All basement membranes
α2(IV)[α3(IV)]3
α3(IV)[α3(IV)]2α4(IV)
α4(IV)α3(IV)α4(IV)α5(IV)
α5(IV)α1(IV)?α6(IV)?
α6(IV)α3(IV)?α6(IV)?
Vα1(V)[α1(V)]2α2(V)Tissues containing collagen I; quantitatively minor component
α2(V)α1(V)α2(V)α3(V)
α3(V)[α1(V)]3
α4(V)α1(V)?α4(V)?
VIα1(VI)α1(VI)α2(VI)α3(VI)Most connective tissues, including cartilage
α2(VI) 
α3(VI) 
VIIα1(VII)[α1(VII)]3Anchoring fibrils (skin, cornea, cervix, oral and oesophageal mucosa)
VIIIα1(VIII)[α1(VIII)]2α2(VIII)Many tissues, e.g. Descemet’s membrane of eye(Greenhill et al., 2000)
α2(VIII)[α1(VIII)]3, [α2(VIII)]3
IXα1(IX)α1(IX)α2(IX)α3(IX)Tissues containing type II collagen; quantitatively minor component
α2(IX) 
α3(IX) 
Xα1(X)[α1(X)]3Hypertrophic cartilage
XIα1(XI)α1(XI)α2(XI)α3(XI)Tissues containing collagen II; quantitatively minor component
α2(XI)other forms
α3(XI)§ 
XIIα1(XII)[α1(XII)]3Tissues containing collagen I; quantitatively minor component
XIIIα1(XIII)[α1(XIII)]3Most tissues in low quantities
XIVα1(XIV)[α1(XIV)]3Tissues containing collagen I; quantitatively minor component
XVα1(XV)NDMany tissues in the BM zone
XVIα1(XVI)[α1(XVI)]3Many tissues
XVIIα1(XVII)[α1(XVII)]3Skin hemidesmosomes
XVIIIα1(XVIII)NDMany tissues in the BM zone
XIXα1(XIX)NDMany tissues in the BM zone
* Fibril-forming collagen-like  
* FACIT-like, XXI (Fitzgerald & Bateman, 2001 ) mRNAs in many tissues, incl. heart, stomach, kidney, skeletal muscle, placenta
* FACIT-like  
* Collagen XIII-like  

If not indicated, collected from references appearing in the text.

§ The α3(XI) is a post-translational variant of α1(II).

ND, not determined

* Complete cDNA sequences characterized

Left outside the collagen superfamily are 15 proteins that contain triple-helical collagenous domains, but lack one or more of the criteria for classification as collagens (for example, are not structural components of ECMs) (for reviews, see Myllyharju & Kivirikko, 2001).

2.1. Fibril-forming collagens

As the name implies, the fibril-forming collagens, types I-III, V, and XI, aggregate into characteristic, highly-ordered, quarter-staggered fibrils in the ECM (van der Rest & Garrone, 1991; Burgeson & Nimni, 1992; Hulmes, 1992; Kielty et al., 1993; Fichard et al., 1994; Prockop & Kivirikko, 1995). They share a strong structural similarity in that the major part of each molecule is formed by an uninterrupted triple-helical domain. In addition, they are all synthesized as precursors that are proteolytically trimmed of their non-collagenous ends to yield mature molecules. The important function of fibrillar collagens in providing structural integrity to tissues and organs is illustrated by the consequences of their mutations in a number of human diseases and in transgenic animal models (see Kuivaniemi et al., 1991; Kivirikko, 1993; Prockop & Kivirikko, 1995; Aszodi et al., 1998; and Myllyharju & Kivirikko, 2001 for reviews and discussion later). In addition, collagens V and XI have a specific role in fibrillogenesis in the regulation of fibril diameter (reviewed by Fichard et al., 1994).

Type I collagen is a major structural constituent of most connective tissues, except for cartilage, where homotrimeric type II collagen is prevalent. Type III collagen is found in many tissues rich in type I collagen, but especially in those requiring extensibility such as skin, lung, and blood vessels. Quantitatively minor fibrillar collagens, types V and XI, are associated with collagens I and II, respectively, and are located in the core of the collagen fibrils. Contrary to the other fibrillar collagens, their N-terminal extensions are retained and project onto the fibril surface. This feature, together with the correct molar ratios of I/V and II/XI collagens in fibrils is significant in the regulation of the fibril diameter (Birk et al., 1990; Marchant et al., 1996; Blaschke et al., 2000). The recently isolated novel α4(V) chain has a restricted tissue distribution in the developing and regenerating peripheral nerves, which suggests a unique role in the regulation of these processes (Chernousov et al., 1996; 1999; 2000).