A full-length cDNA clone for the human preproα1(I) chain of type I procollagen was characterized.
Nucleotide sequencing of the first 1500 nucleotide residues of the 5'-end of the cDNA clone provided 729
nucleotide residues and the codons for 243 amino acid residues not previously defined from any species. The
data made it possible, for the first time, to compare codon usage completely for the human α1(I) and α2(I)
chains.
INTRODUCTION
Type I collagen is the major component of bone,
tendon, skin and dentine (for reviews see Prockop &
Kivirikko, 1984; Burgeson & Morris, 1986). It is a
heterotrimer with the chain composition [α1(I)]2α2(I),
and it is first synthesized as a procollagen composed of
two proα1(I) chains and one proα2(I) chain. Although data on
the complex structure of type I collagen are now available,
a complete nucleotide sequence for the preproα1(I)
cDNA has not hitherto been determined.
Here we report the partial sequence of a full-length
human cDNA clone coding for a preproα1(I) chain
isolated previously (Stacey et al., 1987). The information
Fig. 1. Sequencing strategy used and partial restriction map of the cDNA clone for the preproα1(I) chain (pHUCI)
(a) Sequencing strategy used for the approximately 1.5 kb EcoRI/XhoI fragment of pHUCI. Arrows starting with X indicate clones that
were generated by the Sequenest transposon-deletion system. Other arrows indicate clones starting at the corresponding restriction
sites. (b) Partial restriction map for the entire pHUCI. The untranslated region is marked, and the regions coding for the protein
domains (propeptides, telopeptides, triple-helical domain and signal peptide) are indicated by shading and hatching. Also shown
are the relative sizes of two clones previously reported (Chu et al., 1982; Bernard et al., 1983b) for the human proα1(I) chain
(Hf404 and Hf677). Scale bar, 500 bp. Symbols: Av, AvaI site; Ba, BamHI site; E, EcoRI site; P, PvuII site; R, RsaI site; X, XhoI site.
Vol. 253
[Sequence figure: nucleotide sequence of the 5'-end of pHUCI with the deduced human amino acid sequence, compared with the chicken and bovine sequences. Only the deduced human amino acid sequence is legible and is given below. The number at the end of each row is the position of its last residue; [...] marks residues that are not legible.]
Met Phe Ser Phe Val Asp Leu Arg Leu Leu Leu Leu Leu Ala Ala Thr Ala Leu Leu Thr His Gly Gln [...]  28
[...] Cys Val Gln Asn Gly Leu Arg Tyr His Asp Arg Asp Val Trp Lys Pro Glu Pro Cys  58
Arg Ile Cys Val Cys Asp Asn Gly Lys Val Leu Cys Asp Asp Val Ile Cys Asp Glu Thr Lys Asn Cys Pro Gly Ala Glu Val Pro Glu  88
Gly Glu Cys Cys Pro Val Cys Pro Asp Gly Ser Glu Ser Pro Thr Asp Gln Glu Thr Thr Gly Val Glu Gly Pro Lys Gly Asp Thr Gly  118
Pro Arg Gly Pro Arg Gly Pro Ala Gly Pro Pro Gly Arg Asp Gly Ile Pro Gly Gln Pro Gly Leu Pro Gly Pro Pro Gly Pro Pro Gly  148
Pro Pro Gly Pro Pro Gly Leu Gly Gly Asn Phe Ala Pro Gln Leu Ser Tyr Gly Tyr Asp Glu Lys Ser Thr Gly Gly Ile Ser Val Pro  178
Gly Pro Met Gly Pro Ser Gly Pro Arg Gly Leu Pro Gly Pro Pro Gly Ala Pro Gly Pro Gln Gly Phe Gln Gly Pro Pro Gly Glu Pro  208
Gly Glu Pro Gly Ala Ser Gly Pro Met Gly Pro Arg Gly Pro Pro Gly Pro Pro Gly Lys Asn Gly Asp Asp Gly Glu Ala Gly Lys Pro  238
Gly Arg Pro Gly Glu Arg Gly Pro Pro Gly Pro Gln Gly Ala Arg Gly Leu Pro Gly Thr Ala Gly Leu Pro Gly Met Lys Gly His Arg  268
Gly Phe Ser Gly Leu Asp Gly Ala Lys Gly Asp Ala Gly Pro Ala Gly Pro Lys Gly Glu Pro Gly Ser Pro Gly Glu Asn Gly Ala Pro  298
Gly Gln Met Gly Pro Arg Gly Leu Pro Gly Glu Arg Gly Arg Pro Gly Ala Pro Gly Pro Ala Gly Ala Arg Gly Asn Asp Gly Ala Thr  328
Gly Ala Ala Gly Pro Pro Gly Pro Thr Gly Pro Ala Gly Pro Pro Gly Phe Pro Gly Ala Val Gly Ala Lys Gly Glu Ala Gly Pro Gln  358
Gly Pro Arg Gly Ser Glu Gly Pro Gln Gly Val Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala Gly Ala Ala Gly Pro Ala Gly Asn Pro  388
Gly Ala Asp Gly Gln Pro Gly Ala Lys Gly Ala Asn Gly Ala Pro Gly Ile Ala Gly Ala Pro Gly Phe Pro Gly Ala Arg Gly Pro Ser  418
Gly Pro Gln Gly Pro Gly Gly Pro Pro Gly Pro Lys Gly Asn Ser Gly Glu Pro Gly Ala Pro Gly Ser Lys Gly Asp Thr Gly Ala Lys  448
[...]  472
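The pairing of each codon with the residue printed beneath it in a sequence figure of this kind can be checked mechanically. The following is a minimal sketch, assuming the standard genetic code; the helper name `translate` and the nine-codon test fragment are illustrative assumptions and not part of the authors' procedure.

```python
# Build the standard genetic code from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter residue names, as used in the figure.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Stop",
}


def translate(cdna):
    """Translate an in-frame DNA fragment into three-letter residue names.

    Whitespace is ignored and trailing bases that do not fill a codon are
    dropped. Any character other than A, C, G or T raises KeyError, which
    is a convenient way to flag an illegible base.
    """
    seq = "".join(cdna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return [THREE_LETTER[CODON_TABLE[seq[i:i + 3]]] for i in range(0, usable, 3)]


# Illustrative fragment only (nine codons).
residues = translate("GGT CAG ATG GGC CCC CGT GGC CTG CCT")
```

Applied to a complete 90-nucleotide row, the function should return 30 residues, which gives a quick check on the residue numbering at the end of each row.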
1988
Amino acid            Third base    α1(I)    α2(I)
Gly                   U             0.50     0.51
                      C             0.28     0.22
                      A             0.18     0.22
                      G             0.03     0.05
  Codons examined                   342      342
Pro (total)           U             0.60     0.62
                      C             0.38     0.21
                      A             0.02     0.16
                      G             0        0.01
  Codons examined                   235      199
Pro (Yaa position)    U             0.84     0.73
                      C             0.13     0.09
                      A             0.03     0.16
                      G             0        0.02
  Codons examined                   117      91
Ala                   U             0.75     0.76
                      C             0.20     0.15
                      A             0.05     0.08
                      G             0        0
  Codons examined                   118      107
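Third-base frequencies of this kind can be tabulated directly from an in-frame coding sequence. The following is a minimal sketch, assuming a four-codon family identified by its first two bases (GG for Gly, CC for Pro, GC for Ala); the function name `third_base_usage` and the short test fragment are illustrative assumptions, not the authors' procedure or data set.

```python
from collections import Counter


def third_base_usage(cdna, first_two):
    """Return (frequencies, n) for the third base of one codon family.

    cdna      -- in-frame coding sequence in DNA letters; whitespace is ignored
    first_two -- shared first two bases of the family in RNA letters,
                 e.g. "GG" for Gly, "CC" for Pro, "GC" for Ala
    """
    # Report bases as U, C, A, G, the convention used for codon tables.
    seq = "".join(cdna.split()).upper().replace("T", "U")
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    thirds = Counter(c[2] for c in codons if c[:2] == first_two)
    n = sum(thirds.values())
    freqs = {base: (thirds[base] / n if n else 0.0) for base in "UCAG"}
    return freqs, n


# Illustrative nine-codon fragment only; real use would pass a whole coding region.
FRAGMENT = "GGT CAG ATG GGC CCC CGT GGC CTG CCT"
gly_freqs, gly_n = third_base_usage(FRAGMENT, "GG")
pro_freqs, pro_n = third_base_usage(FRAGMENT, "CC")
```

Restricting the count to Pro codons in the Yaa position would additionally require the positions of the Gly-Xaa-Yaa triplets, which this sketch does not attempt.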
The work presented here was supported in part by N.I.H.
Research Grant AR-38188 and by a grant from the March of
Dimes-Birth Defects Foundation.
REFERENCES
Bernard, M. P., Myers, J. C., Chu, M.-L., Ramirez, F.,
Eikenberry, E. F. & Prockop, D. J. (1983a) Biochemistry 22,
1139-1145