GaCs: an ensemble of genetic algorithm classifiers to predict the long-range contacts


Server


Abstract:
Background: Prediction of long-range inter-residue contacts is an important topic in bioinformatics research. It helps in determining protein structures and understanding protein folding, and therefore advances the annotation of protein functions.

Results: In this paper, we propose a novel ensemble of genetic algorithm classifiers (GaCs) to address the long-range contact prediction problem. Our method is based on a key idea called sequence profile centers (SPCs). Each SPC is the average sequence profile of residue pairs belonging to the same contact class or non-contact class. GaCs are trained on multiple, different pairs of long-range contact data (positive data) and long-range non-contact data (negative data). The negative data sets, which have roughly the same sizes as the positive ones, are constructed by random sampling from the original imbalanced negative data. As a result, about 21.5% of long-range contacts are correctly predicted. We also found that the ensemble of GaCs improves accuracy by around 5.6% over a single GaC.

Conclusions: Classifiers that use sequence profile centers may advance long-range contact prediction. With this approach, key structural features in proteins could be determined with high efficiency and accuracy.
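
The data handling described in the Results can be illustrated with a minimal sketch. It is not the GaCs implementation: the genetic algorithm is not shown, and a simple nearest-center rule with majority voting stands in for it. All function names are hypothetical, and each residue pair is assumed to be already encoded as a fixed-length numeric profile vector.

```python
import random

def balanced_subsets(positives, negatives, n_subsets, seed=0):
    """Pair the positive set with n_subsets random negative samples of equal size."""
    rng = random.Random(seed)
    k = min(len(positives), len(negatives))
    return [(list(positives), rng.sample(negatives, k)) for _ in range(n_subsets)]

def profile_center(vectors):
    """Element-wise mean of equal-length profile vectors (one center per class)."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def nearest_center_label(x, pos_center, neg_center):
    """Return 1 if x is closer (squared Euclidean) to the contact center, else 0.
    Assumption: this rule is a stand-in for a trained genetic algorithm classifier."""
    d_pos = sum((a - b) ** 2 for a, b in zip(x, pos_center))
    d_neg = sum((a - b) ** 2 for a, b in zip(x, neg_center))
    return 1 if d_pos < d_neg else 0

def ensemble_vote(x, centers):
    """Majority vote over a list of (pos_center, neg_center) pairs."""
    votes = sum(nearest_center_label(x, p, n) for p, n in centers)
    return 1 if 2 * votes > len(centers) else 0

# Toy data: two-dimensional "profiles" for contacts and non-contacts.
pos = [[1.0, 1.0], [3.0, 3.0]]
neg = [[0.0, 0.0], [0.0, 2.0], [-2.0, 0.0], [0.0, -2.0]]
subsets = balanced_subsets(pos, neg, n_subsets=3, seed=1)
centers = [(profile_center(p), profile_center(n)) for p, n in subsets]
label = ensemble_vote([2.5, 2.5], centers)
```

Each ensemble member sees all positives but a different random draw of negatives, so the imbalance is removed per member while the ensemble as a whole still covers more of the negative data.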

 

 

DATASET:

  The dataset contains 480 protein chains, listed below:

153l00 1 10 530 10 3 1 1 185 1.60
1a6m00 1 10 490 10 1 1 1 151 1.00
1a8l00 5 1 623 1 1 1 1 226 1.90
1a8q00 3 40 50 1820 12 3 1 274 1.75
1a4400 3 90 280 10 2 3 1 185 1.84
1acf00 3 30 450 30 2 1 1 125 2.00
1af700 5 1 291 1 1 1 1 274 2.00
1agi00 3 10 130 10 4 1 1 125 1.50
1ah700 1 10 575 10 1 1 1 245 1.50
1ajz00 3 20 20 20 1 3 1 282 2.00
1ako00 3 60 10 10 1 1 1 268 1.70
1amm00 5 1 77 1 1 2 1 174 1.20
1amp00 3 40 630 10 1 1 1 291 1.80
1arb00 5 1 21 1 1 1 1 263 1.20
1bd800 1 25 40 20 3 1 1 156 1.80
1bea00 1 10 120 10 1 1 1 127 1.95
1bn6A0 3 40 50 1820 2 2 1 294 1.50
1bolA0 3 90 730 10 4 1 1 222 2.00
1bxaA0 2 60 40 420 1 1 4 105 1.30
1byrA0 3 30 870 10 3 1 1 155 2.00
1c1kA0 5 1 682 10 1 1 1 217 1.45
1c7kA0 3 40 390 10 1 1 1 132 1.00
1c5200 1 10 760 10 9 1 1 131 1.28
1cc8A0 3 30 70 100 1 1 1 73 1.02
1cei00 1 10 1200 20 1 2 1 94 1.80
1cewI0 3 10 450 10 2 1 1 108 2.00
1chd00 3 40 50 180 1 1 1 203 1.75
1cpn00 2 60 120 200 14 3 1 208 1.80
1cpq00 1 20 120 10 4 1 1 129 1.72
1ctj00 1 10 760 10 2 1 1 89 1.10
1cuoA0 2 60 40 420 6 2 1 129 1.60
1CV800 3 90 70 10 7 2 1 173 1.59
1cyo00 3 10 120 10 2 1 1 93 1.50
1cznA0 3 40 50 360 1 2 1 169 1.70
1dbs00 3 40 50 300 1 1 1 224 1.80
1dhn00 3 30 1130 10 1 1 1 121 1.65
1dlwA0 1 10 490 10 11 1 1 116 1.54
1dupA0 2 70 40 10 1 1 1 136 1.90
1e29A0 1 10 760 10 5 1 1 135 1.21
1eb6A0 3 40 390 10 2 1 1 177 1.00
1edt00 3 20 20 80 34 1 1 271 1.90
1eiqA0 5 1 53 1 1 1 1 289 2.00
1esl00 3 10 100 10 15 1 1 162 2.00
1ew4A0 3 30 920 10 1 1 1 106 1.40
1exrA0 5 1 36 1 1 1 1 146 1.00
1ey0A0 2 40 50 90 1 2 1 149 1.60
1fi2A0 2 60 120 10 4 1 1 201 1.60
1fm4A0 3 30 530 20 1 2 1 159 1.97
1fna00 2 60 40 30 3 1 1 91 1.80
1fnc00 5 1 71 1 1 2 1 296 1.70
1fua00 3 40 225 10 2 1 5 215 1.92
1g5tA0 3 40 50 300 39 1 1 170 1.80
1g8aA0 5
1g66A0 3 40 50 1820 1 1 1 207 0.90
1gn0A0 3 40 250 10 1 1 2 108 1.80
1gnuA0 3 10 20 90 3 1 1 117 1.75
1gpr00 2 70 70 10 1 1 1 162 1.90
1gqnA0 3 20 20 70 24 1 1 252 1.78
1gqvA0 3 10 130 10 2 1 1 135 0.98
1hbkA0 1 20 80 10 4 1 1 94 2.00
1hcz00 5 1 45 1 1 3 1 250 1.96
1hfs00 3 40 390 10 3 3 1 160 1.70
1hg7A0 3 90 1210 10 1 2 1 66 1.15
1hh7A0 1 10 760 10 3 1 1 114 1.40
1hka00 3 30 70 560 1 1 1 158 1.50
1hztA0 3 90 79 10 1 1 1 160 1.45
1i1nA 3
1i2kA0 5 1 715 10 1 1 1 269 1.79
1i6pA0 3 40 1050 10 2 1 1 220 2.00
1i6tA0 3 90 80 10 1 1 1 175 1.20
1i9dA0 5 1 936 1 1 1 1 138 1.65
1i9gA0 5 1 1038 1 1 1 1 264 1.98
1iab00 3 40 390 10 7 1 1 200 1.79
1igd00 3 10 20 10 1 1 1 61 1.10
1iqqA0 3 90 730 10 2 1 1 200 1.50
1iqzA0 3 30 70 20 1 1 1 81 0.92
1is6A0 2 60 120 200 4 1 1 135 1.70
1ispA0 3 40 50 1820 7 1 1 181 1.30
1j0oA0 3 90 10 10 1 1 2 107 1.15
1j1tA0 2 60 120 200 23 1 1 228 2.00
1j2lA0 4 10 70 10 1 1 1 70 1.70
1j7gA0 3 50 80 10 1 2 1 144 1.64
1j27A0 3 30 70 1120 1 1 1 98 1.70
1jbc00 2 60 120 200 1 1 1 237 1.20
1jfxA0 3 20 20 80 22 1 1 217 1.65
1jk7A0 3 60 21 10 2 1 1 299 1.90
1jvwA0 3 10 50 40 2 1 1 167 1.70
1k12A0 2 60 120 260 11 1 1 158 1.90
1kao00 3 40 50 300 4 2 1 167 1.70
1khiA0 5 1 1082 10 1 1 1 147 1.78
1klxA0 1 25 40 10 9 1 1 138 1.95
1knb00 2 60 90 10 1 2 1 196 1.70
1kt7A0 2 40 128 20 2 1 1 183 1.27
1kv9A2 1 10 760 10 14 1 1 103 1.90
1l8fA0 2 40 40 10 1 2 1 207 1.80
1lam01 3 40 220 10 1 1 1 165 1.60
1lki00 1 20 1250 10 9 1 1 180 2.00
1lkoA0 5 1 276 1 1 1 1 190 1.63
1lp8A0 5
1ltuA0 1 10 800 10 1 1 1 297 1.74
1lwbA0 1 20 90 10 3 1 1 122 1.05
1lxzA0 2 60 110 10 1 1 2 207 1.25
1ly2A0 5 1 843 1 1 1 1 130 1.80
1md6A0 2 80 10 50 6 1 1 154 1.60
1mgtA0 5 1 605 1 1 1 1 169 1.80
1mh9A0 5 1 1090 11 1 1 1 194 1.80
1mml00 5 1 115 1 1 1 1 251 1.80
1mzl00 1 10 110 10 1 1 1 93 1.90
1nar00 3 20 20 80 31 1 1 290 1.80
1nepA0 2 60 40 770 1 1 1 130 1.70
1nf9A0 3 40 50 850 1 1 1 207 1.50
1nogA0 1 20 1200 10 1 1 1 155 1.55
1nox00 3 40 109 10 1 1 1 205 1.59
1np4A0 2 40 128 20 1 1 1 184 1.50
1npk00 3 30 70 141 1 1 1 154 1.80
1nsj00 3 20 20 70 41 1 1 205 2.00
1ntfA0 3 60 10 10 1 1 1 280 1.80
1nwzA0 3 30 450 20 1 1 1 125 0.82
1o4yA0 2 60 120 200 6 1 1 288 1.48
1o08A0 5 1 1081 10 1 1 1 221 1.20
1oejA0 2
1ok0A0 2 60 40 20 1 1 1 74 0.93
1orb00 5 1 122 1 1 1 1 293 2.00
1ouvA0 1 25 40 10 13 1 1 265 2.00
1p5fA0 3 40 50 880 1 1 1 189 1.10
1pbn00 3 40 50 1580 1 1 2 289 2.00
1plc00 2 60 40 420 4 1 1 99 1.33
1pmi01 2 60 120 10 8 1 1 185 1.70
1pmy00 2 60 40 420 5 2 1 123 1.50
1ppn00 3 90 70 10 3 1 1 212 1.60
1puc00 3 30 170 10 1 1 1 105 1.95
1pxfA0 2 40 50 140 15 1 1 111 1.87
1pz4A0 3 30 1050 10 1 1 1 116 1.35
1qb7A0 3 40 50 2020 4 1 1 236 1.50
1qgiA0 5 1 654 10 1 1 1 259 1.60
1qj8A0 2 40 128 70 2 1 1 148 1.90
1qk8A0 3 40 30 10 6 1 2 146 1.40
1qtwA0 3 20 20 150 2 1 1 285 1.02
1qy1A0 2 40 128 20 11 1 1 174 1.70
1qz9A0 3 90 1150 10 8 1 1 143 1.85
1rj1A 1
1rlhA 3
1rljA0 3 40 50 360 16 1 1 135 2.00
1rocA0 2 60 40 1490 1 1 1 155 1.50
1rv9A 3
1rw7A0 3 40 50 880 10 1 1 243 1.80
1s8nA0 5
1sf9A 2
1sjwA 3
1skf00 3 40 710 10 5 1 6 262 2.00
1smlA0 3 60 15 10 5 1 1 269 1.70
1sur00 3 40 50 620 22 1 1 215 2.00
1sznA2 2 60 40 1180 6 1 1 102 1.54
1tal00 5 1 75 1 1 1 1 198 1.50
1thm00 3 40 50 200 1 5 1 279 1.37
1tif00 3 10 20 80 1 1 1 78 1.80
1tml00 3 20 20 40 3 1 1 286 1.80
1tp6A0 3 10 450 50 5 1 1 126 1.50
1tzvA0 1 10 940 10 1 1 1 141 1.35
1uaiA0 2 60 120 200 2 1 1 223 1.20
1udh00 3 40 470 10 1 3 1 228 1.75
1uekA0 5 1 2099 10 1 1 1 268 1.70
1uohA0 1 25 40 20 6 1 1 226 2.00
1v7rA0 3 90 950 10 1 1 1 186 1.40
1vhh00 3 30 1380 10 1 1 1 162 1.70
1vid00 3 40 50 150 22 1 1 221 2.00
1vls00 1 20 120 30 1 1 1 146 1.85
1wab00 3 40 50 1110 2 1 2 216 1.70
1whi00 2 40 150 20 1 1 1 122 1.50
1who00 2 60 40 760 1 1 1 96 1.90
1xnb00 2 60 120 180 1 2 1 185 1.49
1zin00 3 40 50 300 22 1 1 217 1.60
2abk00 5 1 5 1 1 2 1 211 1.85
2acy00 3 30 70 100 4 1 1 98 1.80
2baa00 5 1 42 1 1 2 1 243 1.80
2cba00 3 10 200 10 1 1 1 260 1.54
2end00 1 10 440 10 1 1 1 138 1.45
2hvm00 3 20 20 80 21 2 1 273 1.80
2izi00 2 40 128 30 1 1 3 123 1.70
2lisA0 1 20 150 10 1 1 1 136 1.35
2mcm00 2 60 40 230 1 2 1 112 1.50
2mhr00 1 20 120 50 1 2 1 118 1.70
2pii00 3 30 70 120 2 1 1 112 1.90
2plc00 3 20 20 190 2 1 1 274 2.00
2pth00 3 40 50 1470 1 1 1 193 1.20
2rn200 3 30 420 10 2 1 3 155 1.48
2tgi00 2 10 90 10 2 1 1 112 1.80
3chy00 3 40 50 2300 1 1 2 128 1.66
3lzm00 1 10 530 40 1 4 37 164 1.70
3vub00 2 30 30 110 2 1 1 101 1.40
4lzt00 1 10 530 10 1 1 1 129 0.95
7fd1A0 3 30 70 20 3 1 1 106 1.30
7yasA0 3 40 50 1820 4 1 1 257 1.75
1evjD01 3 40 0 20 1 1 1 54 2.700
1a4iA02 3 40 2 10 0 1 1 89 1.500
1b2lA00 3 40 0 20 8 1 1 54 1.600
2hntF00 2 40 0 10 2 3 2 00 2.500
1fizA02 2 40 0 10 5 1 2 20 2.900
1agqB00 2 10 0 10 1 2 1 96 1.900
1ia2B00 3 40 0 10 2 1 1 92 1.820
1pdgB00 2 10 0 10 3 1 1 83 3.000
1wniA00 3 40 0 10 4 1 2 98 2.200
2bt6A00 3 10 0 30 6 1 1 05 1.500
1bwuA00 2 90 0 10 1 4 4 06 2.800
1r83A00 2 40 0 40 2 1 1 62 1000.000
1bkoD00 3 30 2 10 2 1 1 78 2.750
1e5pC00 2 40 8 20 3 1 1 44 1.630
1bkpA00 3 30 2 10 2 1 1 78 1.700
1lzwA00 3 30 0 10 3 1 1 91 2.500
1z7gC00 3 40 0 20 4 3 1 04 1.900
1qspB00 1 20 0 60 3 1 1 65 2.700
1jeqA02 2 40 0 10 3 1 1 26 2.700
2z6kD00 2 40 0 40 8 1 1 17 3.000
2om5A04 2 60 0 10 6 1 2 90 3.070
1b0yA00 4 10 0 10 1 1 2 85 0.930
1crkD01 1 10 5 10 2 2 3 98 3.000
2o5rA01 3 40 0 20 7 2 1 17 2.340
1rfkB00 3 10 0 30 1 2 1 97 1.250
1ekuA02 1 20 0 10 8 1 2 24 2.900
1ui9A00 3 30 0 40 1 1 1 16 1.650
3clqB03 1 10 0 60 1 1 1 67 2.500
1admA02 3 90 0 10 1 1 1 78 1000.000
1cbkB00 3 30 0 60 1 2 1 60 2.020
1regY00 3 30 0 50 1 1 1 20 1.900
2atbB00 3 30 0 10 1 2 1 65 1.600
2qyzA01 3 30 0 60 1 1 1 53 2.040
1t4wA00 2 60 0 20 3 1 1 96 2.100
1jkmB00 3 40 0 20 7 1 1 61 1.850
1a2jA00 3 40 0 10 2 1 1 88 2.000
1oy0E00 3 20 0 60 3 4 1 48 2.800
1h2aS01 3 40 0 00 1 1 1 80 1.800
1h2aS02 4 10 0 10 1 1 1 85 1.800
1e3uA00 3 40 0 10 8 1 1 43 1.660
2rbbB00 3 10 0 10 6 1 1 25 1.820
2hmoB00 3 10 0 50 5 1 2 92 1.600
1kz9D00 3 40 0 60 3 1 1 47 3.100
2bv0B00 2 60 0 10 2 1 1 38 1.800
1p9sB02 2 40 0 10 9 1 2 95 2.540
1esgA00 3 40 1 20 2 1 1 09 1.900
1a73B00 3 90 5 10 1 1 1 62 1.800
2qfkB00 1 10 0 10 7 1 1 37 1.620
1znoB00 3 90 0 10 2 2 1 60 2.000
1dabA00 2 60 0 20 2 1 1 39 2.500
3ce1A00 2 60 0 00 1 3 1 50 1.200
2trhB00 2 60 0 80 1 1 1 27 1.900
1mwrB03 3 30 0 30 1 1 1 68 2.450
1j97B02 1 10 0 10 1 1 1 61 1.500
1j97B01 3 40 0 00 1 1 1 46 1.500
1cumB00 3 40 9 10 5 1 1 30 1000.000
1f6kC00 3 20 0 70 2 1 1 85 1.600
1jt2A00 3 40 0 20 1 1 1 55 1.800
3b5mD01 2 30 0 10 8 1 1 19 1.210
1vjkA00 3 10 0 30 4 1 1 88 1.510
1h9kA01 2 40 0 00 2 3 1 70 1.800
1e6vE01 3 30 0 70 2 3 1 47 2.700
1m5hH02 3 30 0 20 1 2 2 46 2.000
1fzwH00 3 90 0 10 4 1 1 92 1.900
1g0sA00 3 90 9 10 1 1 1 01 1.900
1jd3A00 3 40 0 10 1 1 1 64 2.030
2dy0B00 3 40 0 20 1 1 1 81 1.250
1bwdB00 3 75 0 10 4 2 1 48 3.100
3cwbR02 2 02 0 10 2 2 3 29 3.510
1gd0B00 3 30 9 10 1 1 1 14 1.500
1ge5A00 3 40 0 10 6 1 1 67 2.000
1yphD00 2 40 0 10 5 1 1 31 1.340
1yphF00 2 40 0 10 2 1 1 97 1.340
2zczE00 2 60 0 50 1 1 1 68 1.800
1gu2B00 1 10 0 10 1 1 1 23 1.190
1jwlB03 3 40 0 00 1 1 1 43 4.000
1drjA02 3 40 0 00 3 1 1 46 2.500
3cddC01 2 30 0 10 1 1 1 69 2.100
1fujD02 2 40 0 10 3 3 1 08 2.200
2o01F00 1 10 8 10 1 2 1 54 3.400
1gvnD00 3 40 0 00 8 1 1 71 1.950
3buzA01 3 90 6 10 5 1 1 11 2.810
1oxcD00 2 60 0 00 1 1 1 14 1.200
1w6sD00 4 10 0 10 1 1 1 73 1.200
2qe7H01 2 60 5 10 1 3 1 88 3.060
2bcjG00 4 10 0 10 2 1 2 64 3.060
1p51D00 4 10 0 10 1 1 1 92 2.500
1lv1A01 2 40 0 10 4 1 0 99 2.100
1hxhA00 3 40 0 20 7 1 1 53 1.220
1i0sA00 2 30 0 10 3 1 1 61 1.650
1qvvB00 3 40 0 80 2 1 1 36 2.350
1o72B00 2 60 0 20 2 1 1 75 2.410
1x9rB00 2 60 0 20 6 2 1 04 1.900
1wk2A00 2 30 0 30 1 1 1 82 2.500
1inlB01 3 40 0 50 3 1 1 25 1.500
1vh9B00 3 10 9 10 1 3 1 38 2.150
1ugpB01 1 10 2 20 2 1 1 26 1.630
1ugpB02 2 30 0 50 2 1 1 97 1.630
2cf4A01 3 40 0 10 0 1 1 36 3.080
1eifA01 2 30 0 30 2 2 1 68 1.900
1eifA02 2 40 0 40 6 2 1 60 1.900
1v7zF00 3 40 0 10 1 1 1 57 1.600
1nf6F00 1 20 0 10 4 1 1 71 2.350
1m5qU02 3 30 0 60 1 1 2 55 2.000
1qd2A01 3 40 0 10 1 3 1 62 1.860
1qd2A02 4 10 0 10 1 3 1 85 1.860
1ilvA00 3 40 0 10 1 3 1 35 2.000
1vj1A02 3 40 0 20 2 1 2 66 2.100
2c7kB00 1 10 0 20 1 3 1 72 3.200
1sdzA00 1 10 0 10 2 1 1 97 1.780
2jfqB01 3 40 0 60 2 5 1 52 2.150
2dx7B01 3 40 0 60 3 1 1 19 2.000
1ehyD00 3 40 0 20 0 1 1 82 2.100
3b3iA02 2 60 0 10 5 1 1 95 1.860
3b3iA01 3 30 0 10 1 1 1 81 1.860
1ddqC03 3 90 0 10 1 1 2 80 1000.000
2hr5B01 1 20 0 10 5 2 1 31 2.700
1jkvF02 3 30 0 10 1 1 1 63 1.390
1jdqA00 3 30 0 40 4 1 1 98 999.000
1jr0E00 2 40 0 10 1 1 1 03 1.300
1jr2A01 3 40 0 90 3 1 1 20 1.840
1jr2A02 3 40 0 90 4 1 1 40 1.840
1k1eC00 3 40 0 00 0 1 1 74 1.670
1f81A00 1 20 0 10 2 1 2 87 999.000
1vf4A01 3 40 0 10 3 1 1 18 2.450
1pmzD00 1 10 8 10 7 1 1 82 1000.000
1kpfA00 3 30 8 10 4 1 1 11 1.500
1ugwE00 2 00 0 30 2 1 1 33 1.700
1kv8B00 3 20 0 70 5 1 1 15 1.620
1z9sB00 2 60 0 10 1 1 1 49 2.200
1yntG02 2 60 0 20 1 1 1 23 3.100
1nrwA01 3 40 0 00 1 2 1 49 1.700
1wr8B02 3 90 0 10 2 1 1 69 1.600
1vcvB00 3 20 0 70 0 1 1 26 2.000
1qdlB00 3 40 0 80 4 1 1 95 2.500
2o7nA00 3 40 0 10 2 1 1 80 1.750
2a8fB00 1 10 9 10 1 1 1 98 1.350
1m0sB02 3 30 0 60 1 1 2 72 1.900
1m0sB01 3 40 0 60 7 1 2 46 1.900
1lniA00 3 10 0 30 2 1 1 96 1.000
1iujB00 3 30 0 00 2 1 1 03 1.600
1xtcA00 3 90 0 10 1 1 1 92 2.400
1gpjA02 3 40 0 20 7 1 1 54 1.950
1karB02 3 40 0 70 1 1 1 90 2.100
1m2bB00 3 40 0 10 6 1 1 01 1.250
1m38A00 3 90 0 10 2 1 1 82 1.800
1m44A00 3 40 0 30 2 1 1 77 1.600
1m4rB00 1 20 0 10 3 1 1 41 2.000
3ypiB00 3 20 0 70 1 3 1 47 2.800
2c1dH00 1 10 0 10 8 2 1 37 1.920
1dw2C00 1 10 0 10 2 1 1 12 2.200
3hflL02 2 60 0 10 3 1 1 03 1000.000
1fh5L01 2 60 0 10 4 1 2 08 2.900
2essA02 3 10 9 10 1 1 1 98 1.900
2o9uX00 3 10 0 10 1 1 1 96 1.150
2fl5H01 2 60 0 10 0 1 6 20 3.000
3ckdC01 1 20 0 30 1 1 1 79 2.650
3bvoB02 1 20 0 20 4 1 1 75 3.000
1mu4A00 3 30 0 10 1 2 1 86 1.800
1n0qB00 1 25 0 20 7 1 1 93 1.260
2qqdG00 3 50 0 10 1 1 1 52 2.000
1fheA02 1 20 0 10 3 2 2 05 3.000
2ntoA01 3 40 0 10 6 1 1 95 2.100
1pszA02 3 40 0 80 9 1 1 31 2.000
2ov1A01 3 40 0 80 2 1 1 54 2.500
1g9zB00 3 10 8 10 2 1 1 52 1.800
1kejA01 1 10 0 10 3 1 1 95 3.000
1xo3A00 3 10 0 30 2 2 1 01 999.000
1iygA00 1 25 0 10 1 1 3 33 999.000
1nbaB00 3 40 0 50 6 1 1 52 2.000
1lmlA01 3 10 0 20 1 1 1 70 1.860
1npyA02 3 40 0 20 2 1 1 69 1.750
1npyA01 3 40 2 10 9 1 1 00 1.750
1ih8A00 3 40 0 20 2 1 1 71 1.900
2pcnA00 3 50 0 40 1 3 1 61 1.900
2uy9A02 2 60 0 10 7 1 1 74 3.100
1td3C00 2 40 0 10 2 1 1 03 2.370
1q90C00 2 02 0 10 4 1 4 26 3.100
1o0wA01 1 10 0 10 2 1 1 54 2.000
1o0wA02 3 30 0 20 2 1 1 68 2.000
1o4tA00 2 60 0 10 7 1 1 15 1.950
1qwrB02 2 60 0 10 7 1 1 87 1.800
1cefA00 3 40 0 10 4 1 1 47 2.040
1o82D00 1 20 5 10 1 1 1 70 1.460
2gs4B00 1 20 0 10 2 2 1 52 2.000
1obpB00 2 40 8 20 8 2 1 55 2.000
1ofnA00 2 60 0 10 1 2 1 02 1.500
1w8sJ00 3 20 0 70 0 1 1 50 1.850
1fk8A00 3 40 0 20 9 2 1 40 1.950
1jraD00 1 20 0 10 1 1 1 03 2.000
1mumB00 3 20 0 60 1 1 1 89 1.900
1otkA00 1 20 0 10 6 1 1 44 2.000
1p1xB00 3 20 0 70 2 1 1 51 0.990
1p6oA00 3 40 0 10 2 1 1 56 1.140
1otzx00 2 60 0 40 3 2 1 44 3.300
1pkhB00 2 70 0 10 6 1 1 75 1.420
1x8mF02 2 60 0 00 1 1 3 37 2.600
1cp9A02 1 10 7 50 1 1 2 30 2.500
1e3aA01 1 10 9 10 1 1 1 47 1.800
1psrA00 1 10 8 10 1 1 1 00 1.050
1t6iC00 1 20 0 00 1 1 1 07 2.810
1j9oA00 2 40 0 40 3 1 1 93 999.000
2cslF00 3 30 0 40 2 2 1 21 2.500
1qftB00 2 40 8 20 7 1 1 70 1.250
1ko2A00 3 60 5 10 2 2 1 29 2.200
1qaeB00 3 40 0 10 1 1 1 40 2.050
1qveA00 3 30 0 10 2 1 1 26 1.540
1qxrB00 2 60 0 10 1 1 1 87 1.700
1ewqB01 3 40 0 10 1 1 1 15 2.200
1smtA00 1 10 0 10 7 1 1 98 2.200
2b97B00 3 20 0 10 1 1 1 70 0.750
4ullA00 2 40 0 70 1 1 1 69 999.000
1r5tC00 3 40 0 10 5 1 1 40 2.000
1n7uA02 2 70 0 10 1 1 1 63 2.400
1rk4B02 1 20 0 10 1 1 1 30 1.790
2in3A01 3 40 0 10 0 1 1 79 1.850
1g2rA00 3 30 0 10 1 1 1 94 1.350
1md9A02 3 40 0 80 8 1 1 70 2.800
3b8eC03 3 40 0 00 2 2 1 74 3.500
2d9bA00 3 90 0 10 1 5 1 09 999.000
1rmqB00 3 40 0 00 0 1 1 08 2.000
1a75B00 1 10 8 10 3 1 4 08 1.900
1obuA00 2 40 8 20 0 1 1 81 2.000
1q4sA00 3 10 9 10 5 1 1 42 1.950
1j1tA00 2 60 0 00 3 1 1 28 2.000
1sh8B00 3 10 9 10 9 1 1 49 1.500
1smoB00 2 60 0 10 8 1 1 10 1.470
1nunA00 2 80 0 50 7 3 1 39 2.900
1jlyB02 2 80 0 50 0 1 1 40 2.200
1rrpC00 3 40 0 00 1 1 2 80 2.960
1i1oA00 3 40 0 60 4 1 1 47 2.000
1u02A02 3 30 0 20 1 1 1 73 1.920
1tuwA00 3 30 0 90 1 1 1 06 1.900
1yq1B01 3 40 0 10 2 5 1 86 3.000
1ffuF01 3 30 3 10 1 1 2 54 2.350
1nwjA00 3 30 0 00 1 1 1 12 1000.000
1ty9A00 2 30 0 10 3 1 1 02 1.800
1tzyE00 1 10 0 10 6 1 1 04 1.900
1tzyF00 1 10 0 10 3 1 1 93 1.900
1tzyC00 1 10 0 10 1 1 1 95 1.900
1kx3F00 1 10 0 10 5 1 1 87 2.000
1u11A00 3 40 0 00 1 2 1 59 1.550
1u1sF00 2 30 0 00 2 1 1 65 1.600
1u1wA02 3 10 0 10 1 1 1 48 1.350
1xubA01 3 10 0 10 3 1 1 30 1.300
1onvA00 1 10 0 10 1 1 2 67 999.000
1o3uA00 1 20 0 30 2 1 1 19 1.750
1ugrA00 3 90 0 10 1 2 1 01 1.800
1uhjA00 1 10 8 10 4 1 3 82 1.800
1ulmB02 3 30 0 10 2 5 1 42 1.800
1ulkB02 3 30 0 10 2 5 2 41 1.800
1ulmB01 3 30 0 10 2 7 1 40 1.800
1uqrE00 3 40 0 00 1 2 1 43 1.700
1w52X01 3 40 0 20 0 1 7 35 2.990
1gz9A00 2 60 0 00 2 2 1 39 1.700
1nzaA00 3 30 0 30 3 1 1 03 1.700
1lmlA03 2 10 5 10 1 1 1 63 1.860
1ihoB02 3 30 0 10 2 1 1 03 1.700
1ihoB01 3 40 0 20 5 2 1 79 1.700
1wzgB00 1 20 0 10 1 1 1 09 1.750
1h4zA00 3 30 0 24 1 1 1 10 2.740
2atmA00 3 20 0 70 4 2 1 23 2.000
1b6uA01 2 60 0 10 2 1 2 98 3.000
2om5A03 2 60 0 10 2 1 2 79 3.070
1sv7A00 3 10 0 10 7 1 1 09 1000.000
1vhlB00 3 40 0 00 3 1 1 08 1.650
1vkeD00 1 20 0 10 5 1 1 01 1.560
3balD01 2 60 0 10 3 1 2 29 1.950
1vpsE00 2 60 5 10 1 1 1 85 1.900
1ny1B00 3 20 0 70 1 1 1 31 1.800
1wa3D00 3 20 0 70 1 1 1 03 1.900
1vr0C00 3 90 0 10 1 2 1 30 2.490
1wkqB00 3 40 0 10 3 1 1 55 1.170
1wmzD00 3 10 0 10 2 1 1 40 1.700
1jcgA01 3 30 0 40 0 1 1 62 3.100
3cerC02 3 30 0 50 2 1 1 99 2.400
1wpbP01 1 10 7 80 1 1 1 56 2.000
1wpbP02 1 10 0 10 1 1 1 12 2.000
1ibzC00 2 60 0 20 7 1 1 10 2.300
1x6iA00 1 10 0 50 1 1 1 89 1.200
1t38A02 1 10 0 10 0 2 1 89 3.200
1kp0B02 3 90 0 10 7 1 3 46 2.700
1xizA00 3 40 0 10 2 1 1 55 2.000
1xo7A00 2 40 0 10 1 1 1 66 1.610
1xrjA00 3 40 0 00 9 1 1 11 2.000
2ofdB00 2 60 0 20 1 1 1 41 1.960
1j2rB00 3 40 0 50 2 1 1 85 1.300
1ydgH00 3 40 0 60 9 1 1 02 2.000
1corA00 1 10 0 10 0 1 4 81 999.000
3dvpB00 3 30 0 10 1 1 1 85 2.500
1xh3B00 2 60 0 10 2 1 1 99 1.480
1jafB00 1 20 0 10 1 2 1 28 2.500
2h47G00 2 60 0 10 1 1 1 25 2.600
1ksjB00 2 70 0 40 1 1 1 38 2.600
2bkvA00 3 40 0 60 2 1 1 41 1.500
2einT00 4 10 5 10 1 1 1 83 2.700
1ps2A00 4 10 0 10 1 3 1 60 999.000
2hq8B00 1 10 8 10 9 1 1 82 1.800
1eqnE03 1 20 0 20 2 1 1 45 2.900
2trxA00 3 40 0 10 3 5 1 08 1.680
1utgA00 1 10 0 10 1 1 1 70 1.340
2r4zB00 1 10 0 10 2 1 1 46 1.600
1a6eB02 3 30 0 10 2 3 1 07 3.200
3kauB00 2 10 0 10 1 1 1 01 1000.000


* A protein chain containing several domains is listed separately, one entry per domain.
-----------------------------------------------------------------------

CATH List File (CLF) Format 2.0:
-------------------------------
This file format has an entry for each structural entry in CATH.

Column 1: CATH domain name (seven characters)
Column 2: Class number
Column 3: Architecture number
Column 4: Topology number
Column 5: Homologous superfamily number
Column 6: S35 sequence cluster number
Column 7: S60 sequence cluster number
Column 8: S95 sequence cluster number
Column 9: Domain length
Column 10: Structure resolution (Angstroms)
(999.000 for NMR structures and 1000.000 for obsolete PDB entries)
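
The column layout above can be read with a short parser. The following is a minimal sketch, not part of the GaCs server: the names `ClfEntry`, `parse_clf_line`, and `resolution_note` are hypothetical, and rows that do not have exactly ten whitespace-separated fields are skipped rather than guessed at.

```python
from typing import NamedTuple, Optional

class ClfEntry(NamedTuple):
    domain: str          # Column 1: CATH domain name
    cath_class: int      # Column 2: Class number
    architecture: int    # Column 3: Architecture number
    topology: int        # Column 4: Topology number
    superfamily: int     # Column 5: Homologous superfamily number
    s35: int             # Column 6: S35 sequence cluster number
    s60: int             # Column 7: S60 sequence cluster number
    s95: int             # Column 8: S95 sequence cluster number
    length: int          # Column 9: Domain length
    resolution: float    # Column 10: Structure resolution (Angstroms)

def parse_clf_line(line: str) -> Optional[ClfEntry]:
    """Parse one listing row; return None for rows without ten fields."""
    fields = line.split()
    if len(fields) != 10:
        return None  # incomplete row: skip instead of inventing values
    ints = [int(f) for f in fields[1:9]]
    return ClfEntry(fields[0], *ints, float(fields[9]))

def resolution_note(entry: ClfEntry) -> str:
    """Interpret the sentinel resolution values described in the legend."""
    if entry.resolution == 999.0:
        return "NMR"
    if entry.resolution == 1000.0:
        return "obsolete"
    return "resolved"

# Rows copied from the listing above.
full = parse_clf_line("1a6m00 1 10 490 10 1 1 1 151 1.00")
short = parse_clf_line("1g8aA0 5")
nmr = parse_clf_line("1jdqA00 3 30 0 40 4 1 1 98 999.000")
old = parse_clf_line("1r83A00 2 40 0 40 2 1 1 62 1000.000")
```

Checking the sentinel values before using column 10 keeps NMR and obsolete entries from being treated as extremely low-resolution structures.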
 


Copyright @ 2004-2010 by Peng Chen

All Rights Reserved