UniProtKB/Swiss-Prot protein knowledgebase release 55.6 statistics
1. INTRODUCTION
Release 55.6 of 01-Jul-08 of UniProtKB/Swiss-Prot contains 390696 sequence entries,
comprising 140503634 amino acids abstracted from 171250 references.
1671 sequences have been added since release 55.5, the sequence data of
70 existing entries has been updated and the annotations of
24738 entries have been revised.
Number of fragments: 8132
Number of additional sequences produced by alternative splicing, initiation or promoter usage, or ribosomal frameshifting: 26036
Protein existence:
PE 1: Evidence at protein level 59554 entries
PE 2: Evidence at transcript level 62868 entries
PE 3: Inferred from homology 254004 entries
PE 4: Predicted 13076 entries
PE 5: Uncertain 1194 entries
The growth of the database is summarized below.
2. AMINO ACID COMPOSITION
2.1 Composition in percent for the complete database
Ala (A) 8.13 Gln (Q) 3.95 Leu (L) 9.67 Ser (S) 6.66
Arg (R) 5.49 Glu (E) 6.73 Lys (K) 5.88 Thr (T) 5.36
Asn (N) 4.05 Gly (G) 7.04 Met (M) 2.41 Trp (W) 1.09
Asp (D) 5.40 His (H) 2.28 Phe (F) 3.88 Tyr (Y) 2.93
Cys (C) 1.42 Ile (I) 5.92 Pro (P) 4.78 Val (V) 6.82
Asx (B) 0.000 Glx (Z) 0.000 Xaa (X) 0.00
Legend: gray = aliphatic, red = acidic, green = small hydroxy,
blue = basic, black = aromatic, white = amide, yellow = sulfur
2.2 Classification of the amino acids by their frequency
Leu, Ala, Gly, Val, Glu, Ser, Ile, Lys, Arg, Asp, Thr, Pro, Asn, Gln,
Phe, Tyr, Met, His, Cys, Trp
3. TAXONOMIC ORIGIN
Total number of species represented in this release of UniProtKB/Swiss-Prot: 11444
The first twenty species represent 97825 sequences: 25 % of the total
number of entries.
3.1 Table of the frequency of occurrence of species
Species represented 1x: 5234
2x: 1698
3x: 823
4x: 545
5x: 418
6x: 318
7x: 231
8x: 195
9x: 168
10x: 107
11- 20x: 511
21- 50x: 344
51-100x: 140
>100x: 712
3.2 Table of the most represented species
------ --------- --------------------------------------------
Number Frequency Species
------ --------- --------------------------------------------
1 19929 Homo sapiens (Human)
2 15740 Mus musculus (Mouse)
3 7098 Rattus norvegicus (Rat)
4 6867 Arabidopsis thaliana (Mouse-ear cress)
5 6554 Saccharomyces cerevisiae (Baker's yeast)
6 5327 Bos taurus (Bovine)
7 4421 Schizosaccharomyces pombe (Fission yeast)
8 4343 Escherichia coli (strain K12)
9 3189 Caenorhabditis elegans
10 2876 Bacillus subtilis
11 2813 Drosophila melanogaster (Fruit fly)
12 2788 Xenopus laevis (African clawed frog)
13 2344 Dictyostelium discoideum (Slime mold)
14 2162 Danio rerio (Zebrafish) (Brachydanio rerio)
15 2111 Pongo abelii (Sumatran orangutan)
16 2045 Gallus gallus (Chicken)
17 1947 Escherichia coli O157:H7
18 1782 Methanocaldococcus jannaschii (Methanococcus jannaschii)
19 1774 Haemophilus influenzae
20 1715 Oryza sativa subsp. japonica (Rice)
21 1698 Salmonella typhimurium
22 1624 Escherichia coli O6
23 1623 Shigella flexneri
24 1442 Mycobacterium tuberculosis
25 1318 Sus scrofa (Pig)
26 1290 Salmonella typhi
27 1240 Pseudomonas aeruginosa
28 1183 Mycobacterium bovis
29 1169 Xenopus tropicalis (Western clawed frog) (Silurana tropicalis)
30 1117 Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey)
31 988 Synechocystis sp. (strain PCC 6803)
32 980 Archaeoglobus fulgidus
33 950 Yersinia pestis
34 912 Vibrio cholerae
35 909 Acanthamoeba polyphaga mimivirus (APMV)
36 886 Rhizobium meliloti (Sinorhizobium meliloti)
37 871 Oryctolagus cuniculus (Rabbit)
38 861 Salmonella paratyphi A
39 859 Staphylococcus aureus (strain Mu50 / ATCC 700699)
40 858 Staphylococcus aureus (strain N315)
41 830 Staphylococcus aureus (strain MW2)
42 830 Staphylococcus aureus (strain COL)
43 826 Staphylococcus aureus (strain MSSA476)
44 823 Staphylococcus aureus (strain MRSA252)
45 809 Salmonella choleraesuis
46 807 Yersinia pseudotuberculosis
47 806 Shigella sonnei (strain Ss046)
48 802 Escherichia coli O6:K15:H31 (strain 536 / UPEC)
49 763 Vibrio parahaemolyticus
50 763 Ashbya gossypii (Yeast) (Eremothecium gossypii)
51 762 Shigella boydii serotype 4 (strain Sb227)
52 759 Aquifex aeolicus
53 754 Pasteurella multocida
54 746 Shigella dysenteriae serotype 1 (strain Sd197)
55 745 Canis familiaris (Dog)
56 741 Escherichia coli O9:H4 (strain HS)
57 738 Escherichia coli (strain UTI89 / UPEC)
58 737 Escherichia coli O139:H28 (strain E24377A / ETEC)
59 736 Kluyveromyces lactis (Yeast) (Candida sphaerica)
60 727 Candida albicans (Yeast)
61 722 Erwinia carotovora subsp. atroseptica (Pectobacterium atrosepticum)
62 717 Neurospora crassa
63 707 Streptomyces coelicolor
64 705 Vibrio vulnificus
65 705 Escherichia coli (strain ATCC 8739 / DSM 1576 / Crooks)
66 699 Staphylococcus epidermidis (strain ATCC 35984 / RP62A)
67 698 Staphylococcus epidermidis (strain ATCC 12228)
68 694 Candida glabrata (Yeast) (Torulopsis glabrata)
69 691 Photorhabdus luminescens subsp. laumondii
70 688 Vibrio vulnificus (strain YJ016)
71 688 Bacillus halodurans
72 687 Mycoplasma pneumoniae
73 681 Shigella flexneri serotype 5b (strain 8401)
74 668 Pan troglodytes (Chimpanzee)
75 663 Bacillus anthracis
76 653 Yersinia pestis bv. Antiqua (strain Nepal516)
77 651 Anabaena sp. (strain PCC 7120)
78 647 Yersinia pestis bv. Antiqua (strain Antiqua)
79 647 Mycobacterium leprae
80 647 Yersinia enterocolitica serotype O:8 / biotype 1B (strain 8081)
81 637 Pseudomonas syringae pv. tomato
82 636 Pseudomonas putida (strain KT2440)
83 634 Yersinia pseudotuberculosis serotype O:1b (strain IP 31758)
84 626 Escherichia coli O1:K1 / APEC
85 619 Escherichia coli
86 619 Staphylococcus aureus (strain NCTC 8325)
87 616 Bradyrhizobium japonicum
88 615 Salmonella paratyphi B (strain ATCC BAA-1250 / SPB7)
89 613 Treponema pallidum
90 609 Zea mays (Maize)
91 607 Enterobacter sp. (strain 638)
92 596 Yersinia pestis (strain Pestoides F)
93 595 Methanobacterium thermoautotrophicum
94 594 Klebsiella pneumoniae subsp. pneumoniae (strain ATCC 700721 / MGH 78578)
95 590 Bacillus cereus (strain ATCC 14579 / DSM 31)
96 590 Agrobacterium tumefaciens (strain C58 / ATCC 33970)
97 585 Ralstonia solanacearum (Pseudomonas solanacearum)
98 583 Citrobacter koseri (strain ATCC BAA-895 / CDC 4225-83 / SGSC4696)
99 581 Shewanella oneidensis
100 581 Rickettsia prowazekii
101 579 Helicobacter pylori (Campylobacter pylori)
102 576 Rhizobium loti (Mesorhizobium loti)
103 574 Staphylococcus aureus (strain USA300)
104 573 Serratia proteamaculans (strain 568)
105 572 Buchnera aphidicola subsp. Acyrthosiphon pisum
106 568 Listeria monocytogenes
107 566 Lactococcus lactis subsp. lactis (Streptococcus lactis)
108 562 Buchnera aphidicola subsp. Schizaphis graminum
109 562 Staphylococcus aureus (strain bovine RF122)
110 560 Photobacterium profundum (Photobacterium sp. (strain SS9))
111 560 Helicobacter pylori J99 (Campylobacter pylori J99)
112 560 Listeria innocua
113 558 Neisseria meningitidis serogroup B
114 556 Xanthomonas campestris pv. campestris
115 551 Salmonella arizonae (strain ATCC BAA-731 / CDC346-86 / RSK2980)
116 545 Staphylococcus haemolyticus (strain JCSC1435)
117 539 Staphylococcus saprophyticus subsp. saprophyticus
118 538 Neisseria meningitidis serogroup A
119 536 Brucella melitensis
120 533 Brucella suis
121 532 Bacillus cereus (strain ATCC 10987)
122 532 Yarrowia lipolytica (Candida lipolytica)
123 531 Clostridium acetobutylicum
124 528 Caulobacter crescentus (Caulobacter vibrioides)
125 524 Enterobacter sakazakii (strain ATCC BAA-894)
126 521 Emericella nidulans (Aspergillus nidulans)
127 521 Debaryomyces hansenii (Yeast) (Torulaspora hansenii)
128 521 Xanthomonas axonopodis pv. citri
129 514 Oceanobacillus iheyensis
130 512 Bacillus thuringiensis subsp. konkukian
131 509 Pseudomonas syringae pv. syringae (strain B728a)
132 507 Buchnera aphidicola subsp. Baizongia pistaciae
133 507 Streptococcus pneumoniae
134 503 Vibrio fischeri (strain ATCC 700601 / ES114)
135 503 Pseudomonas fluorescens (strain PfO-1)
136 501 Listeria monocytogenes serotype 4b (strain F2365)
137 500 Bacillus cereus (strain ZK / E33L)
138 499 Xylella fastidiosa
139 498 Pseudomonas fluorescens (strain Pf-5 / ATCC BAA-477)
140 498 Pseudomonas aeruginosa (strain UCBPP-PA14)
141 496 Thermotoga maritima
142 491 Bacillus licheniformis (strain DSM 13 / ATCC 14580)
143 491 Rickettsia conorii
144 491 Bordetella bronchiseptica (Alcaligenes bronchisepticus)
145 490 Xylella fastidiosa (strain Temecula1 / ATCC 700964)
146 488 Pseudomonas syringae pv. phaseolicola (strain 1448A / Race 6)
147 483 Mycoplasma genitalium
148 481 Haemophilus ducreyi
149 480 Bordetella parapertussis
150 480 Chromobacterium violaceum
151 480 Bordetella pertussis
152 478 Deinococcus radiodurans
153 473 Clostridium perfringens
154 472 Sodalis glossinidius (strain morsitans)
155 469 Corynebacterium glutamicum (Brevibacterium flavum)
156 465 Vibrio cholerae serotype O1 (strain ATCC 39541 / Ogawa 395 / O395)
157 464 Methanosarcina acetivorans
158 459 Brucella abortus
159 457 Haemophilus influenzae (strain 86-028NP)
160 456 Pyrococcus horikoshii
161 455 Mannheimia succiniciproducens (strain MBEL55E)
162 454 Pseudomonas entomophila (strain L48)
163 452 Pyrococcus abyssi
164 452 Streptomyces avermitilis
165 451 Xanthomonas campestris pv. campestris (strain 8004)
166 448 Halobacterium salinarium (Halobacterium halobium)
167 446 Rickettsia felis (Rickettsia azadi)
168 446 Enterococcus faecalis (Streptococcus faecalis)
169 445 Pseudomonas aeruginosa (strain PA7)
170 444 Streptococcus pneumoniae (strain ATCC BAA-255 / R6)
171 444 Methanosarcina mazei (Methanosarcina frisia)
172 444 Bacillus clausii (strain KSM-K16)
173 442 Burkholderia pseudomallei (Pseudomonas pseudomallei)
174 442 Shewanella sp. (strain MR-7)
175 439 Lactobacillus plantarum
176 439 Shewanella sp. (strain MR-4)
177 438 Synechococcus elongatus (Thermosynechococcus elongatus)
178 438 Vibrio harveyi (strain ATCC BAA-1116 / BB120)
179 438 Geobacillus kaustophilus
180 436 Streptococcus mutans
181 436 Chlamydia trachomatis
182 434 Thermoanaerobacter tengcongensis
183 434 Oryza sativa subsp. indica (Rice)
184 433 Rickettsia bellii (strain RML369-C)
185 433 Pyrococcus furiosus
186 431 Ovis aries (Sheep)
187 429 Streptococcus pyogenes serotype M6
188 429 Synechococcus sp. (strain PCC 7942) (Anacystis nidulans R2)
189 428 Acinetobacter sp. (strain ADP1)
190 428 Brucella abortus (strain 2308)
191 427 Borrelia burgdorferi (Lyme disease spirochete)
192 427 Nicotiana tabacum (Common tobacco)
193 425 Rhodopseudomonas palustris
194 422 Burkholderia sp. (strain 383) (Burkholderia cepacia
195 422 Campylobacter jejuni
196 422 Anabaena variabilis (strain ATCC 29413 / PCC 7937)
197 421 Xanthomonas campestris pv. vesicatoria (strain 85-10)
198 420 Burkholderia mallei (Pseudomonas mallei)
199 419 Chlamydia pneumoniae (Chlamydophila pneumoniae)
200 418 Pseudomonas putida (strain F1 / ATCC 700007)
201 414 Shewanella frigidimarina (strain NCIMB 400)
202 414 Aspergillus fumigatus (Sartorya fumigata)
203 413 Ralstonia eutropha (strain JMP134) (Alcaligenes eutrophus)
204 413 Shewanella sp. (strain ANA-3)
205 412 Xanthomonas oryzae pv. oryzae (strain MAFF 311018)
206 411 Pseudomonas putida (strain GB-1)
207 410 Methylococcus capsulatus
208 409 Chlamydia muridarum
209 409 Streptococcus pyogenes serotype M1
210 408 Rhizobium sp. (strain NGR234)
211 407 Ralstonia eutropha (Cupriavidus necator
212 407 Sulfolobus solfataricus
213 406 Staphylococcus aureus (strain Newman)
214 405 Streptococcus pyogenes serotype M18
215 403 Rickettsia typhi
216 403 Streptococcus pyogenes serotype M3
217 402 Rhodobacter sphaeroides (strain ATCC 17023 / 2.4.1 / NCIB 8253 / DSM 158)
218 400 Nitrosomonas europaea
219 399 Bacillus amyloliquefaciens (strain FZB42)
220 398 Shewanella baltica (strain OS185)
221 397 Solanum lycopersicum (Tomato) (Lycopersicon esculentum)
222 396 Hahella chejuensis (strain KCTC 2396)
223 396 Gloeobacter violaceus
224 395 Aeromonas hydrophila subsp. hydrophila (strain ATCC 7966 / NCIB 9240)
225 394 Pseudoalteromonas haloplanktis (strain TAC 125)
226 392 Corynebacterium efficiens
227 391 Dechloromonas aromatica (strain RCB)
228 390 Staphylococcus aureus (strain Mu3 / ATCC 700698)
229 389 Chlorobium tepidum
230 389 Shewanella sp. (strain W3-18-1)
231 389 Colwellia psychrerythraea (strain 34H / ATCC BAA-681) (Vibrio psychroerythus)
232 388 Shewanella putrefaciens (strain CN-32 / ATCC BAA-453)
233 387 Neisseria gonorrhoeae (strain ATCC 700825 / FA 1090)
234 385 Burkholderia xenovorans (strain LB400)
235 384 Mycobacterium paratuberculosis
236 384 Idiomarina loihiensis
237 384 Pseudomonas mendocina (strain ymp)
238 383 Synechococcus sp. (strain ATCC 27144 / PCC 6301 / SAUG 1402/1)
239 381 Shewanella denitrificans (strain OS217 / ATCC BAA-1090 / DSM 15013)
240 381 Synechococcus sp. (strain WH8102)
241 381 Pyrococcus kodakaraensis (Thermococcus kodakaraensis)
242 380 Haemophilus influenzae (strain PittEE)
243 380 Shewanella baltica (strain OS195)
244 378 Aeromonas salmonicida (strain A449)
245 378 Shewanella baltica (strain OS155 / ATCC BAA-1091)
246 374 Actinobacillus pleuropneumoniae serotype 5b (strain L20)
247 374 Solanum tuberosum (Potato)
248 374 Shewanella amazonensis (strain ATCC BAA-1098 / SB2B)
249 373 Burkholderia thailandensis (strain E264 / ATCC 700388 / DSM 13276 / CIP 106301)
250 372 Azoarcus sp. (strain EbN1) (Aromatoleum aromaticum (strain EbN1))
251 372 Burkholderia cenocepacia (strain AU 1054)
252 372 Streptococcus agalactiae serotype III
253 371 Prochlorococcus marinus (strain MIT 9313)
254 370 Xanthomonas oryzae pv. oryzae
255 369 Shewanella loihica (strain ATCC BAA-1088 / PV-4)
256 368 Streptococcus agalactiae serotype V
257 368 Coxiella burnetii
258 367 Methanopyrus kandleri
259 367 Listeria welshimeri serovar 6b (strain ATCC 35897 / DSM 20650 / SLCC5334)
260 363 Leptospira interrogans
261 363 Burkholderia pseudomallei (strain 1710b)
262 363 Prochlorococcus marinus
263 363 Bacillus cereus subsp. cytotoxis (strain NVH 391-98)
264 362 Rhizobium etli (strain CFN 42 / ATCC 51251)
265 362 Geobacter sulfurreducens
266 361 Staphylococcus aureus (strain JH1)
267 356 Aeropyrum pernix
268 356 Haemophilus somnus (strain 129Pt) (Histophilus somni (strain 129Pt))
269 355 Staphylococcus aureus (strain JH9)
270 355 Nitrosococcus oceani (strain ATCC 19707 / NCIMB 11848)
271 354 Haemophilus influenzae (strain PittGG)
272 353 Leptospira interrogans serogroup Icterohaemorrhagiae serovar copenhageni
273 351 Shewanella halifaxensis (strain HAW-EB4)
274 351 Pisum sativum (Garden pea)
275 351 Thermus thermophilus (strain HB8 / ATCC 27634 / DSM 579)
276 350 Burkholderia cenocepacia (strain HI2424)
277 349 Legionella pneumophila (strain Paris)
278 349 Ralstonia metallidurans (strain CH34 / ATCC 43123 / DSM 2839)
279 348 Legionella pneumophila (strain Lens)
280 348 Rhizobium leguminosarum bv. viciae (strain 3841)
281 346 Actinobacillus succinogenes (strain ATCC 55618 / 130Z)
282 346 Bacillus pumilus (strain SAFR-032)
283 345 Nocardia farcinica
284 345 Shewanella pealeana (strain ATCC 700345 / ANG-SQ1)
285 345 Chromohalobacter salexigens (strain DSM 3043 / ATCC BAA-138 / NCIMB 13768)
286 345 Sulfolobus tokodaii
287 344 Thiobacillus denitrificans (strain ATCC 25259)
288 344 Prochlorococcus marinus subsp. pastoris (strain CCMP 1378 / MED4)
289 343 Psychromonas ingrahamii (strain 37)
290 343 Glycine max (Soybean)
291 342 Legionella pneumophila subsp. pneumophila
292 340 Mycobacterium tuberculosis (strain ATCC 25177 / H37Ra)
293 339 Saccharophagus degradans (strain 2-40 / ATCC 43961 / DSM 17024)
294 338 Pseudoalteromonas atlantica (strain T6c / BAA-1087)
295 338 Shewanella sediminis (strain HAW-EB3)
296 338 Desulfovibrio vulgaris (strain Hildenborough / ATCC 29579 / NCIMB 8303)
297 337 Silicibacter pomeroyi
298 337 Burkholderia ambifaria (strain ATCC BAA-244 / AMMD) (Burkholderia cepacia
299 336 Macaca mulatta (Rhesus macaque)
300 336 Neisseria meningitidis serogroup C / serotype 2a (strain ATCC 700532 / FAM18)
301 330 Rhodopirellula baltica
302 329 Caenorhabditis briggsae
303 329 Geobacillus thermodenitrificans (strain NG80-2)
304 329 Bordetella avium (strain 197N)
305 328 Burkholderia vietnamiensis (strain G4 / LMG 22486) (Burkholderia cepacia
306 328 Pseudomonas stutzeri (strain A1501)
307 328 Mycobacterium bovis (strain BCG / Pasteur 1173P2)
308 327 Lactococcus lactis subsp. cremoris (strain MG1363)
309 326 Nitrosospira multiformis (strain ATCC 25196 / NCIMB 11849)
310 326 Fusobacterium nucleatum subsp. nucleatum
311 325 Symbiobacterium thermophilum
312 325 Rhodoferax ferrireducens (strain DSM 15236 / ATCC BAA-621 / T118)
313 324 Zymomonas mobilis
314 324 Staphylococcus aureus (strain USA300 / TCH1516)
315 322 Thermoplasma acidophilum
316 321 Clostridium perfringens (strain ATCC 13124 / NCTC 8237 / Type A)
317 320 Thermus thermophilus (strain HB27 / ATCC BAA-163 / DSM 7039)
318 320 Wolinella succinogenes
319 320 Methanococcus maripaludis
320 319 Alcanivorax borkumensis (strain SK2 / ATCC 700651 / DSM 11573)
321 319 Rhodospirillum rubrum (strain ATCC 11170 / NCIB 8255)
322 318 Geobacter metallireducens (strain GS-15 / ATCC 53774 / DSM 7210)
323 318 Triticum aestivum (Wheat)
324 317 Burkholderia pseudomallei (strain 1106a)
325 317 Bacillus thuringiensis (strain Al Hakam)
326 317 Streptococcus agalactiae serotype Ia
327 317 Bacteroides thetaiotaomicron
328 316 Corynebacterium diphtheriae
329 316 Methylobacillus flagellatus (strain KT / ATCC 51484 / DSM 6875)
330 316 Rhodopseudomonas palustris (strain HaA2)
331 314 Rhodopseudomonas palustris (strain BisB18)
332 314 Azoarcus sp. (strain BH72)
333 313 Marinobacter aquaeolei (Marinobacter hydrocarbonoclasticus
334 313 Pelobacter carbinolicus (strain DSM 2380 / Gra Bd 1)
335 313 Sinorhizobium medicae (strain WSM419) (Ensifer medicae)
336 312 Clostridium tetani
337 311 Methanosarcina barkeri (strain Fusaro / DSM 804)
338 311 Nitrobacter winogradskyi (strain Nb-255 / ATCC 25391)
339 310 Campylobacter jejuni (strain RM1221)
340 309 Thiomicrospira crunogena (strain XCL-2)
341 309 Brucella suis (strain ATCC 23445 / NCTC 10510)
342 308 Hordeum vulgare (Barley)
343 308 Streptococcus pneumoniae serotype 2 (strain D39 / NCTC 7466)
344 308 Brucella canis (strain ATCC 23365 / NCTC 10854)
345 308 Alkalilimnicola ehrlichei (strain MLHE-1)
346 307 Prochlorococcus marinus (strain NATL2A)
347 307 Burkholderia pseudomallei (strain 668)
348 305 Burkholderia mallei (strain NCTC 10247)
349 304 Clostridium perfringens (strain SM101 / Type A)
350 303 Rhodopseudomonas palustris (strain BisB5)
351 302 Ochrobactrum anthropi (strain ATCC 49188 / DSM 6882 / NCTC 12168)
352 301 Burkholderia mallei (strain NCTC 10229)
353 301 Haloarcula marismortui (Halobacterium marismortui)
354 301 Sulfolobus acidocaldarius
355 301 Bacteroides fragilis
356 301 Carboxydothermus hydrogenoformans (strain Z-2901 / DSM 6008)
357 301 Nitrobacter hamburgensis (strain X14 / DSM 10229)
358 299 Burkholderia mallei (strain SAVP1)
359 299 Gluconobacter oxydans (Gluconobacter suboxydans)
360 299 Streptococcus thermophilus (strain CNRZ 1066)
361 298 Mesorhizobium sp. (strain BNC1)
362 297 Synechococcus sp. (strain CC9902)
363 297 Cryptococcus neoformans (Filobasidiella neoformans)
364 297 Streptococcus thermophilus (strain ATCC BAA-250 / LMG 18311)
365 295 Roseobacter denitrificans (strain ATCC 33942 / OCh 114) (Erythrobacter sp.
366 295 Prochlorococcus marinus (strain MIT 9312)
367 295 Staphylococcus aureus
368 295 Bartonella henselae (Rochalimaea henselae)
369 295 Psychrobacter arcticus (strain DSM 17307 / 273-4)
370 293 Cavia porcellus (Guinea pig)
371 293 Pyrobaculum aerophilum
372 293 Nitrosomonas eutropha (strain C91)
373 291 Helicobacter hepaticus
374 289 Lactococcus lactis subsp. cremoris (strain SK11)
375 289 Legionella pneumophila (strain Corby)
376 289 Thermoplasma volcanium
377 289 Bartonella quintana (Rochalimaea quintana)
378 289 Streptococcus sanguinis (strain SK36)
379 288 Synechococcus sp. (strain CC9605)
380 288 Streptococcus gordonii (strain Challis / ATCC 35105 / CH1 / DL1 / V288)
381 288 Desulfotalea psychrophila
382 286 Synechococcus sp. (strain JA-2-3B'a(2-13))
383 286 Synechococcus sp. (strain JA-3-3Ab)
384 286 Halorhodospira halophila (strain DSM 244 / SL1) (Ectothiorhodospira halophila
385 285 Streptococcus pyogenes serotype M28
386 285 Pseudomonas putida
387 285 Psychrobacter cryohalolentis (strain K5)
388 285 Moorella thermoacetica (strain ATCC 39073)
389 284 Brucella ovis (strain ATCC 25840 / 63/290 / NCTC 10512)
390 283 Streptococcus pyogenes serotype M5 (strain Manfredo)
391 282 Haemophilus somnus (strain 2336) (Histophilus somni (strain 2336))
392 281 Rhodopseudomonas palustris (strain BisA53)
393 281 Jannaschia sp. (strain CCS1)
394 281 Lactobacillus sakei subsp. sakei (strain 23K)
395 280 Bifidobacterium longum
396 279 Ustilago maydis (Smut fungus)
397 279 Rhodobacter sphaeroides (strain ATCC 17029 / ATH 2.4.9)
398 279 Wigglesworthia glossinidia brevipalpis
399 278 Spinacia oleracea (Spinach)
400 278 Streptococcus thermophilus (strain ATCC BAA-491 / LMD-9)
401 278 Silicibacter sp. (strain TM1040)
402 277 Trichodesmium erythraeum (strain IMS101)
403 276 Campylobacter jejuni subsp. jejuni serotype O:23/36 (strain 81-176)
404 276 Bradyrhizobium sp. (strain BTAi1 / ATCC BAA-1182)
405 275 Lactobacillus johnsonii
406 274 Campylobacter jejuni subsp. jejuni serotype O:6 (strain 81116 / NCTC 11828)
407 274 Equus caballus (Horse)
408 274 Porphyromonas gingivalis (Bacteroides gingivalis)
409 273 Propionibacterium acnes
410 272 Leifsonia xyli subsp. xyli
411 271 Gorilla gorilla gorilla (Lowland gorilla)
412 270 Polaromonas sp. (strain JS666 / ATCC BAA-500)
413 269 Bacteroides fragilis (strain ATCC 25285 / NCTC 9343)
414 269 Aspergillus oryzae
415 268 Francisella tularensis subsp. tularensis
416 268 Bacteriophage T4
417 268 Clostridium botulinum (strain Langeland / NCTC 10281 / Type F)
418 267 Blochmannia floridanus
419 267 Acidovorax avenae subsp. citrulli (strain AAC00-1)
420 267 Bradyrhizobium sp. (strain ORS278)
421 266 Rhodococcus sp. (strain RHA1)
422 266 Desulfovibrio desulfuricans (strain G20)
423 266 Helicobacter pylori (strain HPAG1)
424 266 Anaeromyxobacter dehalogenans (strain 2CP-C)
425 265 Magnetospirillum magneticum (strain AMB-1 / ATCC 700264)
426 264 Lactobacillus acidophilus
427 264 Clostridium novyi (strain NT)
428 263 Janthinobacterium sp. (strain Marseille) (Minibacterium massiliensis)
429 263 Ureaplasma parvum (Ureaplasma urealyticum biotype 1)
430 262 Mycobacterium ulcerans (strain Agy99)
431 262 Rhodobacter capsulatus (Rhodopseudomonas capsulata)
432 262 Chlorobium chlorochromatii (strain CaD3)
433 261 Paracoccus denitrificans (strain Pd 1222)
434 261 Streptococcus pyogenes serotype M12 (strain MGAS9429)
435 260 Streptococcus pyogenes serotype M4 (strain MGAS10750)
436 260 Chlamydophila caviae
437 258 Corynebacterium glutamicum (strain R)
438 258 Neisseria meningitidis serogroup C (strain 053442)
439 257 Streptococcus pyogenes serotype M2 (strain MGAS10270)
440 257 Desulfitobacterium hafniense (strain Y51)
441 257 Synechococcus sp. (strain CC9311)
442 257 Polaromonas naphthalenivorans (strain CJ2)
443 256 Francisella tularensis subsp. holarctica (strain LVS)
444 255 Myxococcus xanthus (strain DK 1622)
445 255 Herminiimonas arsenicoxydans
446 255 Prochlorococcus marinus (strain MIT 9301)
447 255 Mycobacterium avium (strain 104)
448 254 Clostridium beijerinckii (strain ATCC 51743 / NCIMB 8052)
449 254 Acidovorax sp. (strain JS42)
450 254 Clostridium thermocellum (strain ATCC 27405 / DSM 1237)
451 254 Vaccinia virus (strain Copenhagen) (VACV)
452 253 Clostridium botulinum (strain ATCC 19397 / Type A)
453 253 Thermotoga petrophila (strain RKU-1 / ATCC BAA-488 / DSM 13995)
454 253 Pelodictyon luteolum (strain DSM 273) (Chlorobium luteolum (strain DSM 273))
455 253 Synechococcus sp. (strain WH7803)
456 253 Corynebacterium jeikeium (strain K411)
457 253 Mycobacterium smegmatis (strain ATCC 700084 / mc(2)155)
458 252 Thermobifida fusca (strain YX)
459 252 Prochlorococcus marinus (strain MIT 9515)
460 252 Novosphingobium aromaticivorans (strain DSM 12444)
461 251 Mycobacterium sp. (strain MCS)
462 250 Prochlorococcus marinus (strain AS9601)
463 250 Mycobacterium vanbaalenii (strain DSM 7251 / PYR-1)
464 248 Lactobacillus salivarius subsp. salivarius (strain UCC118)
465 247 Bdellovibrio bacteriovorus
466 247 Rhodobacter sphaeroides (strain ATCC 17025 / ATH 2.4.3)
467 247 Clostridium kluyveri (strain ATCC 8527 / DSM 555 / NCIMB 10680)
468 247 Campylobacter jejuni subsp. doylei (strain ATCC BAA-1458 / RM4099 / 269.97)
469 246 Methylibium petroleiphilum (strain PM1)
470 245 Alkaliphilus metalliredigens (strain QYMF)
471 245 Prochlorococcus marinus (strain NATL1A)
472 244 Blochmannia pennsylvanicus (strain BPEN)
473 244 Marinomonas sp. (strain MWYL1)
474 243 Coxiella burnetii (strain Dugway 5J108-111)
475 243 Mycobacterium sp. (strain KMS)
476 243 Prochlorococcus marinus (strain MIT 9215)
477 243 Coxiella burnetii (strain RSA 331 / Henzerling II)
478 243 Azorhizobium caulinodans (strain ATCC 43989 / DSM 5975 / ORS 571)
479 243 Streptococcus pyogenes serotype M12 (strain MGAS2096)
480 242 Sulfurimonas denitrificans (Thiomicrospira denitrificans
481 242 Geobacter uraniireducens (strain Rf4) (Geobacter uraniumreducens)
482 241 Mycobacterium sp. (strain JLS)
483 241 Francisella tularensis subsp. tularensis (strain FSC 198)
484 241 Clostridium difficile (strain 630)
485 240 Desulfovibrio vulgaris subsp. vulgaris (strain DP4)
486 239 Prochlorococcus marinus (strain MIT 9303)
487 238 Francisella tularensis subsp. novicida (strain U112)
488 238 Lactobacillus casei (strain ATCC 334)
489 237 Treponema denticola
490 236 Baumannia cicadellinicola subsp. Homalodisca coagulata
491 236 Francisella tularensis subsp. holarctica (strain OSU18)
492 235 Bacillus stearothermophilus (Geobacillus stearothermophilus)
493 234 Clostridium botulinum (strain Hall / ATCC 3502 / NCTC 13319 / Type A)
494 234 Acaryochloris marina (strain MBIC 11017)
495 234 Natronomonas pharaonis (strain DSM 2160 / ATCC 35678)
496 234 Methanococcus vannielii (strain SB / ATCC 35089 / DSM 1224)
497 233 Leptospira borgpetersenii serovar Hardjo-bovis (strain JB197)
498 232 Sphingopyxis alaskensis (Sphingomonas alaskensis)
499 232 Methanococcus maripaludis (strain C7 / ATCC BAA-1331)
500 232 Syntrophus aciditrophicus (strain SB)
501 231 Chlamydomonas reinhardtii
502 231 Hyphomonas neptunium (strain ATCC 15444)
503 230 Pediococcus pentosaceus (strain ATCC 25745 / 183-1w)
504 229 Verminephrobacter eiseniae (strain EF01-2)
505 229 Chlorobium phaeobacteroides (strain DSM 266)
506 228 Helicobacter acinonychis (strain Sheeba)
507 228 Methanococcus maripaludis (strain C5 / ATCC BAA-1333)
508 228 Pelobacter propionicus (strain DSM 2379)
509 227 Maricaulis maris (strain MCS10)
510 227 Alkaliphilus oremlandii (strain OhILAs) (Clostridium oremlandii (strain OhILAs))
511 227 Deinococcus geothermalis (strain DSM 11300)
512 226 Chlamydia trachomatis (strain A/HAR-13 / ATCC VR-571B)
513 225 Francisella tularensis subsp. tularensis (strain WY96-3418)
514 224 Cricetulus griseus (Chinese hamster)
515 224 Protochlamydia amoebophila (strain UWE25)
516 222 Francisella tularensis subsp. holarctica (strain FTA)
517 222 Syntrophomonas wolfei subsp. wolfei (strain Goettingen)
518 221 Desulfotomaculum reducens (strain MI-1)
519 221 Caulobacter sp. (strain K31)
520 220 Frankia sp. (strain CcI3)
521 220 Dinoroseobacter shibae (strain DFL 12)
522 219 Bartonella tribocorum (strain CIP 105476 / IBS 506)
523 218 Synechococcus sp. (strain RCC307)
524 218 Lactobacillus brevis (strain ATCC 367 / JCM 1170)
525 217 Felis silvestris catus (Cat)
526 217 Lactobacillus delbrueckii subsp. bulgaricus (strain ATCC 11842 / DSM 20081)
527 217 Chlamydophila abortus
528 217 Porphyra purpurea
529 217 Syntrophobacter fumaroxidans (strain DSM 10017 / MPOB)
530 216 Bartonella bacilliformis (strain ATCC 35685 / KC583)
531 216 Methanococcoides burtonii (strain DSM 6242)
532 215 Leptospira borgpetersenii serovar Hardjo-bovis (strain L550)
533 215 Rickettsia akari (strain Hartford)
534 214 Klebsiella pneumoniae
535 214 Dehalococcoides sp. (strain CBDB1)
536 213 Dehalococcoides ethenogenes (strain 195)
537 211 Rickettsia canadensis (strain McKiel)
538 211 Parvibaculum lavamentivorans (strain DS-1 / DSM 13023 / NCIMB 13966)
539 210 Rickettsia rickettsii (strain Sheila Smith)
540 209 Gibberella zeae (Fusarium graminearum)
541 208 Mycobacterium gilvum (strain PYR-GCK) (Mycobacterium flavescens
542 208 Streptococcus suis (strain 98HAH33)
543 208 Porphyra yezoensis
544 208 Francisella philomiragia subsp. philomiragia (strain ATCC 25017)
545 208 Anaeromyxobacter sp. (strain Fw109-5)
546 208 Granulobacter bethesdensis (strain ATCC BAA-1260 / CGDNIH1)
547 207 Bacteroides vulgatus (strain ATCC 8482 / DSM 1447 / NCTC 11154)
548 206 Nitratiruptor sp. (strain SB155-2)
549 206 Pelagibacter ubique
550 206 Mesocricetus auratus (Golden hamster)
551 205 Caldicellulosiruptor saccharolyticus (strain ATCC 43494 / DSM 8903)
552 204 Chlamydophila felis (strain Fe/C-56)
553 204 Salinibacter ruber (strain DSM 13855)
554 203 Encephalitozoon cuniculi
555 203 Tropheryma whipplei (strain TW08/27) (Whipple's bacillus)
556 203 Lactobacillus delbrueckii subsp. bulgaricus (strain ATCC BAA-365)
557 203 Prosthecochloris vibrioformis (Chlorobium vibrioforme subsp. thiosulfatophilum (Chlorobium phaeovibrioides
558 203 Psychrobacter sp. (strain PRwf-1)
559 202 Magnetococcus sp. (strain MC-1)
560 202 Tropheryma whipplei (strain Twist) (Whipple's bacillus)
561 202 Lactobacillus reuteri (strain ATCC 23272 / DSM 20016 / F275)
562 200 Vaccinia virus (strain Western Reserve / WR) (VACV)
3.3 Taxonomic distribution of the sequences
Kingdom sequences (% of the database)
Archaea 14637 ( 4%)
Bacteria 222942 ( 57%)
Eukaryota 140754 ( 36%)
Viruses 12363 ( 3%)
Within Eukaryota:
Category sequences (% of Eukaryota) (% of the complete database)
Human 19930 ( 14%) ( 5%)
Other Mammalia 42795 ( 30%) ( 11%)
Other Vertebrata 13889 ( 10%) ( 4%)
Viridiplantae 23238 ( 17%) ( 6%)
Fungi 21935 ( 16%) ( 6%)
Insecta 5522 ( 4%) ( 1%)
Nematoda 3764 ( 3%) ( 1%)
Other 9681 ( 7%) ( 2%)
3.4 Annotation of high-priority organisms
4. SEQUENCE SIZE
Repartition of the sequences by size (excluding fragments)
From To Number From To Number
1- 50 6397 1001-1100 2920
51- 100 29131 1101-1200 1986
101- 150 41790 1201-1300 1572
151- 200 41109 1301-1400 1426
201- 250 40870 1401-1500 1112
251- 300 35938 1501-1600 553
301- 350 34995 1601-1700 435
351- 400 30746 1701-1800 380
401- 450 25270 1801-1900 350
451- 500 20933 1901-2000 274
501- 550 14736 2001-2100 173
551- 600 10881 2101-2200 244
601- 650 9198 2201-2300 219
651- 700 6482 2301-2400 155
701- 750 5298 2401-2500 111
751- 800 3902 >2500 860
801- 850 3404
851- 900 3637
901- 950 2976
951-1000 2101
The average sequence length in UniProtKB/Swiss-Prot is 359 amino acids.
The shortest sequence is GWA_SEPOF (P83570): 2 amino acids.
The longest sequence is TITIN_MOUSE (A2ASS6): 35213 amino acids.
5. JOURNAL CITATIONS
Note: the following citation statistics reflect the number of distinct
journal citations.
Total number of journals cited in this release of UniProtKB/Swiss-Prot: 1918
5.1 Table of the frequency of journal citations
Journals cited 1x: 624
2x: 267
3x: 129
4x: 103
5x: 72
6x: 56
7x: 39
8x: 37
9x: 36
10x: 23
11- 20x: 151
21- 50x: 151
51-100x: 86
>100x: 144
5.2 List of the most cited journals in UniProtKB/Swiss-Prot
Nb Citations Journal name
-- --------- -------------------------------------------------------------
1 16294 Journal of Biological Chemistry
2 7637 Proceedings of the National Academy of Sciences of the U.S.A.
3 4673 Journal of Bacteriology
4 4396 Gene
5 4186 Biochemical and Biophysical Research Communications
6 4175 Nucleic Acids Research
7 3742 FEBS Letters
8 3493 Biochemistry
9 3472 The EMBO Journal
10 3115 Molecular and Cellular Biology
11 3007 European Journal of Biochemistry
12 2962 Nature
13 2817 Biochimica et Biophysica Acta
14 2690 Journal of Molecular Biology
15 2430 Genomics
16 2411 Cell
17 2012 Biochemical Journal
18 1885 Science
19 1600 Journal of Virology
20 1582 Molecular Microbiology
21 1425 Journal of Cell Biology
22 1417 Plant Molecular Biology
23 1288 Molecular and General Genetics
24 1222 Virology
25 1195 Nature Genetics
26 1191 Genes and Development
27 1191 Human Molecular Genetics
28 1121 Journal of Biochemistry
29 1099 Oncogene
30 1091 The American Journal of Human Genetics
31 1085 Plant Physiology
32 982 Development
33 918 Journal of Immunology
34 900 Human Mutation
35 868 Genetics
36 848 Molecular Biology of the Cell
37 812 Infection and Immunity
38 792 Structure
39 768 Journal of General Virology
40 755 Archives of Biochemistry and Biophysics
41 723 Yeast
42 703 The Plant Cell
43 698 Blood
44 662 Microbiology
45 647 Molecular Cell
46 614 Developmental Biology
47 607 Journal of Cell Science
48 594 Cancer Research
49 594 FEMS Microbiology Letters
50 584 The Plant Journal
51 563 Human Genetics
52 563 Nature Structural Biology
53 529 Mechanisms of Development
54 521 Current Biology
55 511 Current Genetics
56 474 Journal of Neuroscience
57 471 Applied and Environmental Microbiology
58 465 Journal of Clinical Investigation
59 464 Acta Crystallographica, Section D
60 462 Neuron
61 460 Protein Science
62 458 Mammalian Genome
63 421 The Journal of Experimental Medicine
64 420 Immunogenetics
65 413 Molecular Endocrinology
66 413 Toxicon
67 410 Molecular and Biochemical Parasitology
68 406 American Journal of Physiology
69 378 Journal of Neurochemistry
70 365 Endocrinology
71 359 Journal of Molecular Evolution
72 353 DNA and Cell Biology
73 351 The Journal of Clinical Endocrinology and Metabolism
74 343 DNA Sequence
75 332 Molecular Biology and Evolution
76 312 Bioscience, Biotechnology, and Biochemistry
77 306 Journal of Medical Genetics
78 306 Brain Research. Molecular Brain Research
79 286 Biological Chemistry Hoppe-Seyler
80 279 Proteins
81 272 Cytogenetics and Cell Genetics
82 260 Comparative Biochemistry and Physiology
83 256 Journal of Investigative Dermatology
84 255 Peptides
85 245 Journal of General Microbiology
86 245 Molecular Pharmacology
87 244 Antimicrobial Agents and Chemotherapy
88 238 Biology of Reproduction
89 237 Plant and Cell Physiology
90 234 Nature Cell Biology
91 232 Experimental Cell Research
92 224 Genome Research
93 215 Hoppe-Seyler's Zeitschrift fur Physiologische Chemie
94 211 Virus Research
95 207 Neurology
96 194 Developmental Dynamics
97 193 Molecular Plant-Microbe Interactions
98 193 RNA
99 191 DNA Research
100 188 European Journal of Immunology
101 182 Biochimie
102 180 Tissue Antigens
103 173 Annals of Neurology
104 173 European Journal of Human Genetics
105 165 Journal of Human Genetics
106 163 Molecular and Cellular Endocrinology
107 163 Immunity
108 163 Genes to Cells
109 162 Planta
110 159 DNA
111 158 Developmental Cell
112 155 Molecular Phylogenetics and Evolution
113 154 American Journal of Medical Genetics
114 152 Hemoglobin
115 149 Archives of Microbiology
116 149 Eukaryotic cell
117 148 The New England Journal of Medicine
118 146 Bioorganicheskaia Khimiia
119 143 Insect Biochemistry and Molecular Biology
120 137 Investigative Ophthalmology and Visual Science
121 136 Molecular Reproduction and Development
122 136 Diabetes
123 134 Glycobiology
124 133 Animal Genetics
125 131 Molecular Immunology
126 129 General and Comparative Endocrinology
127 127 Molecular and Cellular Neuroscience
128 124 International Journal of Cancer
129 121 Archives of Virology
130 119 Agricultural and Biological Chemistry
131 116 The FASEB Journal
132 112 British Journal of Haematology
133 111 Molecular Genetics and Metabolism
134 110 EMBO Reports
135 109 Journal of Protein Chemistry
136 106 Clinical Genetics
137 106 Biological Chemistry
138 105 Molecular Genetics and Genomics
139 104 Journal of Neuroscience Research
140 104 Journal of Cellular Biochemistry
141 103 Neuroscience Letters
142 103 Journal of Molecular Endocrinology
143 103 Journal of Lipid Research
144 101 Biochemistry and Molecular Biology International
6. STATISTICS FOR SOME LINE TYPES
The following table summarizes the total number of some UniProtKB/Swiss-Prot lines,
as well as the number of entries with at least one such line, and the
frequency of the lines.
Total Number of Average
Line type / subtype number entries per entry
--------------------------------- -------- --------- ---------
References (RL) 713537 1.83
1 Journal 578611 305616 1.48
2 Submitted to EMBL/GenBank/DDBJ 127885 117829 0.33
3 Submitted to other databases 5057 4669 0.01
4 Book citation 592 582 <0.01
5 Plant Gene Register 540 528 <0.01
6 Thesis 420 418 <0.01
7 Unpublished observations 285 281 <0.01
8 Patent 141 139 <0.01
9 Worm Breeder's Gazette 6 6 <0.01
Total number of distinct authors cited in UniProtKB/Swiss-Prot: 262148.
Comments (CC) 1615187 4.13
1 SIMILARITY 452484 367283 1.16
2 FUNCTION 280266 269770 0.72
3 SUBCELLULAR LOCATION 223231 219059 0.57
4 CATALYTIC ACTIVITY 156747 143258 0.40
5 SUBUNIT 153224 153224 0.39
6 PATHWAY 91670 79897 0.23
7 COFACTOR 64979 59572 0.17
8 TISSUE SPECIFICITY 29402 29402 0.08
9 PTM 28721 23515 0.07
10 MISCELLANEOUS 26848 24501 0.07
11 DOMAIN 24216 21357 0.06
12 ALTERNATIVE PRODUCTS 16768 16768 0.04
13 SEQUENCE CAUTION 10295 10295 0.03
14 INTERACTION 9420 9420 0.02
15 INDUCTION 9092 9092 0.02
16 DEVELOPMENTAL STAGE 7537 7537 0.02
17 WEB RESOURCE 6282 5109 0.02
18 ENZYME REGULATION 6248 6248 0.02
19 CAUTION 5292 5187 0.01
20 DISEASE 4348 2996 0.01
21 MASS SPECTROMETRY 3526 2699 0.01
22 BIOPHYSICOCHEMICAL PROPERTIES 2202 2202 0.01
23 POLYMORPHISM 710 681 <0.01
24 RNA EDITING 544 544 <0.01
25 ALLERGEN 445 445 <0.01
26 TOXIC DOSE 374 366 <0.01
27 BIOTECHNOLOGY 236 234 <0.01
28 PHARMACEUTICAL 80 80 <0.01
Features (FT) 2427091 6.21
1 CHAIN 396977 386803 1.02
2 TRANSMEM 238504 50971 0.61
3 METAL 178701 44721 0.46
4 BINDING 126185 39828 0.32
5 DOMAIN 118207 68155 0.30
6 CONFLICT 107783 37444 0.28
7 STRAND 106870 10129 0.27
8 MOD_RES 104052 37107 0.27
9 TOPO_DOM 103947 21172 0.27
10 HELIX 103736 10660 0.27
11 ACT_SITE 94410 55812 0.24
12 CARBOHYD 86714 22306 0.22
13 DISULFID 84881 21510 0.22
14 REPEAT 72737 11060 0.19
15 NP_BIND 69671 48076 0.18
16 REGION 60293 33528 0.15
17 VARIANT 59464 12735 0.15
18 COMPBIAS 37160 21344 0.10
19 VAR_SEQ 35189 14936 0.09
20 SIGNAL 29265 29255 0.07
21 MOTIF 25711 16695 0.07
22 TURN 25708 8579 0.07
23 SITE 24924 14367 0.06
24 ZN_FING 24421 9934 0.06
25 MUTAGEN 24077 5827 0.06
26 COILED 14825 9805 0.04
27 INIT_MET 12314 12314 0.03
28 NON_TER 10980 8393 0.03
29 LIPID 9389 6024 0.02
30 PROPEP 9362 7804 0.02
31 DNA_BIND 8759 8097 0.02
32 PEPTIDE 7506 4583 0.02
33 TRANSIT 5597 5514 0.01
34 CA_BIND 3348 1389 0.01
35 CROSSLNK 2969 2060 0.01
36 NON_CONS 1447 584 <0.01
37 UNSURE 668 224 <0.01
38 NON_STD 340 266 <0.01
Cross-references (DR) 6890658 17.64
1 InterPro 940478 362099 2.41
2 EMBL 674786 381821 1.73
3 GO 651575 259330 1.67
4 Pfam 496864 348358 1.27
5 PROSITE 354235 222028 0.91
6 RefSeq 353595 323554 0.91
7 GeneID 339717 323324 0.87
8 KEGG 299425 279467 0.77
9 GenomeReviews 255202 237670 0.65
10 HAMAP 203811 203712 0.52
11 HOGENOM 197998 197998 0.51
12 TIGRFAMs 184582 172915 0.47
13 Gene3D 176938 146995 0.45
14 BioCyc 145866 139351 0.37
15 PANTHER 140796 129954 0.36
16 PRINTS 123385 100879 0.32
17 NMPDR 116721 116721 0.30
18 PIR 110871 101204 0.28
19 ProDom 108689 105859 0.28
20 SMART 104025 79096 0.27
21 HSSP 84033 84033 0.22
22 UniGene 78073 72463 0.20
23 HOVERGEN 74716 74695 0.19
24 Ensembl 66420 64911 0.17
25 PIRSF 57574 57574 0.15
26 ArrayExpress 52970 52970 0.14
27 PDBsum 50389 12753 0.13
28 PDB 50389 12753 0.13
29 SMR 49871 49871 0.13
30 GermOnline 41987 41375 0.11
31 TIGR 31484 30787 0.08
32 CleanEx 30195 29561 0.08
33 HGNC 18746 18613 0.05
34 LinkHub 18097 18097 0.05
35 IntAct 16422 16422 0.04
36 PharmGKB 15817 15808 0.04
37 MGI 15611 15560 0.04
38 MIM 15107 12029 0.04
39 PhosphoSite 14849 14849 0.04
40 H-InvDB 11267 9572 0.03
41 DIP 8998 8948 0.02
42 MEROPS 7201 6905 0.02
43 RGD 6976 6971 0.02
44 TAIR 6951 6837 0.02
45 SGD 6641 6539 0.02
46 CYGD 6628 6524 0.02
47 DrugBank 5331 1630 0.01
48 PeptideAtlas 5168 5168 0.01
49 GeneDB_Spombe 4460 4419 0.01
50 EcoGene 4332 4329 0.01
51 EchoBASE 4159 4124 0.01
52 WormPep 3881 3186 0.01
53 Gramene 3735 3731 0.01
54 FlyBase 3688 3560 0.01
55 WormBase 3577 3493 0.01
56 Reactome 3416 2069 0.01
57 HPA 2985 2565 0.01
58 SubtiList 2817 2816 0.01
59 Orphanet 2590 1683 0.01
60 dictyBase 2429 2343 0.01
61 GeneFarm 2240 2220 0.01
62 ZFIN 2077 2061 0.01
63 StyGene 1651 1647 <0.01
64 TubercuList 1470 1434 <0.01
65 SWISS-2DPAGE 1184 1182 <0.01
66 PseudoCAP 1179 1170 <0.01
67 ListiList 1129 1121 <0.01
68 REPRODUCTION-2DPAGE 1029 941 <0.01
69 AGD 769 763 <0.01
70 LegioList 699 697 <0.01
71 PhotoList 691 691 <0.01
72 Leproma 650 647 <0.01
73 PeroxiBase 496 485 <0.01
74 World-2DPAGE 495 495 <0.01
75 CGD 471 471 <0.01
76 MaizeGDB 468 463 <0.01
77 ProMEX 422 422 <0.01
78 DisProt 397 394 <0.01
79 OGP 380 378 <0.01
80 SagaList 373 372 <0.01
81 REBASE 351 343 <0.01
82 ECO2DBASE 351 299 <0.01
83 GlycoSuiteDB 282 282 <0.01
84 BuruList 262 262 <0.01
85 PHCI-2DPAGE 244 244 <0.01
86 VectorBase 228 221 <0.01
87 MypuList 198 198 <0.01
88 DOSAC-COBS-2DPAGE 152 150 <0.01
89 Aarhus/Ghent-2DPAGE 126 96 <0.01
90 Siena-2DPAGE 102 102 <0.01
91 HSC-2DPAGE 85 85 <0.01
92 2DBase-Ecoli 84 84 <0.01
93 PhosSite 70 70 <0.01
94 Cornea-2DPAGE 67 67 <0.01
95 COMPLUYEAST-2DPAGE 59 59 <0.01
96 euHCVdb 55 44 <0.01
97 PMMA-2DPAGE 52 52 <0.01
98 PptaseDB 31 31 <0.01
99 Rat-heart-2DPAGE 28 28 <0.01
100 ANU-2DPAGE 22 22 <0.01
7. MISCELLANEOUS STATISTICS
4375 entries are encoded on a mitochondrion, and 3423 are encoded on a plasmid.
9658 entries are encoded on a plastid,
of which 16 are encoded on apicoplasts,
9263 on chloroplasts,
on chromatophores,
145 on cyanelles,
124 on non-photosynthetic plastids and
110 on unspecified types of plastid.