222 E 2006, 36(2): 222~234 * 1** 1,2 1,2 1,2 1,2 3 3 1 1 1,2 (1., 100080; 2., 100039; 3., 200031),. ;,,.. 20 90 (human genome project, HGP), ; HGP 2003, 21. (proteomics) ; (computational proteomics),,, [1].,,,. : 2005-09-28; : 2005-11-17 * 973 (2002CB713807) (2004BA711A21) ** E-mail: rxsun@ict.ac.cn
2 : 223,, 1)..,, (mass spectrometry, MS), 2002 MALDI ESI., [2~5]. 1 ; 2 3 KSDP pfind, ; 4 ; 5. 1 DNA ;, 20,. (primary structure), 1,. 1,,,. Edman 1).., 2004 7 www.scichina.com
224 E 36,, ; : (mass spectrometer) (, ), ;,, (protein identification)., ( ),,,. (ion source) (mass analyzer) (detector).,,,,,,, (peptide mass fingerprinting, PMF);, (collision-induced dissociation, CID),,, (MS/MS, Tandem MS). MS/MS, ( 2 ). MS/MS,. MS/MS, 2 2 MS/MS
2 : 225,, m/z;,. 14 EGVNDNEEGFFSAR CID, ( ) 1570.6 u. CID,, N a, b, c C x, y, z., a-x,b-y,c-z, 3 4..,.,,,,, 2 MS/MS. 2 b y,,. 3 4 2 KSDP pfind : 1).. www.scichina.com
226 E 36 2) (de novo peptide sequencing).,. 3) (tag),,.,,. 2.1 KSDP [6].,,. SEQUEST (cross correlation) [7] Mascot [8]..,, 10,.,,,,.,,,.,,, (kernel spectrum dot product, KSDP). (SDP),,.,.,. 5 KSDP, 5(b) (c) 6 y,, (b), KSDP. KSDP SEQUEST, Sonar MS/MS [9] 1, 230 1323,, KSDP,
2 : 227 5 KSDP (a) ; (b) 6 y ; (c) 6 y 1 KSDP KSDP SEQUEST Sonar MS/MS A-1,2 230 10 16 46 A 1323 40 66 a) a) Sonar MS/MS, 1323,, [6]. 2.2 pfind [10] KSDP, pfind, 6, ( http://pfind.jdl.ac.cn ), 7, pfind,, 2.0, pfind. 6 pfind www.scichina.com
228 E 36 7 pfind pfind,,. 3 [11],,,,,.,., C,H,N,O,S,P,,,,, 8..,.., FFP(fragment formula prediction) [11]. C,H,N,O,S,P,,,,.,, Q-TOF 500 u. 50 Q-TOF 9, (0~300 u),
2 : 229 83%, 5 97%; (300~500 u) 5 95%, [11].,. 8 9 FFP 4,,, www.scichina.com
230 E 36,,,.,,. (quantitative proteomics).,, (disease proteomics),,.. 4.1,,,, [13].,. : (two dimensional gel electrophoresis, 2DE).,.., [12].,,,,.,, (liquid chromatography, LC).,. :,..., (signal-to-noise ratio, S/N).,,.,,,,,,,,
2 : 231,. 4.2,, (clinical proteomics). 2005 7 25 (http://www.bioon.com/biology/ advance/cancer/200508/149930.html) 2005 8 28 HUPO(The human proteome organization) (http://www.hupo2005. com/),,.,, (biomarker).,,,,.,,,, [13,14].,. 5, 100 95, CA125 [15].,,,,. Ciphergen SELDI-TOF(surface-enhanced laser desorption/ionization time-of-flight, ).,,,.,. SELDI-TOF 10 [16]. ( ),,,, ; SELDI-TOF, www.scichina.com
232 E 36 m/z, 10 ( m/z, );,,. 10, SELDI-TOF,, 10 SELDI-TOF [16].,,, m/z, ;,, ;,,,,., SELDI-TOF :.,,, (matrix) A/D.,,.,,., (overfitting).,
2 : 233,. 5,,,,,,, ;,. KSDP pfind,,.,,,,.,,.,,,,,,,,.,,,,,..,,., pfind. 1 Patterson S D, Aebersold R H. Proteomics: the first decade and beyond. Nature Genetics Supplement, 2003, 33:311~323[DOI] www.scichina.com
234 E 36 2 Johnson R S, Davis M T, Taylor J A, et al. Informatics for protein identification by mass spectrometry. Methods, 2005, 35: 223 ~236[DOI] 3 Aebersold R H, Mann M. Mass spectrometry-based proteomics. Nature, 2003, 422:198~207[DOI] 4 Steen H, Mann M. The ABC's(and XYZ's) of peptide sequencing. Nature Reviews Molecular Cell Biology, 2004, 5:699~711[DOI] 5 Russell S A, Old W, Resing K A, et al. Proteomic informatics. International Review of Neurobiology, 2004, 61:129~157 6 Fu Y, Yang Q, Sun R, et al. Exploiting the kernel trick to correlate fragment ions for peptide identification via tandem mass spectrometry. Bioinformatics, 2004, 20:1948~1954[DOI] 7 Eng J K, McCormack A L, Yates J R. An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database. Journal of American Society of Mass Spectrometry, 1994, 5: 976~989[DOI] 8 Perkins D N, Pappin D J, Creasy D M, et al. Probability-based protein identification by searching sequence databases using mass spectrometry data. Electrophoresis, 1999, 20: 3551~3567[DOI] 9 Fenyö D, Beavis R C. A method for assessing the statistical significance of mass spectrometry-based protein identifications using general scoring schemes. Analytical Chemisty, 2003, 75:768~774[DOI] 10 Li D, Fu Y, Sun R, et al. pfind: a novel database-searching software system for automated peptide and protein identification via tandem mass spectrometry. Bioinformatics, 2005, 21(13): 3049~3050[DOI] 11 Zhang J, Gao W, Cai J, et al. Predicting molecular formulas of fragment ions with isotope patterns in tandem mass spectra. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2005, 2(3):217~230[DOI] 12 Gygi S P, Rist B, Gerber S A, et al. Quantitative analysis of complex protein mixtures using isotope-coded affinity tags. Nature Biotechnololy, 1999, 17: 994~999[DOI] 13,. SELDI., 2002, 1:1~4 14 Liotta L A, Ferrari M, Petricoin E F. Clinical proteomics: written in blood. Nature, 2003, 425:905[DOI] 15 Petricoin E F, Ardekani A M, Hitt B A, et al. Use of proteomic patterns in serum to identify ovarian cancer. The Lancet, 2002, 359: 572~577[DOI] 16 Wulfkuhle J D, Liotta L A, Petricoin E F. Proteomic applications for the early detection of cancer. Nature Reviews Cancer, 2003, 3: 267-275[DOI]