TY - JOUR
T1 - Assessment of hierarchical clustering methodologies for proteomic data mining
AU - Meunier, Bruno
AU - Dumas, Emilie
AU - Piec, Isabelle
AU - Bechet, Daniel
AU - Hebraud, Michel
AU - Hocquette, Jean-Francois
N1 - An erratum to this article has been published and is available at: https://pubs.acs.org/doi/10.1021/pr078001e
"Due to a production error, several zip codes in the affiliation addresses of the authors contained misprints in the version posted on the Web December 14, 2006 (ASAP) and published in the January 5, 2007 issue (Vol. 6, No. 1, pp 353−366). The correct affiliation addresses are as follows: UR 1213, Unité de Recherches sur les Herbivores, Equipe Croissance et Métabolisme du Muscle, INRA de Clermont-Ferrand/Theix, F-63122 Saint-Genès Champanelle, France, UR484 Microbiologie, Equipe Qualité et Sécurité des Aliments (QuaSA), INRA de Clermont-Ferrand/Theix, F-63122 Saint-Genès Champanelle, France, Laboratoire d'Immunologie EMI 0351, INSERM, 3 rue des Louvels, F-80036 Amiens, France, UMR 1019, Unité de Nutrition Humaine, INRA de Clermont-Ferrand/Theix, F-63122 Saint-Genès Champanelle, France, and Plate-forme protéomique, INRA de Clermont-Ferrand/Theix, F-63122 Saint-Genès Champanelle, France. The electronic version of the paper was corrected and replaced on the Web February 7, 2007.02/07/2007"
PY - 2007
Y1 - 2007
N2 - Hierarchical clustering methodology is a powerful data mining approach for a first exploration of proteomic data. It enables samples or proteins to be grouped blindly according to their expression profiles. Nevertheless, the clustering results depend on parameters such as data preprocessing, between-profile similarity measurement, and the dendrogram construction procedure. We assessed several clustering strategies by calculating the F-measure, a widely used quality metric. The combination, on logged matrix, of Pearson correlation and Ward's methods for data aggregation is among the best clustering strategies, at least with the data sets we studied. This study was carried out using PermutMatrix, a freely available software derived from transcriptomics.
AB - Hierarchical clustering methodology is a powerful data mining approach for a first exploration of proteomic data. It enables samples or proteins to be grouped blindly according to their expression profiles. Nevertheless, the clustering results depend on parameters such as data preprocessing, between-profile similarity measurement, and the dendrogram construction procedure. We assessed several clustering strategies by calculating the F-measure, a widely used quality metric. The combination, on logged matrix, of Pearson correlation and Ward's methods for data aggregation is among the best clustering strategies, at least with the data sets we studied. This study was carried out using PermutMatrix, a freely available software derived from transcriptomics.
UR - https://publons.com/wos-op/publon/2853261/
U2 - 10.1021/PR060343H
DO - 10.1021/PR060343H
M3 - Article
VL - 6
SP - 358
EP - 366
JO - Journal of Proteome Research
JF - Journal of Proteome Research
SN - 1535-3893
IS - 1
ER -