John H. Wolfe

From Wikipedia the free encyclopedia


John H. Wolfe is the inventor of model-based clustering for continuous data.[1][2][3] Wolfe graduated with a B.A. in mathematics from Caltech and then went to graduate school in psychology at the University of California, Berkeley to work with Robert Tryon.

Around 1959, Paul Lazarsfeld visited Berkeley and gave a lecture on his latent class analysis, which fascinated Wolfe, and led him to start thinking about how one could do the same thing for continuous data. Wolfe's 1963 M.A. thesis[4] is a first, but ultimately failed attempt to do this. After graduating from Berkeley, Wolfe took a job with the US Navy in San Diego first as a computer programmer and then as an operations research analyst.

He continued his research on clustering and in 1965 he published the paper that invented model-based clustering.[5][3] He used the mixture of multivariate normal distributions model, estimated it by maximum likelihood using a Newton-Raphson algorithm and gave the expression for the posterior probabilities of membership in each cluster. This paper also contains the first publicly available software for estimating the model, called NORMIX. This was extended and published in a journal by Wolfe (1970).[6]

After 1970, Wolfe worked on other topics, but model-based clustering grew rapidly. Articles on model-based clustering have garnered over 20,000 citations in scientific publications,[7] while two of the most widely used software packages to implement it (the mclust and flexmix R packages) have been downloaded over 14 million times.[8]

References[edit]

  1. ^ McNicholas, P.D. (2016). Mixture Model-Based Classification. Chapman & Hall/CRC Press. ISBN 9781482225662.
  2. ^ McNicholas, P.D. (2016). "Model-based clustering". Journal of Classification. 33 (3): 331–373. doi:10.1007/s00357-016-9211-9.
  3. ^ a b Bouveyron, C.; Celeux, G.; Murphy, T.B.; Raftery, A.E. (2019). "Section 2.8". Model-Based Clustering and Classification for Data Science: With Applications in R. Cambridge University Press. ISBN 9781108494205.
  4. ^ Wolfe, J.H. (1963). Object cluster analysis of social areas, M.A. thesis. University of California, Berkeley.
  5. ^ Wolfe, J.H. (1965). A computer program for the maximum-likelihood analysis of types. USNPRA Technical Bulletin 65-15 (Report). US Naval Pers. Res. Act., San Diego, CA.
  6. ^ Wolfe, J.H. (1970). "Pattern clustering by multivariate mixture analysis". Multivariate Behavioral Research. 5 (3): 329–350. doi:10.1207/s15327906mbr0503_6. PMID 26812701.
  7. ^ Assessed by adding the citations to all articles with "model-based clustering" in the title enumerated by Google Scholar, accessed March 3, 2024
  8. ^ https://www.datasciencemeta.com/rpackages, accessed March 3, 2024