Jump to content

Michael Berthold

From Wikipedia, the free encyclopedia

Michael R. Berthold
Michael presenting at the 2022 KNIME Fall Summit
NationalityGerman
Occupation(s)Computer scientist, entrepreneur, and author
AwardsKS Fu Award, the North American Fuzzy Information Processing Society
Fellow, The Institute of Electrical and Electronics Engineers (IEEE)
Honorary Professor, Óbuda University, Budapest
Academic background
Alma materUniversity of Karlsruhe, Germany
Academic work
InstitutionsKonstanz University, Germany

Michael R. Berthold is a German computer scientist, entrepreneur, academic and author. He held the chair for bioinformatics and information mining at Konstanz University, and is an honorary professor at Óbuda University.[1] He is also the co-founder of KNIME, and is serving as a president and CEO of KNIME AG since 2017.[2]

Berthold has authored over 250 publications while focusing his research on usage of machine learning methods for the interactive analysis of large information repositories. He is the editor and co-author of textbooks, including, Guide To Intelligent Data Science, and Intelligent Data Analysis.[3]

Berthold is a Fellow of the Institute of Electrical and Electronics Engineers (IEEE), the past president of the North American Fuzzy Information Processing Society,[4] and past president of the IEEE Systems, Man, and Cybernetics Society. He is an associate editor of Data Mining and Knowledge Discovery (DMKD),[5] Knowledge and Information Systems (KIS), Journal of Cheminformatics (JCIS),[6] and International Journal of Computational Intelligence in Bioinformatics and Systems Biology (IJCIBSB). He has been involved in the organization of various conferences, most notably the IDA-series of symposia on Intelligent Data Analysis.[7]

Early life and education

[edit]

Berthold was born in 1966 in Stuttgart, Germany. He received his MSc degree in computer science in 1992, and his Dr.rer.nat. degree in 1997, both from Karlsruhe University.[8]

He is a great-grandson of Prof. Gottfried Berthold [de], professor for botany at Göttingen University from 1887 until 1923.

Career

[edit]

Berthold started his academic career as a visiting researcher at Carnegie Mellon University in 1991. He then held appointments as a visiting researcher at the University of Sydney in 1994, and as a researcher at the University of Karlsruhe in 1993. From 1997 till 2000, he was a BISC Research Fellow and lecturer at the University of California, Berkeley. From 2003 until 2024, he was a full professor, and chair for bioinformatics and information mining at Konstanz University, Germany.[8] In 2017 he took a leave of absence to become full-time CEO at KNIME AG, Zurich, Switzerland.[9]

At IEEE, Berthold served as a president of the IEEE System, Man, and Cybernetics Society from 2010 till 2011.[10]

Research

[edit]

Berthold has focused his research on large, and heterogeneous data sources, with particular focus on methods from AI (rule learning, neural networks, fuzzy logic and general machine learning).[11]

Fuzzy models

[edit]

Berthold has published on methods to extract fuzzy models from data based on constructive methods to build probabilistic neural networks.[12] He developed similar algorithms for the extraction of fuzzy rule models.[13] He then extended those models beyond classification and invented algorithms to extract regression models, so-called fuzzy graphs from data automatically.[13]

Bisociative knowledge discovery

[edit]

At Konstanz University, Berthold initiated a European project (EU FP7 BISON) that focused on bisociative methods to create insights from diverse data sources. The consortium created output summarized in the resulting edited volume Bisociative Knowledge Discovery.[14]

Widening of machine learning algorithms

[edit]

Berthold was the first to introduce the idea of widened machine learning which draws on parallel resources to improve model accuracy rather than the usual focus on speed-up. He discussed a number of generic ways of tuning data mining algorithms while providing a series of experiments.[15][16] Later on, he conducted an in-depth analysis of the concept of Widened Data Mining, which aims at reducing the impact of heuristics by exploring more than just one suitable solution at each step.[17] In 2017, Berthold and his team proposed the bucket selector, a model-independent randomized selection strategy, with the capability to perform better than existing selection strategies in cases without a diversity measure.[18]

Data science design patterns

[edit]

In 2023, Berthold and his co-authors introduced the notion of visual design patterns for data science.[19] The presented methods are using graph patterns, lending themselves naturally to the data flow paradigm underlying most data science tools.

Awards and honors

[edit]
  • 2001 – KS Fu Award, North American Fuzzy Information Processing Society
  • 2010 – Fellow, The Institute of Electrical and Electronics Engineers (IEEE)[10]
  • 2011 – honorary professor, Óbuda University, Budapest

Bibliography

[edit]

Selected books

[edit]
  • Advances in Intelligent Data Analysis – Reasoning about Data (1997) ISBN 9783540408130
  • Intelligent Data Analysis 2nd Edition (2007) 9783540430605
  • Bisociative Knowledge Discovery (2012) ISBN 9783642318306
  • Guide To Intelligent Data Science (2020) ISBN 9783030455736

Selected articles

[edit]
  • Berthold, M., & Diamond, J. (1994). Boosting the performance of rbf networks with dynamic decay adjustment. Advances in neural information processing systems, 7.
  • Berthold, M. R., & Huber, K. P. (1998). Missing values and learning of fuzzy rules. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 6(02), 171–178.
  • Berthold, M. R., & Huber, K. P. (1999). Constructing fuzzy graphs from examples. Intelligent Data Analysis, 3(1), 37–53.
  • Eliceiri, K. W., Berthold, M. R., Goldberg, I. G., Ibáñez, L., Manjunath, B. S., Martone, M. E., ... & Carpenter, A. E. (2012). Biological imaging software tools. Nature methods, 9(7), 697–710.
  • Berthold, M. R., Fillbrunn, A., & Siebes, A. (2021). Widening: using parallel resources to improve model quality. Data Mining and Knowledge Discovery, 35(4), 1258–1286.
  • Berthold, M.R., Brookhart, D., Gerber, S., Hayasaka, S., Widmann, M. (2023). Towards Data Science Design Patterns. Advances in Intelligent Data Analysis XXI. IDA 2023. Lecture Notes in Computer Science, vol 13876. Springer.

References

[edit]
  1. ^ "Chair for Bioinformatics and Information Mining".
  2. ^ "Interview: Michael Berthold, President and Founder of KNIME, on Data Mining, Startups, and Visual Workflow".
  3. ^ "Books by Michael Berthold".
  4. ^ "NAFIPS".
  5. ^ "Data Mining and Knowledge Discovery".
  6. ^ "Journal of Cheminformatics".
  7. ^ "Michael R. Berthold – the dblp computer science bibliography".
  8. ^ a b "Prof. Dr. Michael Berthold – Universität Konstanz".
  9. ^ "KNIME Team".
  10. ^ a b "Michael R. Berthold – IEEE Xplore".
  11. ^ "Michael R Berthold – ResearchGate".
  12. ^ "Boosting the Performance of RBF Networks with Dynamic Decay Adjustment" (PDF).
  13. ^ a b "MISSING VALUES AND LEARNING OF FUZZY RULE" (PDF).
  14. ^ "Constructing fuzzy graphs from examples" (PDF).
  15. ^ "Widening: using parallel resources to improve model quality".
  16. ^ "Parallel Data Mining Revisited. Better, Not Faster".
  17. ^ "Diversity-Driven Widening".
  18. ^ "Bucket Selection: A Model-Independent Diverse Selection Strategy for Widening".
  19. ^ "Towards Data Science Design Patterns".