Michael Zeller

Michael Zeller

San Diego, California, United States
8K followers 500+ connections

About

What inspires me is innovative technology that has the potential to change our world and…

Articles by Michael

See all articles

Activity

Join now to see all activity

Experience

  • Temasek Graphic

    Temasek

    Singapore

  • -

    Singapore

  • -

    Singapore, Singapore

  • -

    San Francisco Bay Area

  • -

    Greater San Diego Area

  • -

    Greater San Diego Area

  • -

Education

Licenses & Certifications

Publications

  • Data Privacy in a Globally Competitive Reality

    TDWI Upside

    Protecting consumer privacy is key to securely providing the huge data sets required for innovations in AI analytics.

    See publication
  • The ABCs of Data Science Algorithms

    InformationWeek

    Data science algorithms are never a one-size-fits-all solution. Do you know what makes sense for your business?

    See publication
  • Offene Plattformen als Erfolgsfaktoren für Künstliche Intelligenz

    Künstliche Intelligenz: Mit Algorithmen zum wirtschaftlichen Erfolg. Editors: Buxmann, Peter, Schmidt, Holger. Springer

    Other authors
    See publication
  • Q&A with Michael Zeller

    ODBMS.org Operational Database Management Systems

  • Künstliche Intelligenz braucht schlaue menschliche Köpfe

    PingIT - Impulse für das digitale Unternehmen

  • Representing Model Ensembles in PMML

    useR! 2014 Conference

    The R pmml package is now able to export PMML for ensemble models via the ada and randomForest functions. In this presentation, we describe all the steps necessary to export random forest and stochastic boosting models from R into PMML and show how the PMML standard is capable of representing not only model ensembles but also any R specified treatments for missing and invalid values as well as outliers. Additional functions available to the data scientist through the R pmml package include the…

    The R pmml package is now able to export PMML for ensemble models via the ada and randomForest functions. In this presentation, we describe all the steps necessary to export random forest and stochastic boosting models from R into PMML and show how the PMML standard is capable of representing not only model ensembles but also any R specified treatments for missing and invalid values as well as outliers. Additional functions available to the data scientist through the R pmml package include the ability to perform data pre- and post-processing.

    Other authors
    See publication
  • The pmmlTransformations Package

    useR! 2014 Conference

    The R pmmlTransformations package implements many of the commonly used transformation operators used by data scientists, among them the Z-transform, linear transformation, data discretization, data normalization and value mapping. The result is not only the transformed data itself but also information to represent the transformation operators in PMML format.

    Other authors
    See publication
  • Extending the Naive Bayes Model Element in PMML: Adding Support for Continuous Input Variables

    ACM

    The Predictive Model Markup Language (PMML) is the de facto standard to represent data mining and predictive analytic models. With PMML, one can easily share a predictive solution among PMML-compliant applications and systems.

    PMML as a standard has evolved significantly over the years. PMML 4.1, the language’s latest version represents a major leap forward in terms of its ability to represent data post-processing and multiple models. It also provides entirely new model elements for…

    The Predictive Model Markup Language (PMML) is the de facto standard to represent data mining and predictive analytic models. With PMML, one can easily share a predictive solution among PMML-compliant applications and systems.

    PMML as a standard has evolved significantly over the years. PMML 4.1, the language’s latest version represents a major leap forward in terms of its ability to represent data post-processing and multiple models. It also provides entirely new model elements for supporting Scorecards and K-Nearest Neighbors. The same is no exception for PMML 4.2, currently being worked on by the Data Mining Group (DMG), the body responsible for maintaining and advancing the PMML standard. PMML 4.2 is bound to offer new elements and increased capabilities. This article describes one of such improvement. In particular, it proposes extending the existing model element for Naïve Bayes Classifiers to support continuous input fields.

    The R Project is a popular choice for data miners to analyze and build predictive models. Naïve Bayes is just one of a myriad of model types supported by R. The R e1071 package provides a
    naiveBayes function to build Naïve Bayes Models using categorical as well as continuous fields. The R pmml package has been recently extended to allow for the export of PMML code for objects built with the naiveBayes function. For now, it includes a PMML Extension element for continuous fields, but with the release of PMML 4.2, the support will be standardized. This article describes this process in view of our proposal to extend the current model element for Naïve Bayes Models.

    Other authors
    See publication
  • The R pmmlTransformations Package

    ACM

    As the de facto standard for data mining models, the Predictive Model Markup Language (PMML) provides tremendous benefits for business, IT, and the data mining industry in general. Due to the cross-platform and vendor-independent nature of such an open-standard, it allows for predictive models to be easily moved between applications.

    Although PMML has offered support for common data transformations for quite some time now, the release of PMML 4.0 in 2009 brought the support for data…

    As the de facto standard for data mining models, the Predictive Model Markup Language (PMML) provides tremendous benefits for business, IT, and the data mining industry in general. Due to the cross-platform and vendor-independent nature of such an open-standard, it allows for predictive models to be easily moved between applications.

    Although PMML has offered support for common data transformations for quite some time now, the release of PMML 4.0 in 2009 brought the support for data pre-processing steps to a new level. As a consequence, several data mining tools and model building platforms have been adding more and more support for data pre-processing into the PMML code they export. It is no
    surprise then that the same is true for R.

    R has become a popular statistical platform for all things analytics. The R Project allows for a myriad of specialized packages to be installed and utilized by its users as needed. These include packages and functions for predictive analytics and model building. A package for exporting PMML out of several model types is also available. Called the pmml package, it allows for a few data pre-processing steps to be exported together with the modeling technique itself. However, a package to enable data transformations in a generic way was still missing.

    This paper describes a package which intends to close this gap. The pmmlTransformations package provides R users with functions that greatly enhance the available data mining capabilities and PMML support by allowing transformations to be performed on the data before it is used for modeling. The pmmlTransformations package works in tandem with the pmml package so that data pre-processing can be represented together with the model in the resulting PMML code.

    Other authors
    See publication
  • Massively parallel in-database predictions using PMML

    Proceedings of the 2011 ACM SIGKDD Workshop on Predictive Model Markup Language

    Like all open standards, the Predictive Model Markup Language (PMML) enables interoperability and portability in the world of data mining and predictive analytics. This means that models developed in any environment and tool set can be deployed and used in a completely different system. Such a level of flexibility creates new opportunities for addressing exceedingly demanding business agility and performance requirements.

    One of these requirements is the urgent need to apply the power of…

    Like all open standards, the Predictive Model Markup Language (PMML) enables interoperability and portability in the world of data mining and predictive analytics. This means that models developed in any environment and tool set can be deployed and used in a completely different system. Such a level of flexibility creates new opportunities for addressing exceedingly demanding business agility and performance requirements.

    One of these requirements is the urgent need to apply the power of predictive analytics to derive reliable predictions and, hence, business decisions from vast amounts of data collected by many organizations. In this paper, we discuss how PMML enables embedding advanced predictive models directly into the database or the data warehouse, along side the actual data to be scored. More importantly, we show how we can easily take advantage of highly parallel database architectures to efficiently derive predictions from very large volumes of data.

    Other authors
    See publication
  • The PMML Path Towards True Interoperability in Data Mining

    Proceedings of the 17th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

    As the de facto standard for data mining models, the Predictive Model Markup Language (PMML) provides tremendous benefits for business, IT, and the data mining industry in general, since it allows for predictive models to be easily moved between applications. Due to the cross-platform and vendor-independent nature of such an open-standard, auto-generated PMML code is often represented in different versions of PMML. A tool may export PMML 2.1 and another import PMML 4.0. This problem raises the…

    As the de facto standard for data mining models, the Predictive Model Markup Language (PMML) provides tremendous benefits for business, IT, and the data mining industry in general, since it allows for predictive models to be easily moved between applications. Due to the cross-platform and vendor-independent nature of such an open-standard, auto-generated PMML code is often represented in different versions of PMML. A tool may export PMML 2.1 and another import PMML 4.0. This problem raises the issue of conversion. For true interoperability, PMML needs to be easily converted from one version to another.
    In this paper, we describe the capabilities associated with the “PMML Converter”. This application represents a great step in the PMML path towards true interoperability in data mining. Besides converting older versions of PMML to its latest, the PMML converter checks PMML files for syntax issues and, if issues are encountered, automatically corrects them.
    This paper also describes the capabilities associated with an interactive PMML-based application, the “Transformations Generator.” Auto-generated PMML code can omit important data pre-processing steps which are an integral part of a predictive solution. The Transformations Generator aims to bridge this gap by providing a graphical interface for the development and expression of data pre-processing steps in PMML.

    Other authors
    See publication
  • Efficient deployment of predictive analytics through open standards and cloud computing

    ACM SIGKDD Explorations Newsletter: Volume 11, Issue 1

    Over the past decade, we have seen tremendous interest in the application of data mining and statistical algorithms, first in research and science and, more recently, across various industries. This has translated into the development of a myriad of solutions by the data mining community that today impact scientific and business applications alike. However, even in this scenario, interoperability and open standards still lack broader adoption among data miners and modelers.

    In this…

    Over the past decade, we have seen tremendous interest in the application of data mining and statistical algorithms, first in research and science and, more recently, across various industries. This has translated into the development of a myriad of solutions by the data mining community that today impact scientific and business applications alike. However, even in this scenario, interoperability and open standards still lack broader adoption among data miners and modelers.

    In this article we highlight the use of the Predictive Model Markup Language (PMML) standard, which allows for models to be easily exchanged between analytic applications. With a focus on interoperability and PMML, we also discuss here emerging trends in cloud computing and Software as a Service, which have already started to play a critical role in promoting a more effective implementation and widespread application of predictive models.

    As an illustration of how the benefits of open standards and cloud computing can be combined, we describe a predictive analytics scoring engine platform that leverages these elements to deliver an efficient deployment process for statistical models.

    Other authors
    See publication
  • Open standards and cloud computing: KDD-2009 panel report

    Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

    At KDD-2009 in Paris, a panel on open standards and cloud computing addressed emerging trends for data mining applications in science and industry. This report summarizes the answers from a distinguished group of thought leaders representing key software vendors in the data mining industry.

    Supporting open standards and the Predictive Model Markup Language (PMML) in particular, the panel members discuss topics regarding the adoption of prevailing standards, benefits of interoperability…

    At KDD-2009 in Paris, a panel on open standards and cloud computing addressed emerging trends for data mining applications in science and industry. This report summarizes the answers from a distinguished group of thought leaders representing key software vendors in the data mining industry.

    Supporting open standards and the Predictive Model Markup Language (PMML) in particular, the panel members discuss topics regarding the adoption of prevailing standards, benefits of interoperability for business users, and the practical application of predictive models. We conclude with an assessment of emerging technology trends and the impact that cloud computing will have on applications as well as licensing models for the predictive analytics industry.

    Other authors
    See publication
  • Speech/gesture interface to a visual-computing environment

    IEEE Computer Graphics and Applications. Volume: 20 , Issue: 2

  • Vision-based motion planning for a robot arm using topology representing networks

    Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), Vol 2, 1900-1905

    Other authors
    See publication
  • Topologieerhaltende Neuronale Netzwerke in der Robotik: Adaptive visuo motorische Kontrolle und intelligente Pfadplanung.

    Pro Universitate Verlag, Sinzheim

  • A visual computing environment for very large scale biomolecular modeling

    Proceedings IEEE International Conference on Application-Specific Systems, Architectures and Processors

  • Learning the perceptual control manifold for sensor-based robot path planning

    Proceedings 1997 IEEE International Symposium on Computational Intelligence in Robotics and Automation CIRA'97. Towards New Computational Principles for Robotics and Automation

    Other authors
    See publication
  • Motion planning of a pneumatic robot using a neural network

    IEEE Control Systems Magazine. Volume: 17 , Issue: 3

    Other authors
    See publication
  • Vision-based motion planning of a pneumatic robot using a topology representing neural network

    Applications of Neural Adaptive Control Technology, 181-204. J. Kalkkuhl, K. Hunt, R. Zbikowski, A. Dzielinski (Editors). World Scientific Series in Robotics and Intelligent Systems, Vol. 17. World Scientific Publishing

    Other authors
    See publication
  • Vision-based motion planning of a pneumatic robot using a topology representing neural network

    Proceedings of the 1996 IEEE International Symposium on Intelligent Control

    Other authors
    See publication
  • Speech/gesture interface to a visual computing environment for molecular biologists

    Proceedings of the 13th International Conference on Pattern Recognition (ICPR). Vol. 3, 964-968. IEEE Computer Society Press

  • Topology representing maps and brain function

    Nova Acta Leopoldina NF 72, Nr. 294, 133-157

    Other authors
    • Klaus Schulten
    See publication
  • Topology representing network for sensor-based robot motion planning

    Proceedings of the 1996 World Congress on Neural Networks (WCNN) ) San Diego, 100-103. INNS Press

    Other authors
    See publication
  • Neural dynamics modeled by one-dimensional circle maps

    Chaos, Solitons & Fractals, Vol.5, No.6

    Other authors
    • Michael Bauer
    • Werner Martienssen
    See publication
  • Biological visuo-motor control of a pneumatic robot arm.

    Intelligent Engineering Systems Through Artificial Neural Networks Vol. 5, 645-650. ASME Press

    Other authors
    • Ken Wallace
    • Klaus Schulten
    See publication

Honors & Awards

  • ACM SIGKDD Service Award

    -

    The ACM SIGKDD Service Award is the highest recognition of service awarded in the field. The award honors an individual or group of collaborators for outstanding contributions to professional KDD societies or society-at-large through applications of knowledge discovery and data mining.

  • Gartner names Zementis a "Cool Vendor in Data Science, 2014"

    Gartner

    Zementis, a software company specializing in predictive analytics solutions for Big Data, announced that Gartner has included the company in its listing of “Cool Vendors in Data Science, 2014.” We are honored that Gartner has recognized Zementis as a ‘Cool Vendor’ as we believe this affirms our mission to empower clients through standards-based products which lower the cost and complexity of data science.

  • Zementis named as one of the "Top 20 most promising Big Data companies 2013"

    CIO Review

    Selected by a distinguished panel comprising of CEOs, CIOs, VCs, industry analysts and the editorial board of CIO Review, Zementis has been named by CIO Review as one of the "Top 20 Most Promising Big Data Companies in 2013."
    http://www.predictive-analytics.info/2013/10/cio-review-zementis-selected-as-one-of.html

Languages

  • English

    Native or bilingual proficiency

  • German

    Native or bilingual proficiency

More activity by Michael

View Michael’s full profile

  • See who you know in common
  • Get introduced
  • Contact Michael directly
Join to view full profile

Other similar profiles

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More

Others named Michael Zeller in United States

Add new skills with these courses