Declarative machine learning systems

  title={Declarative machine learning systems},
  author={Piero Molino and Christopher R'e},
  journal={Communications of the ACM},
  pages={42 - 49}
The future of machine learning will depend on it being in the hands of the rest of us. 

Figures from this paper

Looper: An end-to-end ML platform for product decisions
Looper covers the end-to-end ML lifecycle from collecting training data and model training to deployment and inference, and extends support to personalization, causal evaluation with heterogenous treatment effects, and Bayesian tuning for product goals.
Landing AI on Networks: An equipment vendor viewpoint on Autonomous Driving Networks
  • Dario Rossi, Liang Zhang
  • Computer Science
    IEEE Transactions on Network and Service Management
  • 2022
Challenges and opportunities of Autonomous Driving Network (ADN) driven by AI technologies are discussed, and a system view is presented, clarifying how AI can be successfully landed in the network architecture.
Responsible Data Integration: Next-generation Challenges
This tutorial presents a tutorial on data integration and responsibility, highlighting the existing efforts in responsible data integration along with research opportunities and challenges and encourages the community to audit data integration tasks with responsibility measures and develop integration techniques that optimize the requirements of responsible data science.
You Do Not Need a Bigger Boat: Recommendations at Reasonable Scale in a (Mostly) Serverless and Open Stack
This work proposes a template data stack for machine learning at “reasonable scale”, and details how modern open source can provide a pipeline processing terabytes of data with limited infrastructure work.
A Complete Bibliography of Publications in Communications of the ACM : 2020–2029
A* [11]. Above [53]. abuse [120]. accelerators [157]. access [120]. accessibility [133]. achieve [21]. ACM [103, 74, 96, 99]. Across [45, 84]. adapting [96]. Adding [64]. address [151]. adoption
A Taxonomy of Prompt Modifiers for Text-To-Image Generation
Text-to-image generation has seen an explosion of interest since 2021. Today, beautiful and intriguing digital images and artworks can be synthesized from textual inputs (“prompts”) with deep
Prompt Engineering for Text-Based Generative Art
Text-based generative art has seen an explosion of interest in 2021. Online communities around text-based generative art as a novel digital medium have quickly emerged. This short paper identifies


The hardware lottery
After decades of incentivizing the isolation of hardware, software, and algorithm development, the catalysts for closer collaboration are changing the paradigm.
Hidden Technical Debt in Machine Learning Systems
It is found it is common to incur massive ongoing maintenance costs in real-world ML systems, and several ML-specific risk factors to account for in system design are explored.
Real-World Learning with Markov Logic Networks
Application to a real-world university domain shows the Markov logic networks approach to be accurate, efficient, and less labor-intensive than traditional ones.
MLlib: Machine Learning in Apache Spark
MLlib is presented, Spark's open-source distributed machine learning library that provides efficient functionality for a wide range of learning settings and includes several underlying statistical, optimization, and linear algebra primitives.
DeepDive: Declarative Knowledge Base Construction
DeepDive is described, a system that combines database and machine learning ideas to help develop KBC systems, a long-standing problem in industry and research that encompasses problems of data extraction, cleaning, and integration.
Overton: A Data System for Monitoring and Improving Machine-Learned Products
Overton automates the life cycle of model construction, deployment, and monitoring by providing a set of novel high-level, declarative abstractions that shift developers to these higher-level tasks instead of lower-level machine learning tasks.
DeepDive is described, a system that combines database and machine learning ideas to help to develop KBC systems, to frame traditional extract-transform-load (ETL) style data management problems as a single large statistical inference task that is declaratively defined by the user.
SQLFlow: A Bridge between SQL and Machine Learning
Industrial AI systems are mostly end-to-end machine learning (ML) workflows. A typical recommendation or business intelligence system includes many online micro-services and offline jobs. We describe
Learning Probabilistic Relational Models
This paper describes both parameter estimation and structure learning -- the automatic induction of the dependency structure in a model and shows how the learning procedure can exploit standard database retrieval techniques for efficient learning from large datasets.
Ludwig: a type-based declarative deep learning toolbox
Ludwig is a flexible, extensible and easy to use toolbox which allows users to train deep learning models and use them for obtaining predictions without writing code, and introduces a general modularized deep learning architecture called Encoder-Combiner-Decoder that can be instantiated to perform a vast amount of machine learning tasks.