Declarative machine learning systems

  title={Declarative machine learning systems},
  author={Piero Molino and Christopher R'e},
  journal={Communications of the ACM},
  pages={42 - 49}
The future of machine learning will depend on it being in the hands of the rest of us. 

Figures from this paper

Landing AI on Networks: An equipment vendor viewpoint on Autonomous Driving Networks

  • Dario RossiLiang Zhang
  • Computer Science
    IEEE Transactions on Network and Service Management
  • 2022
Challenges and opportunities of Autonomous Driving Network (ADN) driven by AI technologies are discussed, and a system view is presented, clarifying how AI can be successfully landed in the network architecture.

Responsible Data Integration: Next-generation Challenges

This tutorial presents a tutorial on data integration and responsibility, highlighting the existing efforts in responsible data integration along with research opportunities and challenges and encourages the community to audit data integration tasks with responsibility measures and develop integration techniques that optimize the requirements of responsible data science.

You Do Not Need a Bigger Boat: Recommendations at Reasonable Scale in a (Mostly) Serverless and Open Stack

This work proposes a template data stack for machine learning at “reasonable scale”, and details how modern open source can provide a pipeline processing terabytes of data with limited infrastructure work.

A Complete Bibliography of Publications in Communications of the ACM : 2020–2029

A* [11]. Above [53]. abuse [120]. accelerators [157]. access [120]. accessibility [133]. achieve [21]. ACM [103, 74, 96, 99]. Across [45, 84]. adapting [96]. Adding [64]. address [151]. adoption

A Taxonomy of Prompt Modifiers for Text-To-Image Generation

Text-to-image generation has seen an explosion of interest since 2021. Today, beautiful and intriguing digital images and artworks can be synthesized from textual inputs (“prompts”) with deep

Prompt Engineering for Text-Based Generative Art

Text-based generative art has seen an explosion of interest in 2021. Online communities around text-based generative art as a novel digital medium have quickly emerged. This short paper identifies



The hardware lottery

After decades of incentivizing the isolation of hardware, software, and algorithm development, the catalysts for closer collaboration are changing the paradigm.

Hidden Technical Debt in Machine Learning Systems

It is found it is common to incur massive ongoing maintenance costs in real-world ML systems, and several ML-specific risk factors to account for in system design are explored.

Real-World Learning with Markov Logic Networks

Application to a real-world university domain shows the Markov logic networks approach to be accurate, efficient, and less labor-intensive than traditional ones.

MLlib: Machine Learning in Apache Spark

MLlib is presented, Spark's open-source distributed machine learning library that provides efficient functionality for a wide range of learning settings and includes several underlying statistical, optimization, and linear algebra primitives.

DeepDive: Declarative Knowledge Base Construction

DeepDive is described, a system that combines database and machine learning ideas to help develop KBC systems, a long-standing problem in industry and research that encompasses problems of data extraction, cleaning, and integration.

Neural Architecture Search: A Survey

An overview of existing work in this field of research is provided and neural architecture search methods are categorized according to three dimensions: search space, search strategy, and performance estimation strategy.

Overton: A Data System for Monitoring and Improving Machine-Learned Products

Overton automates the life cycle of model construction, deployment, and monitoring by providing a set of novel high-level, declarative abstractions that shift developers to these higher-level tasks instead of lower-level machine learning tasks.


DeepDive is described, a system that combines database and machine learning ideas to help to develop KBC systems, to frame traditional extract-transform-load (ETL) style data management problems as a single large statistical inference task that is declaratively defined by the user.

SQLFlow: A Bridge between SQL and Machine Learning

Industrial AI systems are mostly end-to-end machine learning (ML) workflows. A typical recommendation or business intelligence system includes many online micro-services and offline jobs. We describe

Ludwig: a type-based declarative deep learning toolbox

Ludwig is a flexible, extensible and easy to use toolbox which allows users to train deep learning models and use them for obtaining predictions without writing code, and introduces a general modularized deep learning architecture called Encoder-Combiner-Decoder that can be instantiated to perform a vast amount of machine learning tasks.