Categorical Databases
Home |
Download |
Getting Started |
Manual |
Wiki |
Papers |
Screen Shots |
Github |
Google Group |
Conexus |
Contact
Read This First
The suggested introductory course of study includes an introductory
paper about CQL for computer scientists,
introductory
slides
and
video about CQL from a functional programming perspective,
introductory
slides
and
video about CQL from a mathematical perspective,
introductory
slides
about category theory from a knowledge management perspective,
a
paper
from
NIST arguing that category theory is critical for IT interoperability, and an
introductory
textbook
on category theory, as well as a data science
case study.
Papers
- Ologs: A Categorical Framework for Knowledge Representation (2011) [Pre-print]
| Describes how to use categories as database/ontology schemas.
- Functorial Data Migration (2013) [Pre-print]
| Describes how to migrate data between categorical databases.
- Database Queries and Constraints via Lifting Problems (2013) [Pre-print]
| Describes how certain common database queries and constraints can be encoded as topological lifts.
- Definition of CQL as a context-free grammar with equations (2015)
| Describes a canonical syntax associated with CQL, as well as an axiomatic semantics.
- Relational Foundations for Functorial Data Migration (2015) [Pre-print] [slides]
| Describes how to implement a fragment of CQL using SELECT/FROM/WHERE/UNION and a fresh-ID generator, and vice versa.
- QINL: Query-Integrated Languages (2015)
| Describes the relationship between CQL and comprehension/monad syntax.
- Algebraic Model Management: A Survey (2016) [Pre-print]
| Describes CQL entirely in terms of multi-sorted equational logic, and contrasts it with existing tools.
- Algebraic Databases (2017) [Pre-print]
| Describes CQL in detail, and in particular, user-defined functions.
- Algebraic Data Integration (2017) [Pre-print] [slides] [Aggregation supplement] [video]
| Describes CQL in detail, and in particular, how to implement it using automated theorem proving techniques and how to use it to integrate data.
- Informal Data Transformation Considered Harmful (2019)
| Describes how CQL can preserve the data quality required to power machine learning algorithms through various data management tasks.
- Fast Left Kan Extensions Using the Chase (2022)
| Describes CQL's sigma operation in detail, and in particular, how to implement it using a chase engine.
- Presenting Profunctors (2024)
| Motivated by problems in categorical database theory, we introduce and compare two notions of presentation for profunctors, uncurried and curried.
Case Studies
Presentations
- Boston Haskell (2011) [videos 1 2 3 4 5]
| Gives an introduction to the math behind CQL.
- Oracle (2014)
| Gives an introduction to the math behind CQL, tailored to database-centric audiences.
- Boston Haskell (2014) [video]
| Gives an introduction to the math behind CQL, tailored to functional-programming audiences.
- Lambda Conf (2017) [video]
| Gives an introduction to CQL for computer scientists.
- The Broad Institute (2017) [video]
| Gives an introduction to the math behind CQL, and lessons learned from applying it in practice.
- Dataversity Architecture Summit (2017)
| Gives an introduction to CQL tailored to data architects.
- Kensho (2019) [video]
| Gives an introduction CQL and the math behind it, and describes how it enables universal semantic IT inter-operability.
Related