IDML is a data preparation language designed to process unstructured data at high volumes.


  • Accessible - Designed for Analysts, Product Managers, Tech Writers, Support, Sales Engineers and Software Engineers alike
  • Concise - Free yourself from error-prone boilerplate code like null pointer checks and parse exception handlers
  • Extensible - Out of the box it has support for things like regular expressions, email and geolocation but it’s easy to add new modules
  • High-performance - Built for firehoses
  • Java-based - Integrates with the emerging big data stack, including Hadoop MapReduce and Kafka


IDML is a JVM-first data transform language


Transformations are always pure, and will return the same output every time


All actions can fail safely to ensure as much of your data was mapped as possible