
better data,
better AI.

Search, quantify and edit data for LLMs

trusted by

Alignment Lab AI

Product

Clustering

Semantic & keyword search

Edit & compare fields

PII, duplicates, language detection, or custom signals

Fuzzy-concept search with refinement

Lilac Garden

Blazing fast dataset computations

Cluster and title 1 million data points in 20 mins

Embed your dataset at half a billion tokens per min

Accelerate your own data transformations
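
To put the quoted throughput figures in context, here is a rough back-of-the-envelope sketch in Python. It only does arithmetic on the numbers stated above; it does not call Lilac Garden. The function names and example sizes are hypothetical, and the clustering estimate assumes time scales linearly with dataset size, which the page does not claim.

```python
# Figures quoted on this page (vendor claims, not measured here).
EMBED_TOKENS_PER_MIN = 500_000_000  # "half a billion tokens per min"
CLUSTER_POINTS = 1_000_000          # "1 million data points"
CLUSTER_MINUTES = 20                # "in 20 mins"


def embed_minutes(total_tokens: int) -> float:
    """Estimated minutes to embed `total_tokens` at the quoted rate."""
    return total_tokens / EMBED_TOKENS_PER_MIN


def cluster_minutes(num_points: int) -> float:
    """Estimated minutes to cluster and title `num_points`.

    ASSUMPTION: linear scaling from the quoted 1M-points-in-20-min figure.
    """
    return num_points * CLUSTER_MINUTES / CLUSTER_POINTS


if __name__ == "__main__":
    # Hypothetical dataset sizes, chosen only for illustration.
    print(embed_minutes(2_000_000_000))  # 2B tokens
    print(cluster_minutes(250_000))      # 250k data points
```

Under these assumptions, a 2-billion-token dataset would take about 4 minutes to embed, and 250,000 data points about 5 minutes to cluster and title.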

Jonathan Talmi

Lead of Data Acquisition

“Lilac is an incredibly powerful tool for data exploration and quality control. We use Lilac daily to inspect and evaluate datasets, and then democratize them across the org. It is a critical part of our data quality evaluation pipeline.”

Databricks

Jonathan Frankle

Chief Neural Network Scientist

“Lilac provides a simple path to understanding the concepts in datasets and selecting the right data for a task.”

NousResearch

Teknium

Co-founder

“Everyone working with LLM Datasets should check out @lilac_ai data platform…Their clustering helped determine a lot of topics Hermes-2.5 covers today.”

Get started with Lilac in minutes...

Install

pip install lilac

Python
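
A minimal sketch of starting Lilac from Python, assuming the `lilac` package is installed and exposes `start_server(project_dir=...)` as in its quick-start documentation. The wrapper name `launch_lilac` and the project path are hypothetical, chosen for illustration.

```python
def launch_lilac(project_dir: str = "~/my_project") -> None:
    """Start the Lilac web server for the given project directory.

    ASSUMPTION: `lilac.start_server(project_dir=...)` is available, as shown
    in the Lilac quick start. The import is deferred so that this module can
    be loaded even when `lilac` is not installed.
    """
    import lilac as ll

    ll.start_server(project_dir=project_dir)


# To launch the UI, call the helper explicitly:
# launch_lilac("~/my_project")
```

Once the server is running, datasets can be loaded and explored from the browser UI.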

User Interface
