180,000 Products Available Now

The Most Comprehensive Beauty Products Dataset

180,000 products with INCI ingredient lists, research-backed irritancy and comedogenicity ratings, and function tags. License the raw dataset today as SQLite, CSV, or JSONL, and use it however your product, formulator, or model needs.

Data updated as at 25 June 2026.

1000-product sample pack · CSV + JSON + Image · No payment required

One product from the dataset (JSONL)
{
  "id": 31,
  "brand": "The Ordinary",
  "name": "Niacinamide 10% + Zinc 1%",
  "image_name": "31.jpeg",
  "category": "skincare",
  "origin": "Canada",
  "contains_fragrance": false,
  "contains_drying_alcohol": false,
  "contains_parabens": false,
  "contains_sulfates": false,
  "contains_silicones": false,

180,000 products indexed
24,000 brands covered
0–5 safety ratings (irritancy · comedogenicity)
~18GB image archive (ID-matched, white background)

The dataset

With All The Key Facts About Beauty Products

Aggregated from publicly available product information, then cleaned, normalized, and mapped to standard INCI definitions (Personal Care Products Council), so you can ship with it on day one.

180,000 Products

A massive catalog of global brands and formulations, ready for your use.

  • Brand & Product mapping
  • Category & country of origin
  • Ingredients list
  • Fragrance, alcohol, paraben, sulfate & silicone flags

Ingredients Intelligence

Every ingredient is parsed into a structured object for programmatic analysis and logic.

  • INCI names & common aliases
  • Functional classifications
  • Expert ingredient ratings
  • Concentration where publicly disclosed

Safety

Dermatological safety scores provided at the ingredient level for advanced filtering.

  • Comedogenic ratings (0–5)
  • Irritancy scores (0–5)
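
The ingredient-level scores lend themselves to simple threshold filters. The sketch below is illustrative only: the key names `irritancy` and `comedogenicity` and the ingredient entries are placeholders, not real dataset rows, and the actual field names should be taken from the schema documentation.

```python
def within_limits(ingredients, max_irritancy=1, max_comedogenicity=1):
    """Return True when every ingredient is at or below both thresholds.

    Note: an empty ingredient list also returns True, so callers may want
    to treat products without ingredient data separately.
    """
    return all(
        ing["irritancy"] <= max_irritancy
        and ing["comedogenicity"] <= max_comedogenicity
        for ing in ingredients
    )


# Placeholder ingredients with made-up scores on the 0-5 scale.
gentle = [{"name": "Ingredient A", "irritancy": 0, "comedogenicity": 1}]
harsh = gentle + [{"name": "Ingredient B", "irritancy": 3, "comedogenicity": 0}]

print(within_limits(gentle), within_limits(harsh))
```

The thresholds are parameters rather than constants because a "sensitive skin" cutoff is a product decision, not something the dataset prescribes.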

18GB Image Archive

High-resolution product photography mapped to every database entry via unique IDs.

  • ID-matched filenames
  • Standardized format with white background
  • Optimized for AI training
  • Ready for e-commerce UIs
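
In the sample record, product `31` carries `image_name` `31.jpeg`, so resolving a record to its image file can be sketched as below. The `images` directory and the helper name `image_path` are assumptions for illustration, not part of the documented bundle.

```python
from pathlib import Path


def image_path(record: dict, image_root: str = "images") -> Path:
    """Return the expected image location for a product record.

    Assumes the image zips were extracted into `image_root` (a hypothetical
    directory) and that `image_name` holds the bare filename.
    """
    return Path(image_root) / record["image_name"]


record = {"id": 31, "image_name": "31.jpeg"}
print(image_path(record).as_posix())
```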

Coverage & quality

Is your brand covered?

Check the data coverage on the page for your selected tier.

See the tiers & pricing →

Trusted by data scientists across the world

Hugging Face · Kaggle · Datarade

Who it’s for

This Dataset is Built For You

From recommendation models to formulation experts, teams use the same clean reference to skip months of data wrangling.

E-commerce & Retail

Enrich your storefront and search results using our beauty products data. Boost conversion and lower return rates with expert ingredient tooltips and safety badges.

What you'd build:

  • Ingredient-rich product pages that generate free organic leads
  • Product pages with full ingredient lists to increase conversions
  • Comedogenicity & irritancy filtering for sensitive skin
  • Trust signals & clean-beauty safety badges

Beauty-Tech Founders

Stop wasting months gathering data. Launch your routine-builder or skin-checker app on day one.

What you'd build:

  • Ingredient conflict logic based on comprehensive INCI ingredients list
  • Personalized routine builders using comedogenic & irritancy ratings
  • Skin-type matching engines
  • Filter-by-ingredient systems for 180,000 products
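
A filter-by-ingredient system could be built on an inverted index from ingredient name to product ids. This is a minimal sketch, assuming the link data can be read as `(product_id, ingredient_name)` pairs; the pairs shown are placeholders, and the function names are hypothetical.

```python
from collections import defaultdict


def build_index(product_ingredients):
    """Map each lowercased ingredient name to the set of product ids using it."""
    index = defaultdict(set)
    for product_id, ingredient_name in product_ingredients:
        index[ingredient_name.lower()].add(product_id)
    return index


def products_with_all(index, names):
    """Return ids of products that contain every ingredient in `names`."""
    sets = [index.get(name.lower(), set()) for name in names]
    return set.intersection(*sets) if sets else set()


# Placeholder (product_id, ingredient_name) pairs, not real dataset rows.
pairs = [(1, "Ingredient A"), (1, "Ingredient B"), (2, "Ingredient A")]
index = build_index(pairs)

print(sorted(products_with_all(index, ["ingredient a", "Ingredient B"])))
```

Building the index once keeps each lookup proportional to the number of matching products instead of scanning the full catalog per query.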

Market Research & R&D

Benchmark global formulations. Track ingredient trends across 180,000 beauty products and identify market gaps with our beauty products CSV dataset.

What you'd build:

  • Global competitor product benchmarking
  • Ingredient & formulation trends in the beauty products JSONL dataset
  • AI & Machine Learning training to decode packaging-to-ingredient correlations
  • High-accuracy computer vision models for automated retail audits

"We were sizing up a 4-month web-scraping sprint to build out our skincare routine-builder app. Buying this dataset allowed us to map the products on day one. The INCI normalization alone saved our engineering team weeks of data wrangling. The image archive was perfectly standardized on a pure white background, meaning we didn’t have to hire editors. A massive win."
Rohan, Founder · Skincare Specialist App

Integration

How It Works

License, download, and drop the data into the stack you already use. No vendor SDK required.

01

License the dataset

Pick the format (SQLite, CSV, or JSONL) that fits your use case. The full data schema ships in the same bundle.

What you download
├── products.csv
├── ingredients.csv
├── product_ingredients.csv
├── products.jsonl
├── beauty.db
├── schema.md
└── images_{1-7}.zip

02

Integrated in minutes

Import the JSONL or CSV directly into your database, vector store, or Python environment. The schema is flat and clean, ready for analysis on day one.
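
Reading the JSONL export in Python needs only the standard library. The sketch below uses an in-memory stream as a stand-in for `products.jsonl`; the first line reuses fields from the sample record, while the second line is a made-up placeholder.

```python
import io
import json


def read_jsonl(stream):
    """Yield one dict per non-empty line of a JSONL stream."""
    for line in stream:
        line = line.strip()
        if line:
            yield json.loads(line)


# Stand-in for open("products.jsonl", encoding="utf-8").
sample = io.StringIO(
    '{"id": 31, "brand": "The Ordinary", "name": "Niacinamide 10% + Zinc 1%", '
    '"contains_fragrance": false}\n'
    '{"id": -1, "brand": "Example Brand", "name": "Placeholder Product", '
    '"contains_fragrance": true}\n'
)

products = list(read_jsonl(sample))
fragrance_free = [p["name"] for p in products if not p["contains_fragrance"]]
print(fragrance_free)
```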

Query it like any table
-- Find fragrance-free products with a key active
-- (PostgreSQL syntax: @> is JSONB containment, so ingredients must be a JSONB column)
SELECT name, brand, image_name
FROM   products
WHERE  contains_fragrance = false
  AND  ingredients @> '[{"rating": "direct actives"}]'
ORDER  BY brand ASC
LIMIT  5;
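
The `@>` operator in the query above is PostgreSQL JSONB containment, which SQLite does not provide. Against the bundled `beauty.db`, a similar lookup might go through the product-ingredient link table instead. This is a minimal sketch under stated assumptions: the column names (`products.id`, `ingredients.rating`, `product_ingredients.product_id` and `ingredient_id`) and the ingredient id are hypothetical, so check `schema.md` for the real ones. An in-memory database stands in for `beauty.db`.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # stand-in for sqlite3.connect("beauty.db")
conn.executescript(
    """
    CREATE TABLE products (id INTEGER PRIMARY KEY, brand TEXT, name TEXT,
                           image_name TEXT, contains_fragrance INTEGER);
    CREATE TABLE ingredients (id INTEGER PRIMARY KEY, name TEXT, rating TEXT);
    CREATE TABLE product_ingredients (product_id INTEGER, ingredient_id INTEGER);

    INSERT INTO products VALUES
        (31, 'The Ordinary', 'Niacinamide 10% + Zinc 1%', '31.jpeg', 0);
    INSERT INTO ingredients VALUES (1, 'Niacinamide', 'direct actives');
    INSERT INTO product_ingredients VALUES (31, 1);
    """
)

# Join through the link table; the rating value is bound as a parameter.
QUERY = """
SELECT DISTINCT p.name, p.brand, p.image_name
FROM   products p
JOIN   product_ingredients pi ON pi.product_id = p.id
JOIN   ingredients i          ON i.id = pi.ingredient_id
WHERE  p.contains_fragrance = 0
  AND  i.rating = ?
ORDER  BY p.brand ASC
LIMIT  5;
"""

rows = conn.execute(QUERY, ("direct actives",)).fetchall()
print(rows)
```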

03

Build what your users see

Render the same record as a live product UI with safety badges, ingredient ratings, and conflict checks. Once licensed, this layer is yours.

The Ordinary

Niacinamide 10% + Zinc 1%

alcohol-free · fragrance-free
Niacinamide · 10% · direct actives
Zinc PCA · 1% · supporting actives
irritancy 0/5
comedogenicity 0/5

Example UI rendered from one dataset record. After licensing, this layer is yours to design.
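
Badges like the ones in this example can be derived from the boolean `contains_*` flags on a record. A rough sketch follows: the flag names come from the sample record, while the label wording beyond "alcohol-free" and "fragrance-free" and the helper name are assumptions.

```python
# Flag names are from the sample record; label text is partly assumed.
FLAG_BADGES = {
    "contains_fragrance": "fragrance-free",
    "contains_drying_alcohol": "alcohol-free",
    "contains_parabens": "paraben-free",
    "contains_sulfates": "sulfate-free",
    "contains_silicones": "silicone-free",
}


def free_from_badges(record):
    """Return a badge for each flag that is explicitly False.

    A missing flag yields no badge, so absent data never becomes a claim.
    """
    return [
        label for flag, label in FLAG_BADGES.items() if record.get(flag) is False
    ]


record = {
    "contains_fragrance": False,
    "contains_drying_alcohol": False,
    "contains_parabens": False,
    "contains_sulfates": False,
    "contains_silicones": False,
}
print(free_from_badges(record))
```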

Licensing

Choose Your Tier

One-time license for 180,000 products, yours forever.

Data refreshes at $0.0025/data point · Dataset licensees lock in early supporter API pricing

Products & Ingredients

$600 one-time

Comprehensive product data with ingredient functionality and comedogenicity.

  • 180,000 products, including brand, name, category, origin & safety flags
  • Normalized ingredient dictionary, including irritancy, comedogenicity, ratings, functions
  • JSONL, SQLite and CSV formats
  • Full schema documentation

Most Popular

The Complete Archive

$950 one-time

Comprehensive product data with ingredient functionality and comedogenicity, plus the full image archive.

  • Everything in Products & Ingredients, plus:
  • 180,000 product images (~18GB)

Questions

Frequently Asked Questions

The essentials below. See the full FAQ for coverage, licensing, and the API.

What is included in the dataset?

The data comes in three tiers, all built on the same 180,000 products with brand names, product category, country of origin, and safety flags (fragrance, drying alcohol, parabens, sulfates, silicones). Basic Product Index adds a flat, label-order ingredient list per product. Products & Ingredients adds a normalized ingredient dictionary (irritancy, comedogenicity, ratings, functions) with a product↔ingredient link table. The Complete Archive is everything in Products & Ingredients plus the full product-image archive (~18GB, 180,000 images).

How is the data licensed?

Every tier includes an indefinite commercial license to the snapshot you purchase. You may integrate the data into your own app, run queries, and display the product images and information to your end-users. However, you may not redistribute, resell, or publicly host the raw files (CSV/JSONL/SQLite/images) for third parties to download. Future updates are sold separately (refer to the next question).

How often is the dataset updated, and how do I stay current?

We run continuous background audits and refresh the master dataset ad hoc, at least once a month (most recent: 25 June 2026). The dataset you buy is a static snapshot at the time of purchase. You have two ways to stay current: (1) Differential refreshes, where you buy only the data points that changed since your last purchase at $0.0025 per updated data point. (2) The hosted API for real-time freshness.
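
The cost of a differential refresh follows directly from the per-point price. The sketch below only multiplies the stated $0.0025 rate by a count you supply. How a "data point" is counted is not defined here, so treat the input as an assumption, and note that the function name is hypothetical.

```python
from decimal import Decimal

PRICE_PER_DATA_POINT = Decimal("0.0025")  # USD per updated data point


def refresh_cost(changed_data_points: int) -> Decimal:
    """Return the USD cost of refreshing the given number of data points."""
    if changed_data_points < 0:
        raise ValueError("changed_data_points must be non-negative")
    return PRICE_PER_DATA_POINT * changed_data_points


print(refresh_cost(10_000))
```

At that rate, 10,000 changed data points would come to $25. `Decimal` is used instead of `float` to avoid rounding drift in currency arithmetic.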

Will there be a live API?

Yes. A hosted REST API for real-time lookups is launching Fall 2026. It will support search by name, filtering by ingredient, batch requests, and a partner feedback loop to flag products for inclusion. Final pricing will be announced closer to launch; by licensing the full dataset today, you lock in early supporter-only rates and priority access.

Get started

The Definitive Skincare & Beauty Products Dataset

A massive, normalized database of 180,000 skincare and makeup products for market researchers, e-commerce brands, and developers. Access high-quality metadata in SQLite, CSV, and JSONL formats, with INCI ingredients and ~18GB of high-res product images.