The Most Comprehensive Beauty Products Dataset
180,000 INCI products with ingredients, research-backed irritancy and comedogenicity ratings, and function tags. License the raw dataset today as SQLite, CSV or JSONL. Use it however your product, formulator, or model needs.
Data updated as at 25 June 2026.
1000-product sample pack · CSV + JSON + Image · No payment required
{
"id": 31,
"brand": "The Ordinary",
"name": "Niacinamide 10% + Zinc 1%",
"image_name": "31.jpeg",
"category": "skincare",
"origin": "Canada",
"contains_fragrance": false,
"contains_drying_alcohol": false,
"contains_parabens": false,
"contains_sulfates": false,
"contains_silicones": false,The dataset
With All The Key Facts About Beauty Products
Aggregated from publicly available product information, then cleaned, normalized, and mapped to standard INCI definitions (Personal Care Products Council), so you can ship with it on day one.
180,000 Products
A massive catalog of global brands and formulations, ready for your use.
- Brand & Product mapping
- Category & country of origin
- Ingredients list
- Fragrance, alcohol, paraben, sulfate & silicone flags
Ingredients Intelligence
Every ingredient is parsed into a structured object for programmatic analysis and logic.
- INCI names & common aliases
- Functional classifications
- Expert ingredient ratings
- Concentration where publicly disclosed
Safety
Dermatological safety scores provided at the ingredient level for advanced filtering.
- Comedogenic ratings (0–5)
- Irritancy scores (0–5)
18GB Image Archive
High-resolution product photography mapped to every database entry via unique IDs.
- ID-matched filenames
- Standardized format with white background
- Optimized for AI training
- Ready for e-commerce UIs
Coverage & quality
Is your brand covered?
Check the data coverage on the page for your selected tier.
See the tiers & pricing →Trusted by data scientists across the world
Hugging Face

Who it’s for
This Dataset is Built For You
From recommendation models to formulation experts, teams use the same clean reference to skip months of data wrangling.
E-commerce & Retail
Enrich your storefront and search results using our beauty products data. Boost conversion and lower return rates with expert ingredient tooltips and safety badges.
What you'd build:
- Ingredient-rich product pages to generate free organic leads
- Meaningful product page with ingredient lists to increase conversions
- Comedogenicity & irritancy filtering for sensitive skin
- Trust signals & clean-beauty safety badges
Beauty-Tech Founders
Stop wasting months gathering data. Launch your routine-builder or skin-checker app on day one.
What you'd build:
- Ingredient conflict logic based on comprehensive INCI ingredients list
- Personalized routine builders using comedogenic & irritancy ratings
- Skin-type matching engines
- Filter-by-ingredient systems for 180,000 products
Market Research & R&D
Benchmark global formulations. Track ingredient trends across 180,000 beauty products and identify market gaps with our beauty products CSV dataset.
What you'd build:
- Global competitor product benchmarking
- Ingredient & formulation trends in the beauty products JSONL dataset
- AI & Machine Learning training to decode packaging-to-ingredient correlations
- High-accuracy computer vision models for automated retail audits
We were sizing up a 4-month web-scraping sprint to build out our skincare routine-builder app. Buying this dataset allowed us to map the products on day one. The INCI normalization alone saved our engineering team weeks of data wrangling. The image archive was perfectly standardized on a pure white background, meaning we didn’t have to hire editors. A massive win.
RohanFounder · Skincare Specialist AppIntegration
How It Works
License, download, and drop the data into the stack you already use. No vendor SDK required.
License the dataset
Pick the format, SQLite, CSV or JSONL, that fits your use case. Full data schema ship in the same bundle.
├── products.csv
├── ingredients.csv
├── product_ingredients.csv
├── products.jsonl
├── beauty.db
├── schema.md
└── images_{1-7}.zipIntegrated in minutes
Import the JSONL or CSV directly into your database, vector store, or Python environment. The schema is flat and clean, ready for analysis on day one.
-- Find fragrance-free products with a key active
SELECT name, brand, image_name
FROM products
WHERE contains_fragrance = false
AND ingredients @> '[{"rating": "direct actives"}]'
ORDER BY brand ASC
LIMIT 5;Build what your users see
Render the same record as live product UI with safety badges, ingredient ratings, conflict checks. Once licensed, this layer is yours.
The Ordinary
Niacinamide 10% + Zinc 1%
Example UI rendered from one dataset record. After licensing, this layer is yours to design.
Licensing
Choose Your Tier
One-time license for 180,000 products, yours forever.
Data refreshes at $0.0025/data point · Dataset licensees lock in early supporter API pricing
Products & Ingredients
Comprehensive product data with ingredient functionality and comedogenicity.
- 180,000 products, including brand, name, category, origin & safety flags
- Normalized ingredient dictionary, including irritancy, comedogenicity, ratings, functions
- JSONL, SQLite and CSV formats
- Full schema documentation
Questions
Frequently Asked Questions
The essentials below. See the full FAQ for coverage, licensing, and the API.
What is included in the dataset?
The data comes in three tiers, all built on the same 180,000 products with brand names, product category, country of origin, and safety flags (fragrance, drying alcohol, parabens, sulfates, silicones). Basic Product Index adds a flat, label-order ingredient list per product. Products & Ingredients adds a normalized ingredient dictionary (irritancy, comedogenicity, ratings, functions) with a product↔ingredient link table. The Complete Archive is everything in Products & Ingredients plus the full product-image archive (~18GB, 180,000 images).
How is the data licensed?
Every tier includes an indefinite commercial license to the snapshot you purchase. You may integrate the data into your own app, run queries, and display the product images and information to your end-users. However, you may not redistribute, resell, or publicly host the raw files (CSV/JSONL/SQLite/images) for third parties to download. Future updates are sold separately (refer to the next question).
How often is the dataset updated, and how do I stay current?
We run continuous background audits and refresh the master dataset ad hoc, at least once a month (most recent: 25 June 2026). The dataset you buy is a static snapshot at the time of purchase. You have two ways to stay current: (1) Differential refreshes, where you buy only the data points that changed since your last purchase at $0.0025 per updated data point. (2) The hosted API for real-time freshness.
Will there be a live API?
Yes. A hosted REST API for real-time lookups is launching Fall 2026. It will support search by name, filtering by ingredient, batch requests, and a partner feedback loop to flag products for inclusion. Final pricing will be announced closer to launch; by licensing the full dataset today, you lock in early supporter-only rates and priority access.
Get started
The Definitive Skincare & Beauty Products Dataset
A massive, normalized database of 180,000 skincare and makeup products for market researchers, e-commerce brands, and developers. Access high-quality metadata in CSV and JSONL formats with INCI ingredients and 18GB of high-res product images.