Questions

Frequently Asked Questions

Everything teams ask before licensing, grouped by topic. Still unsure? email us.

The dataset

What is included in the dataset?

The data comes in three tiers, all built on the same 180,000 products with brand names, product category, country of origin, and safety flags (fragrance, drying alcohol, parabens, sulfates, silicones). Basic Product Index adds a flat, label-order ingredient list per product. Products & Ingredients adds a normalized ingredient dictionary (irritancy, comedogenicity, ratings, functions) with a product↔ingredient link table. The Complete Archive is everything in Products & Ingredients plus the full product-image archive (~18GB, 180,000 images).

What formats do you deliver?

Every tier ships as industry-standard CSV, JSON line-delimited (JSONL), and a standalone SQLite database. Products & Ingredients and the Complete Archive use nested JSONL, relational CSVs, and SQLite with foreign keys and indexes, so they drop straight into Postgres, MongoDB, or data science environments like Pandas. A full data dictionary and schema guide are included with every tier.

How accurate is the ingredient analysis?

From the Products & Ingredients tier up, our normalized dictionary follows a research-backed schema mapping ingredients to their functional classes (e.g., humectants, exfoliants) and an expert safety rating, with deep chemical enrichment: CAS, EC, IUPAC, and Ph. Eur. names plus common synonyms. We use standardized INCI names and include common aliases to keep your search logic robust. Because we aggregate global public data, brand-name normalization is ongoing; we continuously merge aliases, but you may occasionally see variants. (The entry-level Basic Product Index ships a flat ingredient list without this analysis.)

How are the images mapped to the products?

Every product record contains a unique image_name field (e.g., 1042.jpeg). This ID matches the filename in the image archive, so you can link metadata to high-res imagery with a simple string-match. You host the ~18GB archive on your own infrastructure and may display the images directly to end users in your app. The image archive is included exclusively with the Complete Archive tier.

Coverage & quality

Is my brand covered?

We track more than 24,000 brands across 180,000 products. The fastest way to check is the live coverage tool on any product page, which confirms specific brands instantly.

What product categories are covered?

The catalogue is skincare-led. Approximate mix: • Skincare: 62% • Haircare: 12% • Suncare: 8% • Body & bath: 8% • Makeup: 7% • Other: 3%

How complete are the fields?

We are transparent about coverage rather than letting you discover it mid-evaluation. Weighted by ingredient frequency across the dataset: • Products with an ingredient list: ~100% • Ingredient functions: ~94% • CAS number: ~86% • EC number: ~75% • Comedogenicity rating: ~23% • Irritancy rating: ~23% • Concentration: <1% Concentration is intentionally sparse: we only capture it where a manufacturer publicly discloses it, and we never estimate or extrapolate. The free sample reflects the same coverage you get in the full set.

Licensing

How is the data licensed?

Every tier includes an indefinite commercial license to the snapshot you purchase. You may integrate the data into your own app, run queries, and display the product images and information to your end-users. However, you may not redistribute, resell, or publicly host the raw files (CSV/JSONL/SQLite/images) for third parties to download. Future updates are sold separately (refer to the next question).

Updates & the API

How often is the dataset updated, and how do I stay current?

We run continuous background audits and refresh the master dataset ad hoc, at least once a month (most recent: 25 June 2026). The dataset you buy is a static snapshot at the time of purchase. You have two ways to stay current: (1) Differential refreshes, where you buy only the data points that changed since your last purchase at $0.0025 per updated data point. (2) The hosted API for real-time freshness.

Will there be a live API?

Yes. A hosted REST API for real-time lookups is launching Fall 2026. It will support search by name, filtering by ingredient, batch requests, and a partner feedback loop to flag products for inclusion. Final pricing will be announced closer to launch; by licensing the full dataset today, you lock in early supporter-only rates and priority access.

How do API limits and overage work?

You can cap usage at your monthly limit, so any calls beyond it are declined as over-limit until your next cycle, or allow metered overage billed per call. Email alerts are supported, and batch requests are metered per item processed (a batch of 50 products counts as 50 calls). Final rate limits will be set at launch.

Buying & support

Can I see a sample before buying?

Absolutely. Each tier has its own 1000-product sample pack you can download instantly from this site. Every sample includes the full schema in CSV and JSONL formats so you can test your import scripts before committing to a tier.

What is your refund policy?

Due to the digital nature of the dataset and the immediate access provided upon purchase, all sales are final. Because the data cannot be 'returned' once it has been accessed, we are unable to offer refunds. We strongly recommend downloading the free sample pack for your chosen tier to verify the data quality and schema compatibility before completing your purchase.

Contact

Still need a hand?

Email Us

heythere@tinkerterror.com

Replies typically within 1 business day

For faster answers: mention your use case, the tier you’re considering, and your target stack.

What we can help with

Licensing & Pricing

Tier differences, invoices and payment, refresh pricing, or licensing terms for your use case.

Data & Schema

Field coverage, formats, INCI normalization, rating methodology, or whether the data fits your model.

Existing Licensees

Download issues, schema docs, version updates, or anything else after your purchase.

Hosted API

Want the live API?

A hosted REST API is launching Fall 2026. License the dataset now to lock in early supporter rates, and join the waitlist for early access.

No spam · One email when the API is ready

Ready to get the data?