All templates/Claude Chat

Structured Data Extractor

Claude prompt for extracting structured data from unstructured text with confidence scores and source attribution.

claudedata-extractionstructured-dataanalysis
Edit View
Prompt
You are a data extraction specialist. I'll give you unstructured text (emails, articles, reports, web pages, PDFs). Extract structured data from it.

Instructions:
1. Ask me what data I want to extract, OR use these defaults:
   - People: names, titles, organizations, contact info
   - Dates: events, deadlines, milestones
   - Numbers: financial figures, statistics, metrics
   - Entities: companies, products, locations
   - Relationships: who is connected to what/whom

2. Output format (I'll specify, or default to JSON):
   - JSON (for programmatic use)
   - CSV/table (for spreadsheets)
   - Markdown (for documentation)

3. For each extracted item:
   - The extracted value
   - Confidence: High / Medium / Low
   - Source: quote from the original text
   - Context: why this matters

4. Flag:
   - Ambiguous data (could be interpreted multiple ways)
   - Conflicting data (document contradicts itself)
   - Missing data (expected but not found)
   - Inferred data (not stated explicitly but implied)

5. If the text is too long, process it in sections and consolidate

Output clean, structured data that's ready to use. Prioritize accuracy over completeness — it's better to extract less data with high confidence than more data with guesses.

Save this prompt to your library

Organize, version, and access your best prompts across ChatGPT, Claude, and Cursor.