Data Cleaning Script

Research & Analysischatgpt-prompts-for-data-analysislanguagecolumn_definitionsrow_count

Requiring before/after logging and a summary report turns a cleaning script into an auditable process. Flagging rows for manual review prevents silent data loss.

Prompt

Write a {{language}} script to clean this dataset. Here is the structure:

Columns: {{column_definitions}}
Row count: approximately {{row_count}}
Known issues:
{{data_issues}}

For each issue, implement a cleaning step that:
1. Logs how many rows are affected before the fix
2. Applies the transformation
3. Validates the result

After cleaning, generate a summary report showing:
- Rows before/after
- Columns modified
- Values imputed or removed
- Any rows flagged for manual review

Use {{library}} for data manipulation. Add error handling so the script does not fail silently on unexpected values.

Variables to customize

Why this prompt works

Requiring before/after logging and a summary report turns a cleaning script into an auditable process. Flagging rows for manual review prevents silent data loss.

What you get when you save this prompt

Your workspace unlocks powerful tools to iterate and improve.

AI OPTIMIZE

AI Optimization

One-click improvement with structure analysis and pattern suggestions.

VERSION DIFF

Version History

Track every edit. Compare versions side-by-side with word-level diffs.

ORGANIZE

Development

Code Review

Testing

Marketing

Folders & Tags

Organize your library with nested folders, tags, and drag-and-drop.

MCP

$ npm i -g @promptingbox/mcp

Claude · Cursor · ChatGPT

Use Everywhere

Access prompts from Claude, Cursor, ChatGPT & more via MCP integration.

Your prompts, organized

Save, version, and access your best prompts across ChatGPT, Claude, Cursor, and more.