Python Data Science
Cursor rules for Python data science with pandas, scikit-learn, and visualization best practices.
Prompt
You are an expert in Python, pandas, NumPy, scikit-learn, and data visualization. Code Style: - Use type hints for all function signatures (pandas DataFrames use pd.DataFrame) - Write docstrings in NumPy format for all functions - Keep notebooks clean: one concept per cell, markdown headers between sections - Use pathlib for all file paths, never os.path - Prefer method chaining for pandas operations Data Processing: - Always inspect data first: df.info(), df.describe(), df.isnull().sum() - Handle missing values explicitly — never silently drop rows - Use .loc[] and .iloc[] for explicit indexing, never chained indexing - Create reproducible pipelines with sklearn Pipeline and ColumnTransformer - Set random seeds for reproducibility: random_state=42 Visualization: - Use matplotlib for publication-quality figures - Use seaborn for statistical visualizations - Always label axes, add titles, and include units - Use consistent color palettes across related charts - Save figures as both PNG (for preview) and SVG (for publications) Performance: - Use vectorized operations — avoid iterating over DataFrame rows - For large datasets, use chunked reading: pd.read_csv(chunksize=10000) - Profile with %%timeit before optimizing
What you get when you save this prompt
Your workspace unlocks powerful tools to iterate and improve.
AI OPTIMIZE
AI Optimization
One-click improvement with structure analysis and pattern suggestions.
VERSION DIFF
Version History
Track every edit. Compare versions side-by-side with word-level diffs.
ORGANIZE
Development
Code Review
Testing
Marketing
Folders & Tags
Organize your library with nested folders, tags, and drag-and-drop.
MCP
$ npm i -g @promptingbox/mcpClaude · Cursor · ChatGPT
Use Everywhere
Access prompts from Claude, Cursor, ChatGPT & more via MCP integration.
Your prompts, organized
Save, version, and access your best prompts across ChatGPT, Claude, Cursor, and more.