Duplicate customers, inconsistent dates, broken postcodes, mixed formats — I clean, deduplicate, standardise and validate spreadsheets, CRM exports and entire databases using Python (Pandas) and SQL. GDPR-compliant masking and anonymisation available, applied with the same approach used to secure 600+ enterprise tables.
What I Offer
The classic rescue job: one or more Excel/CSV files full of duplicates, inconsistent names, mixed date formats and stray characters — returned clean, consistent and import-ready.
Quality rules, constraint repair and cleansing run directly inside your database — SQL Server, PostgreSQL or MySQL — so bad data stops at the source instead of being patched downstream.
Share data with vendors, analysts or test environments without exposing personal information — dynamic masking, pseudonymisation and fully anonymised dataset generation, GDPR-aligned.
Raw data turned into model-ready datasets: outlier handling, missing-value strategy, encoding and normalisation — delivered as a documented, repeatable Python pipeline, not a one-off file.
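To make the spreadsheet rescue job above concrete, here is a minimal sketch of deduplication and date standardisation. It uses only the Python standard library so it runs anywhere; real jobs would typically use Pandas. The column names (`name`, `email`, `signup_date`), the sample rows and the list of accepted date formats are assumptions made for illustration, and ambiguous dates such as 03/02/2024 are read day-first.

```python
from datetime import datetime

# Assumed input formats, tried in order. Day-first is an assumption (UK-style dates).
DATE_FORMATS = ("%Y-%m-%d", "%d/%m/%Y", "%d-%m-%Y", "%d %b %Y")


def normalise_date(raw):
    """Return the date as ISO 8601 (YYYY-MM-DD), or None if no format matches."""
    text = raw.strip()
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text, fmt).date().isoformat()
        except ValueError:
            continue
    return None


def clean_rows(rows):
    """Tidy names, emails and dates, then drop duplicates keyed on email."""
    seen = set()
    cleaned = []
    for row in rows:
        email = row["email"].strip().lower()
        if email in seen:
            continue  # keep only the first occurrence of each email
        seen.add(email)
        cleaned.append({
            "name": " ".join(row["name"].split()).title(),
            "email": email,
            "signup_date": normalise_date(row["signup_date"]),
        })
    return cleaned


# Hypothetical sample data: the first two rows are the same customer.
sample = [
    {"name": "  jane   SMITH ", "email": "Jane.Smith@example.com ", "signup_date": "03/02/2024"},
    {"name": "Jane Smith", "email": "jane.smith@example.com", "signup_date": "2024-02-03"},
    {"name": "omar khan", "email": "omar@example.com", "signup_date": "5 Mar 2024"},
]
cleaned = clean_rows(sample)
```

Keying duplicates on a normalised email is only one possible matching rule; which fields identify "the same customer" is agreed per dataset.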
Process
Email a small sample of your data (or just describe it). I’ll review the issues and reply with a fixed quote and turnaround — usually same day.
Cleaning runs through scripted, repeatable Python/SQL steps — never manual find-and-replace — so every change is logged and reversible.
You get the cleaned dataset, a summary of what changed, and (on request) the script itself so you can re-run it whenever new data arrives.
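As a rough illustration of what "scripted, logged and reversible" can look like, the sketch below runs a list of small cleaning steps in order and records the row counts before and after each one. The step names, the `email` column and the shape of the change log are assumptions for the example, not the exact scripts used on a given job.

```python
import logging

logging.basicConfig(level=logging.INFO, format="%(message)s")
log = logging.getLogger("cleaning")


def strip_whitespace(rows):
    """Trim leading and trailing whitespace from every (string) value."""
    return [{key: value.strip() for key, value in row.items()} for row in rows]


def drop_blank_emails(rows):
    """Remove rows whose email is empty."""
    return [row for row in rows if row["email"]]


def lowercase_emails(rows):
    """Lower-case the email column."""
    return [{**row, "email": row["email"].lower()} for row in rows]


# Hypothetical step list; a real job would have its own steps.
STEPS = [strip_whitespace, drop_blank_emails, lowercase_emails]


def run_pipeline(rows, steps=STEPS):
    """Apply each step in order and record what it did.

    Every step returns new rows, so the input list is never modified
    and the original data stays available for comparison or rollback.
    """
    change_log = []
    for step in steps:
        rows_in = len(rows)
        rows = step(rows)
        entry = {"step": step.__name__, "rows_in": rows_in, "rows_out": len(rows)}
        log.info("%(step)s: %(rows_in)d rows in, %(rows_out)d rows out", entry)
        change_log.append(entry)
    return rows, change_log


original = [
    {"name": " Ann ", "email": " ANN@Example.com "},
    {"name": "Bob", "email": "   "},
]
cleaned, change_log = run_pipeline(original)
```

Because the same script can be pointed at next month's export, the change log doubles as the summary of what changed.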
Why Me
Secured and masked 600+ tables of enterprise data with Microsoft Purview, dynamic masking and role-based access — your data is handled the same way.
Automated validation once flagged 20,000+ bogus records across 345 organisations’ submissions — scripted checks find what manual review can’t.
Every job is a script, not a manual edit. Re-run it next month on fresh data, or have me schedule it as an automated pipeline.
Most spreadsheet jobs are returned within 24–48 hours at a price agreed upfront from your sample — no hourly surprises.
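For readers curious about the masking and pseudonymisation mentioned above, here is a minimal standard-library sketch. It shows keyed hashing (HMAC-SHA256) to produce a stable token and a simple static mask for emails. This is not the Microsoft Purview or database-level dynamic masking setup itself; the key, field names and token length are assumptions for illustration.

```python
import hashlib
import hmac

# Hypothetical key for the example; a real key is kept separately from the shared data.
SECRET_KEY = b"replace-with-a-real-secret"


def pseudonymise(value, key=SECRET_KEY):
    """Map a value to a stable token using keyed hashing (HMAC-SHA256)."""
    normalised = value.strip().lower().encode("utf-8")
    return hmac.new(key, normalised, hashlib.sha256).hexdigest()[:16]


def mask_email(email):
    """Static masking: keep the first character and the domain, hide the rest."""
    local, _, domain = email.partition("@")
    return f"{local[:1]}***@{domain}"


# Hypothetical record being prepared for sharing with a vendor.
record = {"customer": "jane.smith@example.com", "total": 42.5}
shared = {
    "customer_token": pseudonymise(record["customer"]),
    "customer_masked": mask_email(record["customer"]),
    "total": record["total"],
}
```

The same input always yields the same token, so shared tables can still be joined on it. Anyone holding the key can re-link tokens to people, which is why this counts as pseudonymisation rather than full anonymisation.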
Questions