CV Evaluation — Junior Data Engineering Role

Candidate: Tommaso Crippa Entry-level / Graduate

1. Overall Structure & Layout — 7/10

Clean single-column layout with logical ordering (Education → Experience → Projects → Skills). The header is minimal but missing a targeted summary for DE roles, meaning recruiters must infer fit from bullet points alone.

Feedback:

Suggested header summary:

“HPC graduate with hands-on experience building data pipelines and ML-backed analytics tools. Proficient in Python, SQL, Kafka, and distributed computing. Seeking a Junior Data Engineering role to apply my skills in building scalable data infrastructure and ETL processes.”


2. Impact of Achievements (Quantified Results) — 4/10

Almost every bullet describes what was done, not what was achieved. No throughput figures, dataset sizes, accuracy metrics, latency improvements, or user numbers appear anywhere. DE hiring managers look for scale and impact.

Feedback:

Before:

“Processing high-resolution multivariate time series metrics with heuristics and ML approaches to detect anomalies.”

After:

“Built an anomaly detection pipeline ingesting 80+ multivariate HPC metrics at sub-minute resolution; reduced false-positive rate by ~30% vs. threshold-only baseline on 3 real-world workloads.”


3. ATS Keyword Optimisation — 5/10

Strong HPC and ML keywords are present, but core Data Engineering terminology is sparse. A typical JDE job posting scans for: ETL/ELT, data pipeline, data warehouse, dbt, Airflow, orchestration, batch vs streaming, schema design, SQL, BigQuery/Snowflake/Redshift, Docker, Kubernetes, CI/CD. Several are entirely absent.

Feedback:

WhatsApp Wrapped bullet rewrite for ATS:

“Designed and deployed an end-to-end ETL pipeline in Python (pandas) that parses, transforms, and visualises WhatsApp chat data; served via a Flask REST API on Render with a JS/HTML frontend — 500+ public users.”


4. Skills Section Relevance — 6/10

Good honest proficiency labelling, and DVC/SLURM show pipeline and infra awareness. However, the section is HPC-first rather than DE-first, and key DE staples (SQL, cloud platforms, orchestration tools) are missing.

Feedback:

Suggested skills restructure:

Data Engineering : Python (Advanced), SQL (Intermediate), Spark (Academic), Kafka (Academic), dbt (Familiar)
Infrastructure   : Git (Advanced), Docker (Familiar), SLURM (Advanced), DVC (Advanced)
Cloud            : GCP / BigQuery (Familiar)   ← or whichever applies

Summary

  Detail
Strengths Strong academic pedigree (EUMaster4HPC, cum laude); real deployed project (WhatsApp Wrapped); HPC + ML pipeline experience; honest proficiency labels; GitHub links on every project
Weaknesses No quantified impact anywhere; SQL completely missing; no DE-targeted summary/headline; ETL/pipeline language absent; ATS-hostile icon characters
Priority fixes Add SQL + cloud platform to skills; quantify at least 3 bullets; write a DE-targeted header summary; reframe project bullets with ETL/pipeline language; fix icon/symbol characters

Top 3 Highest-Impact Changes

1. Add SQL and reframe projects with DE vocabulary

SQL is screened for in virtually every JDE job description. Its complete absence will cause ATS rejection before a human reads the CV. Adding it to skills and rewriting 2–3 project bullets to use “pipeline”, “ETL”, and “data transformation” will dramatically increase both ATS pass-rate and recruiter relevance signals.

2. Quantify at least 3 achievement bullets

Entry-level candidates who include numbers (dataset size, accuracy, users, speedup) stand out significantly from peers who only describe tasks. Even approximate figures (“~50k rows”, “3 production clusters”, “5 events, 300+ attendees”) shift bullets from responsibility statements to impact statements — which is what drives recruiter callbacks.

3. Add a 2-line DE-targeted profile summary

Most recruiters spend under 10 seconds on an initial scan. A clear summary at the top (“Junior Data Engineer Python · Spark · Kafka · HPC pipelines”) immediately signals fit and primes the reader to interpret your HPC experience as DE-relevant rather than niche. Without it, the CV reads as an HPC/ML profile, not a DE one.