Back to AstraZeneca Data Engineer interview

Interview question

Optimize Spark job with data skew on a large join

Asked in AstraZeneca Data Engineer interviews

Asked by

59% candidates

Difficulty

Medium

Round

Technical Round

AI synthesized from candidate reports

Back to Questions

Question 1 of 5

Optimize Spark job with data skew on a large join

Medium59% Candidate ReportsTECHNICALAstraZenecaData Engineer

Question

Optimize Spark job with data skew on a large join

High-Level Approach

Start with object lifecycle, then GC phases, then metrics you watch (heap, GC pause, allocation rate).

Optimize Spark job with data skew on a large join —… | InterviewForge AI