Posts

SQL Server Roadmap for Data Engineers (7 Days)

Dataset Used: dbo.orders, dbo.returns, dbo.people
Day 1 - SQL Fundamentals
Topics: SELECT, WHERE, ORDER BY, DISTINCT, TOP, GROUP BY, HAVING, Aggregate Functions
Practice Questions: Total Sales; Total Profit; Total Orders; Orders by Region; Sales by Region; Profit by Category; Top 10 Products by Sales; Customers with Sales > 5000; States with Negative Profit; Average Sales by Category
Day 2 - Joins & Data Modelling
Topics: INNER JOIN, LEFT JOIN, RIGHT JOIN, FULL JOIN, SELF JOIN
Practice Questions: Find Returned Orders; Find Non-Returned Orders; Return Percentage by Region; Sales by Regional Manager; Most Returned Products; Regions without Returns; Orders with Manager Information; Sales by Person; Profit by Person; Customer Sales with Return Flag
Day 3 - CTEs ...

Azure Data Engineering Roadmap

Azure Data Engineering Roadmap (8 Weeks)
Week 1: Azure + ADF Foundations
Goal: Confidently explain Azure resource hierarchy and build your first ADF pipelines.
Day 1: Azure subscription, resource group, management group
Day 2: Azure AD basics, tenant, users, groups, managed identity
Day 3: Blob vs ADLS Gen2, containers, folders, security basics
Day 4: ADF workspace, linked services, datasets, Integration Runtime
Day 5: Simple copy pipeline
Day 6: Copy activity settings
Day 7: Mini project - copy different file types
Week 2: Advanced ADF Copy & Loops
Goal: Master copy patterns and looping over multiple files.
Wildcards and recursive copy
Column mapping and sink settings
ForEach activity
Dynamic paths u...

Prompts & Interview Preparations

Prompt 1: Classifier Prompt (Decision Step)
-------------------------------------------
You are a resume analyzer.
INPUT: [RESUME_TEXT]
TASK: Decide if the candidate is a Fresher or Experienced.
RULES:
- If experience is 0 years or no company name, label "Fresher".
- If intern and company name, consider as "Fresher".
- If experience is 1+ years or company/project work, label "Experienced".
OUTPUT FORMAT (only this):
Candidate_Type: Fresher OR Experienced
Reason: (1 line)
Prompt 2A: Fresher Interview Questions (Branch A)
------------------------------------------------
You are an interview trainer for freshers.
INPUT: [RESUME_TEXT]
TASK: Create 10 basic interview questions.
RULES:
- 4 HR questions
- 6 basic technical questions
- Simple and beginner friendly
OUTPUT FORMAT:
HR Questions:
1) ...
2) ...
3) ...
4) ...
Technical Questions:
1) ...
2) ...
...
________________________________________
Prompt 2B: Experienced Interview Questions (Branch B)
-----------...

Azure Data Engineering Project - End-to-End Azure Data Platform

In my recent project, I built an end-to-end Azure data platform that ingested semi-structured JSON data from multiple APIs and delivered analytics-ready datasets modelled in a star schema within Azure Synapse. The solution was metadata-driven, secure, and well governed, and designed to scale as new data sources were onboarded.
For ingestion, I used Azure Data Factory and followed a metadata-driven approach rather than hardcoding logic into pipelines. I maintained control and watermark tables that defined how each API source should be processed, including source name, API endpoint, authentication method, pagination logic, incremental load column, target schema and table, load type, and whether the data contained PII. The watermark tables stored the last successful load timestamp or record ID, pipeline run status, and load timings. At runtime, ADF pipelines read from these tables and updated the watermark after successful execution, enabling true incremental loading and...
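The watermark pattern described above can be sketched in plain Python. This is only an illustration of the idea, not the ADF pipeline from the project: `SourceConfig`, `run_incremental_load`, and the `fetch_records` / `write_records` callables are hypothetical names, and an in-memory dict stands in for the watermark table.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass
class SourceConfig:
    """One row of a hypothetical control table (field names are assumptions)."""
    source_name: str
    incremental_column: str
    target_table: str
    contains_pii: bool


def run_incremental_load(config, watermarks, fetch_records, write_records):
    """Load only records newer than the stored watermark, then advance it.

    `watermarks` is a dict standing in for the watermark table.
    `fetch_records` and `write_records` are caller-supplied callables.
    Returns the number of records written.
    """
    last_loaded = watermarks.get(config.source_name)  # None on the first run
    records = fetch_records(config, last_loaded)
    new_records = [
        r for r in records
        if last_loaded is None or r[config.incremental_column] > last_loaded
    ]
    if not new_records:
        return 0
    write_records(config.target_table, new_records)
    # Advance the watermark only after the write has succeeded, so a failed
    # run is retried from the previous position.
    watermarks[config.source_name] = max(
        r[config.incremental_column] for r in new_records
    )
    return len(new_records)


if __name__ == "__main__":
    config = SourceConfig("orders_api", "updated_at", "stg.orders", False)
    source_rows = [
        {"id": 1, "updated_at": datetime(2024, 1, 1, tzinfo=timezone.utc)},
        {"id": 2, "updated_at": datetime(2024, 1, 2, tzinfo=timezone.utc)},
    ]
    watermarks, sink = {}, []
    loaded = run_incremental_load(
        config,
        watermarks,
        fetch_records=lambda cfg, since: source_rows,
        write_records=lambda table, rows: sink.extend(rows),
    )
    print(loaded, watermarks)
```

Running the same load a second time with unchanged source data writes nothing, because every record is at or below the stored watermark.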

Master Databricks Asset Bundles Through Hands-On Practice

15 min read | 100% Practical Guide
Forget theory. Forget abstract examples. This is a hands-on, build-as-you-learn guide to mastering YAML through the lens of Databricks Asset Bundles (DABs). By the end of this post, you'll go from never writing YAML to confidently deploying production-grade data pipelines as code.
🎯 What You'll Build: A complete Databricks workspace configuration including jobs, clusters, notebooks, and permissions, all defined in YAML and deployable with a single command.
Level 0: YAML Basics (BEGINNER)
The Golden Rules
Rule #1: YAML uses spaces for indentation, never tabs. Standard is 2 spaces per level.
Rule #2: YAML is case-sensitive. Name ≠ name
Rule #3: Indentation = Structure. It defines parent-child relationships. ...
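Rule #1 is easy to check mechanically. The sketch below is a naive, standard-library-only checker, not a YAML parser: `check_yaml_indentation` is a hypothetical helper, and the 2-space step is the convention mentioned above rather than something YAML itself requires, so valid files (for example ones with block scalars) may still be flagged.

```python
def check_yaml_indentation(text, indent_width=2):
    """Return (line_number, message) pairs for suspicious indentation.

    Flags tabs in leading whitespace and indents that are not a multiple
    of `indent_width`. Blank lines are skipped.
    """
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        if not line.strip():
            continue
        # Leading whitespace is whatever lstrip() would remove.
        leading = line[: len(line) - len(line.lstrip())]
        if "\t" in leading:
            problems.append((number, "tab used for indentation"))
        elif len(leading) % indent_width != 0:
            problems.append(
                (number, f"indent of {len(leading)} spaces is not a multiple of {indent_width}")
            )
    return problems


if __name__ == "__main__":
    sample = "job:\n  name: demo\n\ttasks: []\n   bad: 1\n"
    for number, message in check_yaml_indentation(sample):
        print(f"line {number}: {message}")
```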

YAML Mastery for Data Engineers: The Configuration Language You Can't Ignore

YAML Mastery for Senior Data Engineers: The Configuration Language You Can't Ignore
Published on January 21, 2026 | 8 min read
As a senior data engineer, you've mastered SQL, conquered Python, and tamed distributed systems. But there's one skill that quietly determines whether your pipelines run smoothly or become maintenance nightmares: YAML proficiency. This humble configuration language is the backbone of modern data engineering workflows, and mastering it is non-negotiable.
Why YAML Matters in Data Engineering
YAML (YAML Ain't Markup Language) is a human-readable data serialization format that's become the de facto standard for configuration management. Unlike JSON or XML, YAML prioritizes readability while maintaining powerful data structuring capabilities.
Where You'll Use ...

Terraform for Senior Data Engineers

Senior Data Engineer • DevOps for Data
Terraform for Data Engineers: Why You Must Know It (and How You’ll Use It)
Terraform is not “infra-only.” For modern data platforms (Azure / Databricks / Snowflake / Fabric / AWS), Terraform becomes the safest way to build, version, review, and reproduce environments across Dev → Test → Prod.
Audience: Beginner → Advanced
Outcome: Practical usage + interview-ready
Includes: 10 most-used commands/scripts
Includes: STAR interview Q&A
Contents
TL;DR
What Terraform is (in one minute)
Why it matters for Data Engineers
Where Terraform is useful in a Data Engineer’s life
Practical patterns you should follow
10 most used Terraform commands/scripts
Interview questions + crisp STAR answers
Quick checklist for “Terraform-ready” Data Engineers
TL;DR
Terraform = Infrastr...