Suffyan Asad (@suffyan_asad)'s Twitter Profile
Suffyan Asad

@suffyan_asad

Data Engineer, passionate about big data and processing big data on the cloud.

ID: 372890207

Link: https://www.linkedin.com/in/suffyan-asad-421711126

Joined: 13-09-2011 16:17:02

1.1K Tweets

73 Followers

236 Following

Suffyan Asad (@suffyan_asad)

Learn the use of Window Functions for #TimeSeriesAnalysis in #PySpark with my #Medium article! 🚀 Dive into practical Python code examples and enhance your data analysis skills. 📊🐍 Read & share for insightful tips: medium.com/@suffyan.asad1… #DataScience #ApacheSpark #Python

Suffyan Asad (@suffyan_asad)

Learn to work with Apache #Spark UDAFs with ease! 🚀 Check out my #Medium article for an in-depth introduction, with Scala code examples. Aimed at #DataEngineers and #BigData enthusiasts. Read, learn, and share your thoughts! medium.com/@suffyan.asad1… #DataScience #ApacheSpark

Alex Xu (@alexxubyte)

How do SQL Joins Work? The diagram below shows how 4 types of SQL joins work in detail. 🔹 INNER JOIN Returns matching rows in both tables. 🔹 LEFT JOIN Returns all records from the left table, and the matching records from the right table. 🔹 RIGHT JOIN Returns all records

Suffyan Asad (@suffyan_asad)

🌟Read my Medium article “A Deeper Look into Spark User-Defined Functions.” Dive into UDFs with practical #Python examples for handling complex data types like arrays, maps, structs. Ideal for beginners & pros! 🔗 medium.com/@suffyan.asad1… #spark #BigData #Analytics #DataScience

Suffyan Asad (@suffyan_asad)

🌟 Announcing my new Medium article, “Spark Essentials: A Guide to Setting Up, Packaging, and Running PySpark Projects.” It covers setting up PySpark jobs, packaging them, and running them on clusters. medium.com/@suffyan.asad1… #spark #Python #bigdata #DataScience

Alex Xu (@alexxubyte)

How can Cache Systems go wrong? The diagram below shows 4 typical cases where caches can go wrong and their solutions. 1. Thundering herd problem: This happens when a large number of keys in the cache expire at the same time. Then the query requests directly hit the database, which

Alex Xu (@alexxubyte)

My recommended materials for cracking your next technical interview Coding - Leetcode - Cracking the coding interview book - Neetcode System Design Interview - System Design Interview Book 1, 2 by Alex Xu, Sahn Lam - Grokking the system design by Design Guru - Design

Bytebytego (@bytebytego)

CAP theorem: one of the most misunderstood terms The CAP theorem is one of the most famous terms in computer science, but I bet different developers have different understandings. Let’s examine what it is and why it can be confusing. CAP theorem states that a distributed

Alex Xu (@alexxubyte)

Almost every software engineer has used Git before, but only a handful know how it works. To begin with, it's essential to identify where our code is stored. The common assumption is that there are only two locations - one on a remote server like Github and the other on our

Suffyan Asad (@suffyan_asad)

Just published an article on Medium covering setting up and running Spark projects with Scala and sbt. For beginners and experts alike. Check it out: medium.com/@suffyan.asad1… #Spark #Scala #Java #DataEngineering #BigData #DataScience

Suffyan Asad (@suffyan_asad)

🚀 Sharing my guide on Medium, “Beginner’s Guide to Spark UI: How to Monitor and Analyze Spark Jobs.” This article covers the basics of Spark UI and is aimed at beginners. Read here👉🏻: medium.com/@suffyan.asad1… #SparkUI #ApacheSpark #BigData #DataAnalytics #Spark #Python #PySpark

Suffyan Asad (@suffyan_asad)

🚀 Want to explore #SQL window functions? This beginner-friendly guide covers static & sliding windows, and has code examples. Learn more: medium.com/@suffyan.asad1… #RDBMS #DataAnalytics #databases

Suffyan Asad (@suffyan_asad)

Want to learn blazing-fast ClickHouse? 🚀 Explore its key features, MergeTree engines, distributed setups & Materialized Views. Includes references for further study! medium.com/@suffyan.asad1… #ClickHouse #DataAnalytics #DataWarehousing #SQL #bigdata

Suffyan Asad (@suffyan_asad)

Want to learn #PySpark? 🚀 My article covers PySpark project setup, local testing, dependencies, packaging & submitting to a Spark cluster — for beginners! 💡 #DataEngineering #Spark #Python Read here👉🏻: medium.com/@suffyan.asad1…

Suffyan Asad (@suffyan_asad)

🚀 Want to boost Spark job performance & avoid double runs for computing data statistics? Learn how Observations (Spark 3.3.0+) help compute stats in one pass, saving time! Read here👉🏻: medium.com/@suffyan.asad1… #Spark #Observations #BigData #DataEngineering #PySpark

Suffyan Asad (@suffyan_asad)

🚀 Want to learn the basics of processing Big Data on AWS? Part 1 of my guide covers EMR basics and cluster creation via the AWS UI, CLI, & Python (boto3). Read more here: medium.com/@suffyan.asad1… #ApacheSpark #AmazonEMR #DataEngineering #AWS