HardikDG

@hardikdg.bsky.social

Always learning new technologies and trends | Exploring and sharing Data domain learning and expr.

5 Followers  |  17 Following  |  291 Posts  |  Joined: 31.03.2025  |  1.4099

Latest posts by hardikdg.bsky.social on Bluesky

Along with that, it provides jobs, a catalog, and transformation management under one roof.

09.08.2025 11:00 | 👍 0    🔁 0    💬 0    📌 0

Using AWS for data pipelines and ETL, and need more granular access controls?
AWS Lake Formation is the key.

AWS Lake Formation offers fine-grained access control for your lakehouse. Control access by IAM role, column, or row. One lake, multiple secure users.

09.08.2025 11:00 | 👍 0    🔁 0    💬 1    📌 0
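
A column-level grant could look roughly like the sketch below. It only builds the arguments in the shape that boto3's Lake Formation `grant_permissions` call accepts and does not contact AWS. The helper `column_grant`, the role ARN, and the database, table, and column names are all hypothetical.

```python
def column_grant(role_arn, database, table, columns):
    """Build keyword arguments for a column-level SELECT grant.

    The shape follows boto3's lakeformation `grant_permissions` call;
    every name passed in here is a placeholder for illustration.
    """
    return {
        "Principal": {"DataLakePrincipalIdentifier": role_arn},
        "Resource": {
            "TableWithColumns": {
                "DatabaseName": database,
                "Name": table,
                "ColumnNames": list(columns),
            }
        },
        "Permissions": ["SELECT"],
    }


request = column_grant(
    "arn:aws:iam::123456789012:role/analyst",  # hypothetical IAM role
    "sales_db",                                 # hypothetical database
    "orders",                                   # hypothetical table
    ["order_id", "order_date"],                 # columns the role may read
)

# With AWS credentials configured, this could be sent as:
#   import boto3
#   boto3.client("lakeformation").grant_permissions(**request)
```

The role then sees only the listed columns of the table; row-level restrictions need a separate data filter and are not shown here.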

No matter how advanced AI becomes, even the best LLM can't lift for you.

Strength isn't just physical, it's mental resilience too.

Train both. Don't just train systems.

09.08.2025 10:52 | 👍 0    🔁 0    💬 0    📌 0

Don't just automate work. Automate recovery.

Sleep routines
Movement habits
Tech-free time

Code your life like a system.

08.08.2025 10:22 | 👍 0    🔁 0    💬 0    📌 0

Important rule for metrics trust:
Your business KPIs deserve tested logic.

Use tests like not_null, unique, and accepted_values.

One unnoticed error can break both the KPIs and the trust in them. Prevention is better than cure.

07.08.2025 14:08 | 👍 0    🔁 0    💬 0    📌 0
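
The test names in this post match dbt's built-in generic tests. As a rough idea of what each one checks, here is a minimal plain-Python sketch. It is not dbt's implementation; the functions and the sample `orders` rows are made up for illustration.

```python
def not_null(rows, column):
    """Return the rows where `column` is missing (None)."""
    return [r for r in rows if r.get(column) is None]


def unique(rows, column):
    """Return non-null values of `column` that appear more than once."""
    seen, dupes = set(), set()
    for r in rows:
        value = r.get(column)
        if value is None:
            continue  # nulls are the job of not_null, not unique
        if value in seen:
            dupes.add(value)
        seen.add(value)
    return sorted(dupes)


def accepted_values(rows, column, allowed):
    """Return the rows whose non-null `column` value is outside `allowed`."""
    return [r for r in rows if r.get(column) is not None and r[column] not in allowed]


# Hypothetical sample data with one failure per test.
orders = [
    {"order_id": 1, "status": "paid"},
    {"order_id": 2, "status": "refunded"},
    {"order_id": 2, "status": "unknown"},   # duplicate id, unexpected status
    {"order_id": None, "status": "paid"},   # missing id
]

failures = {
    "not_null(order_id)": not_null(orders, "order_id"),
    "unique(order_id)": unique(orders, "order_id"),
    "accepted_values(status)": accepted_values(orders, "status", {"paid", "refunded"}),
}
for test_name, failed in failures.items():
    print(test_name, "FAIL" if failed else "PASS", failed)
```

Any non-empty result means the KPI built on this table should not be published until the data is fixed.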

Learning for analytics tables:

- Denormalize where it counts.
- For wide reporting tables, fewer joins = faster dashboards.
- Use materialized views for complex joins to save BI tools from doing heavy lifting.

06.08.2025 09:27 | 👍 0    🔁 0    💬 0    📌 0
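
The points above can be sketched with Python's built-in `sqlite3`. SQLite has no materialized views, so `CREATE TABLE ... AS SELECT` is used here as a stand-in for one. The table and column names are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, region TEXT);
    CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL);
    INSERT INTO customers VALUES (1, 'EU'), (2, 'US');
    INSERT INTO orders VALUES (10, 1, 50.0), (11, 1, 25.0), (12, 2, 40.0);

    -- Precompute the join once; a stand-in for a materialized view.
    CREATE TABLE report_orders AS
    SELECT o.order_id, o.amount, c.region
    FROM orders AS o
    JOIN customers AS c ON c.customer_id = o.customer_id;
""")

# The dashboard query reads the wide table and needs no join at all.
revenue_by_region = dict(
    conn.execute(
        "SELECT region, SUM(amount) FROM report_orders GROUP BY region"
    ).fetchall()
)
print(revenue_by_region)
```

The trade-off is freshness: the precomputed table has to be rebuilt or refreshed when the source tables change, which is what a real materialized view automates.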

Building the future? You need future-proof energy.
Less sugar, more sunlight.
Less screen time, more steps.
It's not complex.

06.08.2025 06:03 | 👍 0    🔁 0    💬 0    📌 0

Golden rule for query speed:

Query only what you need.
Avoid loading full tables into memory.

A dashboard query scanning 1 TB was reduced to 80 GB with proper partitioning + column pruning.

Less is faster. Always.

05.08.2025 14:08 | 👍 0    🔁 0    💬 0    📌 0
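
Why partitioning and column pruning cut the scan so much can be seen in a toy model. The sketch below is not a real query engine; the in-memory `table`, the `scan` helper, and the cell count are assumptions made only to illustrate the idea.

```python
# Toy columnar table: partition key (date) -> column name -> values.
table = {
    "2025-08-01": {"order_id": [1, 2], "amount": [10.0, 20.0], "notes": ["a", "b"]},
    "2025-08-02": {"order_id": [3], "amount": [5.0], "notes": ["c"]},
    "2025-08-03": {"order_id": [4, 5], "amount": [7.5, 2.5], "notes": ["d", "e"]},
}


def scan(table, partitions=None, columns=None):
    """Read the selected partitions and columns; return (data, cells_read)."""
    cells_read = 0
    data = {}
    for key, cols in table.items():
        if partitions is not None and key not in partitions:
            continue  # partition pruning: skip the whole partition
        for name, values in cols.items():
            if columns is not None and name not in columns:
                continue  # column pruning: skip columns the query never uses
            data.setdefault(name, []).extend(values)
            cells_read += len(values)
    return data, cells_read


_, full_scan = scan(table)
pruned, pruned_scan = scan(table, partitions={"2025-08-03"}, columns={"amount"})
print(f"full scan read {full_scan} cells, pruned scan read {pruned_scan}")
```

In a warehouse the same effect comes from filtering on the partition column and selecting only the needed columns instead of `SELECT *`.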

Well-being is part of the strategy of every product you build.

Don't be the last to realize it.

05.08.2025 06:37 | 👍 0    🔁 0    💬 0    📌 0

Golden rule for data ingestion pipelines:
Validate, then transform.

Add row count checks, data type assertions, and null audits before every major step.
It's cheaper to fail fast than to fix late.

04.08.2025 14:05 | 👍 0    🔁 0    💬 0    📌 0
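
A minimal sketch of "validate, then transform", assuming the batch arrives as a list of dicts. The `validate` and `transform` helpers, the schema, and the sample batch are hypothetical.

```python
def validate(rows, expected_count, schema, required):
    """Return a list of problems; an empty list means the batch may proceed."""
    problems = []
    # Row count check
    if len(rows) != expected_count:
        problems.append(f"row count {len(rows)} != expected {expected_count}")
    for i, row in enumerate(rows):
        # Null audit on required columns
        for column in required:
            if row.get(column) is None:
                problems.append(f"row {i}: {column} is null")
        # Data type assertions on non-null values
        for column, expected_type in schema.items():
            value = row.get(column)
            if value is not None and not isinstance(value, expected_type):
                problems.append(f"row {i}: {column} is not {expected_type.__name__}")
    return problems


def transform(rows):
    """Example transform step: convert amounts to integer cents."""
    return [{**r, "amount_cents": round(r["amount"] * 100)} for r in rows]


batch = [
    {"id": 1, "amount": 9.99},
    {"id": 2, "amount": "12.50"},  # wrong type: string instead of float
    {"id": None, "amount": 3.00},  # null key
]
problems = validate(
    batch, expected_count=3, schema={"id": int, "amount": float}, required=["id"]
)
if problems:
    print("Rejected before transform:", problems)  # fail fast
else:
    transformed = transform(batch)
```

Running the checks before `transform` means a bad batch stops at the cheapest point, before any downstream table is touched.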

No one ever scaled a company on bad sleep, skipped meals and unhealthy routines.

Ambition is great. But sustainability is smarter.

03.08.2025 12:08 | 👍 0    🔁 0    💬 0    📌 0

Active communication makes dashboards and data useful; silos only create chaos when something changes.

02.08.2025 14:02 | 👍 0    🔁 0    💬 0    📌 0

Golden rule for reliable dashboards:
Document, version, communicate actively.

Renaming a column without notice broke 6 dashboards overnight.

Use schema registries or contracts and never hotfix in production.

02.08.2025 14:02 | 👍 0    🔁 0    💬 1    📌 0
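
A data contract can be as small as a versioned mapping of column names to types that is checked before every deploy. The sketch below assumes such a mapping; `CONTRACT`, `contract_violations`, and the column names are hypothetical, not a specific registry's API.

```python
# Published contract for a table that dashboards depend on (hypothetical).
CONTRACT = {"order_id": "INTEGER", "order_date": "TEXT", "amount": "REAL"}


def contract_violations(contract, actual):
    """Compare the published contract with the table's actual columns."""
    issues = []
    for column, col_type in contract.items():
        if column not in actual:
            issues.append(f"missing column: {column}")
        elif actual[column] != col_type:
            issues.append(f"type changed: {column} {col_type} -> {actual[column]}")
    return issues


# Renaming amount to total_amount shows up as a missing column.
after_rename = {"order_id": "INTEGER", "order_date": "TEXT", "total_amount": "REAL"}
issues = contract_violations(CONTRACT, after_rename)
print(issues)
```

Run as a CI step, a check like this flags the rename before release rather than after the dashboards break.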

Seek excellence, not perfection.

01.08.2025 13:45 | 👍 0    🔁 0    💬 0    📌 0

Golden rule for time zones:

Store timestamps in UTC on the server side. Convert only for display on the front end.
This will save you a lot of trouble later.

One campaign ended 10 hours early because someone used NOW() in the wrong timezone.

In dbt or SQL, normalize time at the source.

01.08.2025 05:30 | 👍 0    🔁 0    💬 0    📌 0
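
A minimal sketch of the store-in-UTC, convert-for-display rule using only Python's standard library. The campaign end time and the fixed `IST` offset are made-up examples; a real application would more likely use `zoneinfo` for named zones with daylight-saving rules.

```python
from datetime import datetime, timedelta, timezone

# Store: create and persist timezone-aware UTC timestamps.
campaign_end_utc = datetime(2025, 8, 1, 18, 30, tzinfo=timezone.utc)
stored = campaign_end_utc.isoformat()  # the value that goes into the database

# Display: convert only at the edge, for the viewer's zone.
IST = timezone(timedelta(hours=5, minutes=30), "IST")  # fixed offset, for illustration
shown = datetime.fromisoformat(stored).astimezone(IST)


def campaign_is_over(now_utc):
    """Compare in UTC, never against a naive local 'now'."""
    return now_utc >= campaign_end_utc


print("stored:", stored)
print("shown: ", shown.isoformat())
print("over now?", campaign_is_over(datetime.now(timezone.utc)))
```

Because both sides of the comparison carry an explicit UTC offset, the server's local timezone setting cannot shift the campaign's end.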

You're monitoring the health of the server and the product.

But what about yours?
Fitness, heart rate, sleep quality, and stress levels.
Track what matters. Your health is what stays with you for life.

01.08.2025 05:13 | 👍 0    🔁 0    💬 0    📌 0

- AWS DMS (Database Migration Service): Once the schema is ready, use DMS to migrate or replicate the data to the AWS database services, based on your use case.

You can either migrate the data in one go or, if needed, keep replicating it to AWS for some time before you shut down your old DB.

31.07.2025 14:00 | 👍 0    🔁 0    💬 0    📌 0

Planning to move from another DB provider to AWS to get the benefits of the cloud and AI?

These are your best friends on the journey.

- AWS Schema Conversion Tool (SCT): It helps verify and convert schema and data from other DB providers to a schema supported by AWS databases (Aurora, RDS, etc.).

31.07.2025 14:00 | 👍 0    🔁 0    💬 1    📌 0

Best practices for WHERE filters:

Wrapping an indexed column in a function inside WHERE usually prevents the database from using the index.
Bad: WHERE YEAR(order_date) = 2023

Good: WHERE order_date BETWEEN '2023-01-01' AND '2023-12-31'

Write code that makes queries faster and cuts waiting time for the end user.

31.07.2025 05:32 | 👍 0    🔁 0    💬 0    📌 0
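
The difference can be observed with SQLite's `EXPLAIN QUERY PLAN`, sketched here with Python's built-in `sqlite3`. SQLite has no `YEAR()` function, so `strftime('%Y', ...)` stands in for it. The table and index names are hypothetical, and the exact plan wording varies between SQLite versions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE orders (id INTEGER PRIMARY KEY, order_date TEXT, amount REAL);
    CREATE INDEX idx_orders_date ON orders (order_date);
""")


def plan(sql):
    """Return the detail column of SQLite's query plan as one string."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql).fetchall()
    return " | ".join(row[3] for row in rows)


# Function applied to the column: the index on order_date cannot be used.
bad_plan = plan("SELECT * FROM orders WHERE strftime('%Y', order_date) = '2023'")

# Plain range on the column itself: the index can be used.
good_plan = plan(
    "SELECT * FROM orders WHERE order_date BETWEEN '2023-01-01' AND '2023-12-31'"
)

print("bad: ", bad_plan)
print("good:", good_plan)
```

One caveat: if `order_date` holds timestamps rather than plain dates, a half-open range such as `order_date >= '2023-01-01' AND order_date < '2024-01-01'` avoids missing rows from the last day of the year.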

We fear AI will outthink us.

But the truth is, we're already over-fatiguing ourselves.
Rest isn't weakness, it's your upgrade cycle.

Rest, meditate, exercise, and upgrade yourself.

31.07.2025 05:24 | 👍 0    🔁 0    💬 0    📌 0

Your brain isn't a black-box model like a big LLM.

You can understand it.

Meditate. Journal. Listen. Debug your thoughts and get better.

30.07.2025 04:51 | 👍 0    🔁 0    💬 0    📌 0

I wish I had learned this earlier in data engineering:

"Data quality > data quantity."

Don't just move data. Validate it. Profile it. Monitor it at every stage of the data life cycle.
Otherwise, you're shipping junk faster.

29.07.2025 14:13 | 👍 0    🔁 0    💬 0    📌 0

Prompt engineering won't fix poor posture.

Your desk setup, movements, and fitness are part of your workflow.

Fix it manually.

29.07.2025 06:19 | 👍 0    🔁 0    💬 0    📌 0

Golden rule for NULLs:

NULLs don't behave like empty strings.
In SQL, col != '' evaluates to NULL for NULL values, so those rows are silently filtered out.
In Spark, use .na.drop() or .na.fill() to stay explicit.

Treat NULLs as first-class citizens, or they'll sneak into your reports.

29.07.2025 04:45 | 👍 0    🔁 0    💬 0    📌 0
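
The SQL behaviour is easy to reproduce with Python's built-in `sqlite3`. The `users` table and the `count` helper below are hypothetical and serve only as a small demonstration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (name TEXT)")
# One real name, one empty string, one NULL.
conn.executemany("INSERT INTO users VALUES (?)", [("alice",), ("",), (None,)])


def count(where):
    """Count rows matching a WHERE clause (trusted, hard-coded input only)."""
    return conn.execute(f"SELECT COUNT(*) FROM users WHERE {where}").fetchone()[0]


non_empty = count("name != ''")                  # the NULL row silently drops out
explicit = count("name IS NULL OR name != ''")   # the NULL row is kept on purpose
empty_or_null = count("name = '' OR name IS NULL")

print(non_empty, explicit, empty_or_null)
```

The NULL row matches neither `name != ''` nor `name = ''`, which is why it has to be handled explicitly with `IS NULL` or a default value.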

Golden rule for SQL:

Scan less, query faster.
Use indexes for normal transactional DBs.

Use partitioning in Hive, Iceberg, or BigQuery to reduce data scanned.

One job went from 20 minutes to 2 with proper partition_column BETWEEN filters.

28.07.2025 04:46 | 👍 0    🔁 0    💬 0    📌 0

Systems can stay awake for hours without breaks.

But you are not a system, so why sit in the same place for hours?

Movement brings mental clarity, fitness, and many other benefits.

Stand. Stretch. Walk. Often.

28.07.2025 04:40 | 👍 0    🔁 0    💬 0    📌 0

Your body is your first and most important operating system.

Don't run it on low-quality fuel, zero sleep, and 100 tabs open.

27.07.2025 07:47 | 👍 0    🔁 0    💬 0    📌 0

A 1% daily improvement in you is transformational.
Compound interest isn't just for money and investments.

26.07.2025 04:50 | 👍 0    🔁 0    💬 0    📌 0
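
The arithmetic behind the claim, assuming the 1% gain compounds every day for a full year:

```python
daily_gain = 0.01  # 1% improvement per day
days = 365

# Compounding: each day's gain builds on the previous day's level.
growth = (1 + daily_gain) ** days
print(f"{growth:.1f}x after {days} days")
```

Simply adding 1% of the starting level each day would give about 4.65x, so most of the result comes from the compounding itself.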

If most of what your product does can be done in Sheets/Excel or Notion pages, would you go for a separate tool just for the better UI?

This is one of the common reasons products don't get enough paid users or growth. People don't see the bigger picture here.

25.07.2025 13:56 | 👍 0    🔁 0    💬 0    📌 0

AI and systems are designed to handle multiple tasks simultaneously, while your brain and body are made to do a single task with full focus.

Don't let multitasking slow down your progress. Do one task at a time in the best possible way, and you will be way faster than multitasking.

25.07.2025 05:31 | 👍 0    🔁 0    💬 0    📌 0
