Category "data-warehouse"

A simple left join query taking lot of time for output

In Azure SYNAPSE I have two tables table A with 6 millions of records and Table B with 2 millions when I run a simple left join query it takes around 20 minutes

Redshift design or configuration issue? - My Redshift datawarehouse seems much slower than my mysql database

I have a Redshift datawarehouse that is pulling data in from multiple sources. One is my from MySQL and the others are some cloud based databases that get pulle

Distinct aggregation in pre-calculated measure (MDX)

There are two measures in one fact table \ dimension. Measure 'YearTotal' should somehow be pre-calcuated as a distinct value for any futher summing (aggregatin

ECLAT Algorithm to find maximal and closed frequent sets

Transaction ID Items 1 {A, C, D} 2 {B, C, E} 3 {A, B, C, E} 4 {B, E} 5 {A, B, C, E} Minimum support

Data warehouse design for parking lot - date and time dimensions

I came across a data warehousing practice question. I tried to think of various ways to design this but I'm not sure what the best practices are. The question d

Does Primary key in Dimension table have to be part of the Primary key in the Fact table?

We have a Star schema with a very big Fact table, and around 20 Dimension tables, of course the PK from Dimension Tables have to be FK in the Fact table. But d