One thing rarely discussed with the rise of big data is how to do *efficient* qu...

ekianjo · on Aug 26, 2018

Exactly. Optimising queries becomes a critical part of the job when you run complex JOINs over millions of records. Just getting the data can take a huge amount of time before you can even start thinking about models.

tixocloud · on Aug 28, 2018

I’ve been told by IT from numerous organisations that Hadoop will solve all of our team’s query inefficiencies.

That's also why we teach new team members how to write efficient queries and joins, and to spend time upfront structuring their problems.

bioquestion · on Aug 26, 2018

I work as a biostatistician and I've been tasked recently with querying large databases using SQL, in addition to analysing the data. However, my programming background is very limited and thus I'm sure my queries are very inefficient.

Could you point me to some materials/texts on improving SQL query efficiency? Something aimed at beginners would be ideal.

Thanks in advance.

collyw · on Aug 27, 2018

This site is great.

https://use-the-index-luke.com/
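
To give a rough flavour of what indexing buys you, here is a minimal sketch using Python's built-in sqlite3 module. The `visits` table, its columns, and the index name are made up for illustration, and other databases have their own EXPLAIN syntax and output, so treat this as a toy under those assumptions rather than a recipe. It asks SQLite for its query plan before and after adding an index on the filtered column.

```python
import sqlite3


def query_plan(conn, sql, params=()):
    """Return the 'detail' column of SQLite's EXPLAIN QUERY PLAN output as one string."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    return " | ".join(row[3] for row in rows)


# Hypothetical table: 100,000 lab results spread over 1,000 patients.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE visits (id INTEGER PRIMARY KEY, patient_id INTEGER, result REAL)"
)
conn.executemany(
    "INSERT INTO visits (patient_id, result) VALUES (?, ?)",
    ((i % 1000, i * 0.5) for i in range(100_000)),
)

sql = "SELECT result FROM visits WHERE patient_id = ?"

# Without an index on patient_id, SQLite has to read every row of the table.
plan_before = query_plan(conn, sql, (42,))

# With an index on the filtered column, it can jump straight to the matching rows.
conn.execute("CREATE INDEX idx_visits_patient ON visits (patient_id)")
plan_after = query_plan(conn, sql, (42,))

print("before:", plan_before)
print("after: ", plan_after)
```

The exact wording of the plan depends on your SQLite version, but the first plan should report a full scan of `visits` and the second a search using `idx_visits_patient`. Checking the plan like this before running a slow query on the real data is usually the quickest way to spot a missing index.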

bioquestion · on Aug 29, 2018

Thank you! I've already started reading it and it seems to be just what I needed.