Data lakes take on big data
A data lake is a scalable data storage repository that can ingest large amounts of raw data and make it available on-demand. Robert Sheldon explains the benefits and challenges of data lakes.
2022-02-04
A data lake is a scalable data storage repository that can ingest large amounts of raw data and make it available on-demand. Robert Sheldon explains the benefits and challenges of data lakes.
2022-02-04
In this article we look at how to create a scale-out SQL Server PolyBase solution on AWS
2021-12-08
Data sizes are always growing. Stats on world data are astounding, as are the stats many of us experience in our lives. Plenty of us have moved from MB management to GBs, and I see plenty of people dealing with TB storage at home. Most of that data is likely from images and video, but […]
2020-09-09
136 reads
Steve talks data virtualization in the age of growing data sets and larger workloads
2020-02-27
255 reads
Why has Microsoft added new capabilities in SQL Server to connect to other types of data sources? Read on to learn more.
2021-05-14 (first published: 2019-09-09)
9,695 reads
When it comes to data warehouse modernization, we’re big fans of moving to the cloud. ...
2019-04-17
The overall importance of data and information within organizations has continued to grow. We’ve also seen the continued rise of megatrends like IoT, big data – even too much...
2019-04-16
For many companies, the initial attraction to Azure Databricks is the platform’s ability to process big data in a fast, secure, and collaborative environment. However, another highly advantageous feature is the Databricks dashboard.
2019-03-30
This post describes how to generate big datasets with Hive in HDInsight, specifically TPC-DS benchmarking datasets. There are many tools for generating sample data, and this one is particularly nice due to its familiarity and ability to generate massive...
2019-03-30
Whether you are running an RDBMS, or a Big Data system, it is important to consider your data-partitioning strategy. As the volume of data grows, so it becomes increasingly important to match the way you partition your data to the way it is queried, to allow 'pruning' optimisation. When you have huge imports of data to consider, it can get complicated. Bartosz explains how to get things right; not perfect but wisely.
2016-11-22
3,345 reads
By Steve Jones
Thanks to everyone that came to my talks. Slides are below. Best Practices for...
By Steve Jones
I can’t remember how I heard about Small Data SF 2024, but it caught...
By Steve Jones
moledro – n. a feeling of resonant connection with an author or artist you’ll...
Comments posted to this topic are about the item New article
Comments posted to this topic are about the item Monitoring Azure Blob Storage
Azure Data Lake Storage Gen 2 is built on ...?
See possible answers