Data lakes take on big data

Big Data

Article

A data lake is a scalable storage repository that can ingest large amounts of raw data and make it available on demand. Robert Sheldon explains the benefits and challenges of data lakes.

2022-02-04

Create a SQL Server PolyBase Scale-out Group Using Amazon Web Services

by Additional Articles

MSSQLTips.com

Big Data

Article

In this article, we look at how to create a scale-out SQL Server PolyBase solution on AWS.

2021-12-08

Do You Have Big Data?

by Steve Jones

SQLServerCentral

Big Data

Editorial

Data sizes are always growing. Stats on world data are astounding, as are the stats many of us experience in our lives. Plenty of us have moved from MB management to GBs, and I see plenty of people dealing with TB storage at home. Most of that data is likely from images and video, but […]

2020-09-09

136 reads

Data Orchestration

by Steve Jones

SQLServerCentral

Editorial

Steve discusses data virtualization in an age of growing data sets and larger workloads.

2020-02-27

255 reads

SQL Server Integrates Hadoop and Spark out-of-the box: The Why?

by Frank A. Banin

SQLServerCentral

Article

Why has Microsoft added new capabilities in SQL Server to connect to other types of data sources? Read on to learn more.

4.75 (8)

2021-05-14 (first published: 2019-09-09)

9,695 reads

Reduce costs by adding a data lake to your cloud data warehouse

by administrator

SQLServerCentral

Big Data

DatabaseWeekly

When it comes to data warehouse modernization, we’re big fans of moving to the cloud. ...

2019-04-17

Top big data analytics trends hold true well into 2019

by administrator

SQLServerCentral

Big Data

DatabaseWeekly

The overall importance of data and information within organizations has continued to grow. We’ve also seen the continued rise of megatrends like IoT, big data – even too much...

2019-04-16

Databricks Dashboards: Data Exploration With Salary Classification

by DaveConvery

SQLServerCentral

Big Data

DatabaseWeekly

For many companies, the initial attraction to Azure Databricks is the platform’s ability to process big data in a fast, secure, and collaborative environment. However, another highly advantageous feature is the Databricks dashboard.

2019-03-30

Generate Big Datasets with Hive in HDInsight

by DaveConvery

SQLServerCentral

Big Data

DatabaseWeekly

This post describes how to generate big datasets with Hive in HDInsight, specifically TPC-DS benchmarking datasets. There are many tools for generating sample data, and this one is particularly nice due to its familiarity and ability to generate massive...

2019-03-30

Why Would I Ever Need to Partition My Big ‘Raw’ Data?

by Additional Articles

SimpleTalk

Big Data

Article

Whether you are running an RDBMS or a Big Data system, it is important to consider your data-partitioning strategy. As the volume of data grows, it becomes increasingly important to match the way you partition your data to the way it is queried, to allow 'pruning' optimisation. When you have huge imports of data to consider, it can get complicated. Bartosz explains how to get things right: not perfectly, but wisely.

2016-11-22

3,345 reads

Big Data

Data lakes take on big data
