Dell PowerEdge R640 55TB Data Warehouse Fast Track Reference Architecture for - Page 5

Microsoft SQL Server 2017: Data warehousing with, improved column-store technology

Page 5 highlights

Microsoft SQL Server 2017: Data warehousing with improved column-store technology 1 Microsoft SQL Server 2017: Data warehousing with improved column-store technology Microsoft SQL Server 2017 has made significant improvements in data warehousing technologies and performance, including column-store features as well as many other improvements. Column-store indices offer great advantages over traditional row stores for analytics and data warehousing queries. They are ideally suited for the star schemas, and tables with billions of rows which are commonly seen. Among their advantages for analytics are: Up to 10X compression in data size: Data warehouses are very large by nature and the compression offered by column-store index technologies offers both space and cost savings as well as significantly increased performance. These benefits are possible due to the dramatically reduced I/O requirements given by the compression and coupled by the ability to only scan the specific columns required by each query. This compression also reduces the amount of memory required to hold a given number of rows from the source data warehouse. Additional indices: SQL Server 2017 adds the capability to add B-tree indices to column store-based tables, which enables efficient single-row lookup. In addition to these architectural features, Microsoft has further optimized the processing of queries in column-store indices in the following ways: Operator pushdown: Pushdown refers to moving both filter and aggregation query operations closer to the data, so that many of the filters and calculations can be done in the scan operators, dramatically reducing the volume of data that needs to be handled further on in query processing. Batch-mode processing: SQL Server 2017 includes enhancements in batch-mode processing that handles many rows at a time rather than serially doing calculations on each individual row. These batch operations are further optimized by leveraging Single Instruction Multiple Data (SIMD) vector processing CPU instructions in the Intel® architectures. 1.1 Dell EMC Data Warehouse Fast Track reference architectures for SQL Server 2017 Dell EMC and Microsoft have refreshed the DWFT reference architecture offerings with the latest technology advancements in database, server, and storage technology. Dell EMC PowerEdge 14G servers with Intel® Xeon® processors, Dell EMC PowerVault ME4024 arrays, and Microsoft SQL Server 2017 are the latest additions to the list of reference architecture components. The Dell EMC DWFT reference architectures for SQL Server 2017 are engineered jointly by Dell EMC and Microsoft. The hardware and software optimizations are tested by Dell EMC and the performance results are crosschecked by Microsoft. This approach presents a fast time to value using integrated, balanced, and verified architectures. 5 55TB Data Warehouse Fast Track Reference Architecture for Microsoft SQL Server 2017 using Dell EMC PowerEdge R640 and Dell EMC PowerVault ME4024 | 3918-RA-SQL

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22

Microsoft SQL Server 2017: Data warehousing with improved column-store technology
5
55TB Data Warehouse Fast Track Reference Architecture for Microsoft SQL Server 2017 using Dell EMC
PowerEdge R640 and Dell EMC PowerVault ME4024 | 3918-RA-SQL
1
Microsoft SQL Server 2017: Data warehousing with
improved column-store technology
Microsoft SQL Server 2017 has made significant improvements in data warehousing technologies and
performance, including column-store features as well as many other improvements.
Column-store indices offer great advantages over traditional row stores for analytics and data warehousing
queries. They are ideally suited for the star schemas, and tables with billions of rows which are commonly
seen. Among their advantages for analytics are:
Up to 10X compression in data size:
Data warehouses are very large by nature and the compression
offered by column-store index technologies offers both space and cost savings as well as significantly
increased performance. These benefits are possible due to the dramatically reduced I/O requirements given
by the compression and coupled by the ability to only scan the specific columns required by each query. This
compression also reduces the amount of memory required to hold a given number of rows from the source
data warehouse.
Additional indices:
SQL Server 2017 adds the capability to add B-tree indices to column store-based tables,
which enables efficient single-row lookup.
In addition to these architectural features, Microsoft has further optimized the processing of queries in
column-store indices in the following ways:
Operator pushdown:
Pushdown refers to moving both filter and aggregation query operations closer to the
data, so that many of the filters and calculations can be done in the scan operators, dramatically reducing the
volume of data that needs to be handled further on in query processing.
Batch-mode processing:
SQL Server 2017 includes enhancements in batch-mode processing that handles
many rows at a time rather than serially doing calculations on each individual row. These batch operations are
further optimized by leveraging Single Instruction Multiple Data (SIMD) vector processing CPU instructions in
the Intel
®
architectures.
1.1
Dell EMC Data Warehouse Fast Track reference architectures for
SQL Server 2017
Dell EMC and Microsoft have refreshed the DWFT reference architecture offerings with the latest technology
advancements in database, server, and storage technology. Dell EMC PowerEdge 14G servers with Intel
®
Xeon
®
processors, Dell EMC PowerVault ME4024 arrays, and Microsoft SQL Server 2017 are the latest
additions to the list of reference architecture components.
The Dell EMC DWFT reference architectures for SQL Server 2017 are engineered jointly by Dell EMC and
Microsoft. The hardware and software optimizations are tested by Dell EMC and the performance results are
crosschecked by Microsoft. This approach presents a fast time to value using integrated, balanced, and
verified architectures.