Menu
Data Concepts– CREATE DATABASE dbName; GO

Data Concepts– CREATE DATABASE dbName; GO

There are many concepts you must be aware, comfortable, and competent with to manage data efficiently. This section covers many data concepts that will not only help you pass the Data Engineering on Microsoft Azure exam, but also help you do the job in the real world. Keep in mind that when discussing relational structure […]

Data Management– CREATE DATABASE dbName; GO

Data Management– CREATE DATABASE dbName; GO

Don’t confuse data management with database management, where the focus is on the mechanics of the DBMS. When you choose to run your database on the Azure platform and select a PaaS product, then the management of that database is no longer your or your company’s responsibility. Instead, the focus here is the management of […]

Table Categories – CREATE DATABASE dbName; GO

Table Categories – CREATE DATABASE dbName; GO

You might be wondering which distribution model you should use. The answer has to do with the table category to which the table you are creating belongs; see Table 2.3. TABLE 2.3 Table category distribution matrix Category Distribution model Staging ROUND_ROBIN Fact HASH Dimension (small table) REPLICATED Dimension (large table) HASH STAGING TABLE A staging […]

Unstructured– CREATE DATABASE dbName; GO

Unstructured– CREATE DATABASE dbName; GO

This kind of data is typically media files like audio, video, or images. There is no available interface for developers to use to query the contents of media files. There are some advancements happening in the Azure Cognitive Services area, where some artificial intelligence (AI) algorithms are able the identify visual or sound patterns. Those […]

Distributed Tables – CREATE DATABASE dbName; GO

Distributed Tables – CREATE DATABASE dbName; GO

Introduced in Chapter 1, “Gaining the Azure Data Engineer Associate Certification,” there are two options when it comes to Azure Synapse Analytics SQL pools: serverless and dedicated. When you run in serverless, there is no need to be overly concerned about database table structures because you are querying data that already exists in other sources. […]

Symmetric Multiprocessing (SMP)– CREATE DATABASE dbName; GO

Symmetric Multiprocessing (SMP)– CREATE DATABASE dbName; GO

You will find the MMP design in Azure Synapse Analytics and Symmetric Multiprocessing (SMP) design in Azure SQL Database. MMP processors (i.e., CPUs) are allocated with dedicated compute resources like memory, while SMP shares those compute resources. Consider the fact that the retrieval of datastored in memory is less latent than when stored on disk. […]

Create an Azure Cosmos DB– CREATE DATABASE dbName; GO

Create an Azure Cosmos DB– CREATE DATABASE dbName; GO

FIGURE 2.6 Azure Cosmos DB APIs FIGURE 2.7 Azure Cosmos Data Explorer FIGURE 2.8 Azure Cosmos Data Explorer SQL query The first query returns the scenario from all the files in that container. The second query returns the first reading for a specific scenario.

Semi‐Structured – CREATE DATABASE dbName; GO

Semi‐Structured – CREATE DATABASE dbName; GO

Two problems that have come up regarding relational databases over the years have to do with the complexity of SQL queries and the difficulty of representing objects. The simple structured database you created in the previous exercise contains six tables. Creating a query to join three, four, or even five of those tables together would […]