Data Engineering on Microsoft Azure (DP-203 Japanese Version) Free Practice Test

Question 1

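You are implementing window functions in Azure Stream Analytics.
Which window function should you use for each requirement? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.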

Correct Answer:

Question 2

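You need to implement versioned changes to the integration pipelines. The solution must meet the data integration requirements.
In which order should you perform the actions? To answer, move all actions from the list of actions to the answer area and arrange them in the correct order.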

Correct Answer:

Explanation:

Scenario: Identify a process to ensure that changes to the ingestion and transformation activities can be version-controlled and developed independently by multiple data engineers.
Step 1: Create a repository and a main branch
You need a Git repository in Azure Pipelines, TFS, or GitHub with your app.
Step 2: Create a feature branch
Step 3: Create a pull request
Step 4: Merge changes
Merge feature branches into the main branch using pull requests.
Step 5: Publish changes
Reference:
https://docs.microsoft.com/en-us/azure/devops/pipelines/repos/pipeline-options-for-git
Topic 1, Contoso Case Study
Transactional Data
Contoso has three years of customer, transactional, operation, sourcing, and supplier data comprised of 10 billion records stored across multiple on-premises Microsoft SQL Server servers. The SQL Server instances contain data from various operational systems. The data is loaded into the instances by using SQL Server Integration Services (SSIS) packages.
You estimate that combining all product sales transactions into a company-wide sales transactions dataset will result in a single table that contains 5 billion rows, with one row per transaction.
Most queries targeting the sales transactions data will be used to identify which products were sold in retail stores and which products were sold online during different time periods. Sales transaction data that is older than three years will be removed monthly.
You plan to create a retail store table that will contain the address of each retail store. The table will be approximately 2 MB. Queries for retail store sales will include the retail store addresses.
You plan to create a promotional table that will contain a promotion ID. The promotion ID will be associated with a specific product. The product will be identified by a product ID. The table will be approximately 5 GB.
Streaming Twitter Data
The ecommerce department at Contoso develops an Azure logic app that captures trending Twitter feeds referencing the company's products and pushes the products to Azure Event Hubs.
Planned Changes
Contoso plans to implement the following changes:
* Load the sales transaction dataset to Azure Synapse Analytics.
* Integrate on-premises data stores with Azure Synapse Analytics by using SSIS packages.
* Use Azure Synapse Analytics to analyze Twitter feeds to assess customer sentiments about products.
Sales Transaction Dataset Requirements
Contoso identifies the following requirements for the sales transaction dataset:
* Partition data that contains sales transaction records. Partitions must be designed to provide efficient loads by month. Boundary values must belong to the partition on the right.
* Ensure that queries joining and filtering sales transaction records based on product ID complete as quickly as possible.
* Implement a surrogate key to account for changes to the retail store addresses.
* Ensure that data storage costs and performance are predictable.
* Minimize how long it takes to remove old records.
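The "boundary values must belong to the partition on the right" requirement can be illustrated with a minimal Python sketch. This is not T-SQL and not Contoso's actual design: the function names and sample dates are assumptions chosen for illustration. It only models how RANGE RIGHT semantics assign a row to a monthly partition.

```python
from bisect import bisect_right
from datetime import date


def monthly_boundaries(start: date, months: int) -> list[date]:
    """Return the first day of `months` consecutive months, beginning at `start`."""
    boundaries = []
    year, month = start.year, start.month
    for _ in range(months):
        boundaries.append(date(year, month, 1))
        month += 1
        if month > 12:
            year, month = year + 1, 1
    return boundaries


def partition_index_range_right(value: date, boundaries: list[date]) -> int:
    """Return the 0-based partition number under RANGE RIGHT semantics.

    A value equal to a boundary lands in the partition to the right of that
    boundary, so each boundary is the lowest value of its partition.
    """
    return bisect_right(boundaries, value)


# Hypothetical boundaries: 2021-01-01, 2021-02-01, 2021-03-01.
boundaries = monthly_boundaries(date(2021, 1, 1), 3)

# 2021-02-01 is itself a boundary; RANGE RIGHT groups it with February rows.
feb_first = partition_index_range_right(date(2021, 2, 1), boundaries)
jan_last = partition_index_range_right(date(2021, 1, 31), boundaries)
```

With boundaries on the first day of each month, every partition holds exactly one calendar month, which is what makes monthly loads and monthly removal of old data line up with whole partitions.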
Customer Sentiment Analytics Requirement
Contoso identifies the following requirements for customer sentiment analytics:
* Allow Contoso users to use PolyBase in an Azure Synapse Analytics dedicated SQL pool to query the content of the data records that host the Twitter feeds. Data must be protected by using row-level security (RLS). The users must be authenticated by using their own Azure AD credentials.
* Maximize the throughput of ingesting Twitter feeds from Event Hubs to Azure Storage without purchasing additional throughput or capacity units.
* Store Twitter feeds in Azure Storage by using Event Hubs Capture. The feeds will be converted into Parquet files.
* Ensure that the data store supports Azure AD-based access control down to the object level.
* Minimize administrative effort to maintain the Twitter feed data records.
* Purge Twitter feed data records that are older than two years.
Data Integration Requirements
Contoso identifies the following requirements for data integration:
* Use an Azure service that leverages the existing SSIS packages to ingest on-premises data into datasets stored in a dedicated SQL pool of Azure Synapse Analytics and transform the data.
* Identify a process to ensure that changes to the ingestion and transformation activities can be version-controlled and developed independently by multiple data engineers.

Question 3

You are creating a new notebook in Azure Databricks that will support R as the primary language but will also support Scala and SQL. Which switch should you use to switch between languages?

A. %<Language>

B. \\(<Language>)

C. \\(<Language>)

D. @<Language>

Correct Answer: A


Question 4

What should you recommend using to secure sensitive customer contact information?

A. column-level security

B. data labels

C. row-level security

D. Transparent Data Encryption (TDE)

Correct Answer: A


Question 5

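You plan to create a table in an Azure Synapse Analytics dedicated SQL pool.
Data in the table will be retained for five years. Once a year, data that is older than five years will be deleted.
You need to ensure that the data is distributed evenly across partitions. The solution must minimize the amount of time required to delete old data.
How should you complete the Transact-SQL statement? To answer, drag the appropriate values to the correct targets. Each value may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.
NOTE: Each correct selection is worth one point.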

Correct Answer:

Explanation:
Box 1: HASH
Box 2: OrderDateKey
In most cases, table partitions are created on a date column.
A way to eliminate rollbacks is to use Metadata Only operations like partition switching for data management.
For example, rather than execute a DELETE statement to delete all rows in a table where the order_date was in October of 2001, you could partition your data early. Then you can switch out the partition with data for an empty partition from another table.
Reference:
https://docs.microsoft.com/en-us/sql/t-sql/statements/create-table-azure-sql-data-warehouse
https://docs.microsoft.com/en-us/azure/synapse-analytics/sql/best-practices-dedicated-sql-pool
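
The statement from the question is not reproduced on this page, so the following Python sketch only builds a statement of the same general shape as a string, to show where HASH and OrderDateKey would go. The table name, the ProductKey column, the RANGE RIGHT choice, and the yearly boundary values are all assumptions for illustration.

```python
def yearly_date_keys(first_year: int, years: int) -> list[int]:
    """Return integer date keys (yyyymmdd) for January 1 of each year."""
    return [year * 10000 + 101 for year in range(first_year, first_year + years)]


def build_create_table(table: str, boundaries: list[int]) -> str:
    """Build a dedicated SQL pool CREATE TABLE statement as text.

    ProductKey is a hypothetical distribution column; OrderDateKey is the
    partitioning column named in the explanation above.
    """
    values = ", ".join(str(b) for b in boundaries)
    return (
        f"CREATE TABLE {table}\n"
        "(\n"
        "    ProductKey int NOT NULL,\n"
        "    OrderDateKey int NOT NULL\n"
        ")\n"
        "WITH\n"
        "(\n"
        "    CLUSTERED COLUMNSTORE INDEX,\n"
        "    DISTRIBUTION = HASH(ProductKey),\n"
        f"    PARTITION (OrderDateKey RANGE RIGHT FOR VALUES ({values}))\n"
        ");"
    )


# One boundary per year, so a whole year of data sits in a single partition.
ddl = build_create_table("dbo.FactSales", yearly_date_keys(2020, 5))
print(ddl)
```

Because each year occupies its own partition, the yearly cleanup can switch out one partition as a metadata-only operation instead of running a large DELETE.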

Question 6

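Which Azure Data Factory components should you recommend using together to import the daily inventory data from the SQL server to Azure Data Lake Storage? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.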

Correct Answer:

Explanation:

Box 1: Self-hosted integration runtime
A self-hosted IR is capable of running copy activity between cloud data stores and a data store in a private network.
Box 2: Schedule trigger
Schedule every 8 hours
Box 3: Copy activity
Scenario:
Customer data, including name, contact information, and loyalty number, comes from Salesforce and can be imported into Azure once every eight hours. Row modified dates are not trusted in the source table.
Product data, including product ID, name, and category, comes from Salesforce and can be imported into Azure once every eight hours. Row modified dates are not trusted in the source table.
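
As a rough illustration of the "schedule every 8 hours" choice, the Python sketch below computes the run times that an hourly recurrence with an interval of 8 would produce. The `recurrence` dictionary is only modeled on the frequency and interval fields of a schedule trigger; the start time and function name are assumptions, and the full trigger definition should be taken from the Data Factory documentation.

```python
from datetime import datetime, timedelta, timezone

# Assumed recurrence settings: fire once every 8 hours.
recurrence = {"frequency": "Hour", "interval": 8}


def run_times(start: datetime, recurrence: dict, count: int) -> list[datetime]:
    """Return the first `count` run times for an hourly recurrence."""
    if recurrence["frequency"] != "Hour":
        raise ValueError("this sketch only handles hourly recurrence")
    step = timedelta(hours=recurrence["interval"])
    return [start + i * step for i in range(count)]


# Hypothetical start time in UTC.
start = datetime(2024, 1, 1, 0, 0, tzinfo=timezone.utc)
runs = run_times(start, recurrence, 4)
for run in runs:
    print(run.isoformat())
```

Each run would start the pipeline whose copy activity moves the data through the self-hosted integration runtime.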

Question 7

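You have an Azure Stream Analytics job that is a Stream Analytics project solution in Microsoft Visual Studio. The job accepts data generated by IoT devices in the JSON format.
You need to modify the job to accept data generated by the IoT devices in the Protobuf format.
Which three actions should you perform from Visual Studio in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.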

Correct Answer:

Explanation:

Step 1: Add an Azure Stream Analytics Custom Deserializer Project (.NET) project to the solution.
Create a custom deserializer
1. Open Visual Studio and select File > New > Project. Search for Stream Analytics and select Azure Stream Analytics Custom Deserializer Project (.NET). Give the project a name, like Protobuf Deserializer.

2. In Solution Explorer, right-click your Protobuf Deserializer project and select Manage NuGet Packages from the menu. Then install the Microsoft.Azure.StreamAnalytics and Google.Protobuf NuGet packages.
3. Add the MessageBodyProto class and the MessageBodyDeserializer class to your project.
4. Build the Protobuf Deserializer project.
Step 2: Add .NET deserializer code for Protobuf to the custom deserializer project
Azure Stream Analytics has built-in support for three data formats: JSON, CSV, and Avro. With custom .NET deserializers, you can read data from other formats such as Protocol Buffer, Bond, and other user-defined formats for both cloud and edge jobs.
Step 3: Add an Azure Stream Analytics Application project to the solution
Add an Azure Stream Analytics project
In Solution Explorer, right-click the Protobuf Deserializer solution and select Add > New Project. Under Azure Stream Analytics > Stream Analytics, choose Azure Stream Analytics Application. Name it ProtobufCloudDeserializer and select OK.
Right-click References under the ProtobufCloudDeserializer Azure Stream Analytics project. Under Projects, add Protobuf Deserializer. It should be automatically populated for you.
Reference:
https://docs.microsoft.com/en-us/azure/stream-analytics/custom-deserializer

Question 8

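You have an Azure Synapse Analytics dedicated SQL pool named Pool1 that contains a table named Sales. Sales has row-level security (RLS) applied. RLS uses the following predicate filter.

A user named SalesUser1 is assigned the db_datareader role for Pool1. Which rows in the Sales table are returned when SalesUser1 queries the table?

A. all the rows

B. only the rows for which the value in the SalesRep column is Manager

C. only the rows for which the value in the User_Name column is SalesUser1

D. only the rows for which the value in the SalesRep column is SalesUser1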

Correct Answer: C

Question 9

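You have an Azure Synapse Analytics dedicated SQL pool that contains a table named Contacts. Contacts contains a column named Phone.
You need to ensure that users in a specific role only see the last four digits of a phone number when querying the Phone column.
What should you include in the solution?

A. dynamic data masking

B. row-level security (RLS)

C. a default value

D. table partitions

E. column encryption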

Correct Answer: A
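
To show the effect the requirement describes, here is a small Python sketch that hides every digit except the last four. In the SQL pool the mask would be declared on the column and applied by the engine at query time; this function, its name, and the sample number are assumptions used only to illustrate what a masked result looks like.

```python
def mask_phone(phone: str, visible: int = 4, mask_char: str = "X") -> str:
    """Replace every digit except the last `visible` digits with `mask_char`.

    Separators such as dashes are left untouched.
    """
    total_digits = sum(ch.isdigit() for ch in phone)
    to_mask = max(total_digits - visible, 0)
    out = []
    for ch in phone:
        if ch.isdigit() and to_mask > 0:
            out.append(mask_char)
            to_mask -= 1
        else:
            out.append(ch)
    return "".join(out)


masked = mask_phone("555-123-4567")
print(masked)  # XXX-XXX-4567
```

The stored value is unchanged in this model, which mirrors the idea that masking alters what a restricted user sees rather than what is stored.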


Question 10

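You have an Azure event hub named retailhub that has 16 partitions. Transactions are posted to retailhub. Each transaction includes the transaction ID, the individual line items, and the payment details. The transaction ID is used as the partition key.
You are designing an Azure Stream Analytics job to identify potentially fraudulent transactions at a retail store. The job will use retailhub as the input. The job will output the transaction ID, the individual line items, the payment details, a fraud score, and a fraud indicator.
You plan to send the output to an Azure event hub named fairhub.
You need to ensure that the fraud detection solution is highly scalable and processes transactions as quickly as possible.
How should you structure the output of the Stream Analytics job? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.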

Correct Answer:

Explanation:

Box 1: 16
For Event Hubs you need to set the partition key explicitly.
An embarrassingly parallel job is the most scalable scenario in Azure Stream Analytics. It connects one partition of the input to one instance of the query to one partition of the output.
Box 2: Transaction ID
Reference:
https://docs.microsoft.com/en-us/azure/event-hubs/event-hubs-features#partitions
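
The "one input partition to one output partition" idea can be sketched in Python. The CRC32 hash below is only a stand-in, not the hash that Event Hubs actually uses, and the `store_id` field and sample events are hypothetical. The sketch compares keying the output by transaction ID with keying it by a different field.

```python
import zlib
from collections import defaultdict

PARTITION_COUNT = 16  # same count for the input and the output event hub


def partition_for(key: str, partition_count: int = PARTITION_COUNT) -> int:
    """Map a partition key to a partition with a stable stand-in hash."""
    return zlib.crc32(key.encode("utf-8")) % partition_count


def fan_out(events: list[dict], output_key: str) -> dict[int, set[int]]:
    """For each input partition, collect the output partitions it writes to."""
    targets = defaultdict(set)
    for event in events:
        source = partition_for(event["transaction_id"])
        targets[source].add(partition_for(event[output_key]))
    return targets


# Hypothetical events; store_id exists only to contrast a different output key.
events = [
    {"transaction_id": f"txn-{n}", "store_id": f"store-{n % 50}"}
    for n in range(1000)
]

same_key = fan_out(events, "transaction_id")
other_key = fan_out(events, "store_id")

# With the same key and partition count, each input partition feeds one output partition.
max_same = max(len(t) for t in same_key.values())
# With a different output key, events from one input partition scatter across outputs.
max_other = max(len(t) for t in other_key.values())
```

In this model, 16 output partitions keyed by transaction ID keep every query instance independent, which is the embarrassingly parallel layout the explanation describes.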
