To minimize the number of rows added to the Orders table during refreshes, which solution component should be included?

Prepare for the DP-600 Fabric Analytics Engineer Exam. Study with flashcards and multiple choice questions, each offering hints and detailed explanations. Enhance your chances of success on the exam!

Multiple Choice

To minimize the number of rows added to the Orders table during refreshes, which solution component should be included?

Explanation:
Incremental loading by using the highest OrderID already in the destination lets the system pull only new rows. The Azure Data Factory pipeline with a dataflow can read the current max OrderID from the destination lakehouse, then load only orders with OrderID greater than that value. This keeps the destination table sized by only new data, avoids reloading existing rows, and reduces processing and bandwidth compared to re-reading the entire dataset. In contrast, reading the whole Orders dataset and appending would duplicate existing data; a Power BI refresh task isn’t an ETL step for updating the lakehouse, and truncating and reloading wipes and rebuilds the table instead of updating it incrementally.

Incremental loading by using the highest OrderID already in the destination lets the system pull only new rows. The Azure Data Factory pipeline with a dataflow can read the current max OrderID from the destination lakehouse, then load only orders with OrderID greater than that value. This keeps the destination table sized by only new data, avoids reloading existing rows, and reduces processing and bandwidth compared to re-reading the entire dataset. In contrast, reading the whole Orders dataset and appending would duplicate existing data; a Power BI refresh task isn’t an ETL step for updating the lakehouse, and truncating and reloading wipes and rebuilds the table instead of updating it incrementally.

Subscribe

Get the latest from Passetra

You can unsubscribe at any time. Read our privacy policy