We offers exam 70 475. "Designing and Implementing Big Data Analytics Solutions", also known as 70-475 exam, is a Microsoft Certification. This set of posts, Passing the 70-475 exam with exam 70 475, will help you answer those questions. The exam 70 475 covers all the knowledge points of the real exam. 100% real 70 475 exam and revised by experts!
Check 70-475 free dumps before getting the full version:
NEW QUESTION 1
You are designing a data-driven data flow in Microsoft Azure Data Factory to copy data from Azure Blob storage to Azure SQL Database.
You need to create the copy activity.
How should you complete the JSON code? To answer, drag the appropriate code elements to the correct targets. Each element may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content
NOTE: Each correct selection is worth one point.
Answer:
Explanation: 
NEW QUESTION 2
Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the states goals. Some question sets might have more than one correct solution, while the others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You plan to implement a new data warehouse.
You have the following information regarding the data warehouse:
The first data files for the data warehouse will be available in a few days.
Most queries that will be executed against the data warehouse are ad-hoc.
The schemas of data files that will be loaded to the data warehouse change often.
One month after the planned implementation, the data warehouse will contain 15 TB of data. You need to recommend a database solution to support the planned implementation.
Solution: You recommend a Microsoft SQL server on a Microsoft Azure virtual machine. Does this meet the goal?
- A. Yes
- B. No
Answer: B
NEW QUESTION 3
You are planning a solution that will have multiple data files stored in Microsoft Azure Blob storage every hour. Data processing will occur once a day at midnight only.
You create an Azure data factory that has blob storage as the input source and an Azure HD Insight activity that uses the input to create an output Hive table.
You need to identify a data slicing strategy for the data factory.
What should you identify? To answer, select the appropriate options in the answer area. NOTE: Each correct selection is worth one point.
Answer:
Explanation: 
NEW QUESTION 4
Your company builds hardware devices that contain sensors. You need to recommend a solution to process the sensor data and. What should you include in the recommendation?
- A. Microsoft Azure Event Hubs
- B. API apps in Microsoft Azure App Service
- C. Microsoft Azure Notification Hubs
- D. Microsoft Azure IoT Hub
Answer: A
NEW QUESTION 5
Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the states goals. Some question sets might have more than one correct solution, while the others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You plan to implement a new data warehouse.
You have the following information regarding the data warehouse:
The first data files for the data warehouse will be available in a few days.
Most queries that will be executed against the data warehouse are ad-hoc.
The schemas of data files that will be loaded to the data warehouse change often.
One month after the planned implementation, the data warehouse will contain 15 TB of data. You need to recommend a database solution to support the planned implementation.
Solution: You recommend an Apache Spark system. Does this meet the goal?
- A. Yes
- B. No
Answer: B
NEW QUESTION 6
You have a Microsoft Azure Data Factory pipeline.
You discover that the pipeline fails to execute because data is missing. You need to rerun the failure in the pipeline.
Which cmdlet should you use?
- A. Set-AzureAutomationJob
- B. Resume-AzureDataFactoryPipeline
- C. Resume-AzureAutomationJob
- D. Set-AzureDataFactotySliceStatus
Answer: B
NEW QUESTION 7
You have a Microsoft Azure data factory named ADF1 that contains a pipeline named Pipeline1. You plan to automate updates to Pipeline1.
You need to build the URL that must be called to update the pipeline from the REST API.
How should you complete the URL? To answer, drag the appropriate URL elements to the correct locations. Each URL element may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.
NOTE: Each correct selection is worth one point.
Answer:
Explanation: 
NEW QUESTION 8
You plan to implement a Microsoft Azure Data Factory pipeline. The pipeline will have custom business logic that requires a custom processing step.
You need to implement the custom processing step by using C#.
Which interface and method should you use? To answer, select the appropriate options in the answer area. NOTE: Each correct selection is worth one point.
Answer:
Explanation: References:
https://github.com/MicrosoftDocs/azure-docs/blob/master/articles/data-factory/v1/data-factory-use-custom-activ
NEW QUESTION 9
You need to create a query that identifies the trending topics.
How should you complete the query? To answer, drag the appropriate values to the correct targets. Each value may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.
NOTE: Each correct selection is worth one point.
Answer:
Explanation: From scenario: Topics are considered to be trending if they generate many mentions in a specific country during a 15-minute time frame.
Box 1: TimeStamp
Azure Stream Analytics (ASA) is a cloud service that enables real-time processing over streams of data flowing in from devices, sensors, websites and other live systems. The stream-processing logic in ASA is expressed in a SQL-like query language with some added extensions such as windowing for performing temporal calculations.
ASA is a temporal system, so every event that flows through it has a timestamp. A timestamp is assigned automatically based on the event's arrival time to the input source but you can also access a timestamp in your event payload explicitly using TIMESTAMP BY:
SELECT * FROM SensorReadings TIMESTAMP BY time Box 2: GROUP BY
Example: Generate an output event if the temperature is above 75 for a total of 5 seconds SELECT sensorId, MIN(temp) as temp
FROM SensorReadings TIMESTAMP BY time
GROUP BY sensorId, SlidingWindow(second, 5) HAVING MIN(temp) > 75
Box 3: SlidingWindow
Windowing is a core requirement for stream processing applications to perform set-based operations like counts or aggregations over events that arrive within a specified period of time. ASA supports three types of windows: Tumbling, Hopping, and Sliding.
With a Sliding Window, the system is asked to logically consider all possible windows of a given length and output events for cases when the content of the window actually changes – that is, when an event entered or existed the window.
NEW QUESTION 10
You use Microsoft Azure Data Factory to orchestrate data movement and data transformation within Azure. You need to identify which data processing failures exceed a specific threshold. What are two possible ways to achieve the goal? Each correct answer presents a complete solution. NOTE: Each correct selection is worth one point.
- A. View the Diagram tile on the Data Factory blade of the Azure portal.
- B. Set up an alert to send an email message when the number of failed validations is greater than the threshold.
- C. View the data factory metrics on the Data Factory blade of the Azure portal.
- D. Set up an alert to send an email message when the number of failed slices is greater than or equal to the threshold.
Answer: A
NEW QUESTION 11
Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while the others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You plan to deploy a Microsoft Azure SQL data warehouse and a web application.
The data warehouse will ingest 5 TB of data from an on-premises Microsoft SQL Server database daily. The web application will query the data warehouse.
You need to design a solution to ingest data into the data warehouse.
Solution: You use AzCopy to transfer the data as text files from SQL Server to Azure Blob storage, and then you use PolyBase to run Transact-SQL statements that refresh the data warehouse database.
Does this meet the goal?
- A. Yes
- B. No
Answer: A
Explanation: If you need the best performance, then use PolyBase to import data into Azure SQL warehouse.
Note: Often the speed of migration is an overriding concern compared to ease of setup and maintainability,
particularly when there’s a large amount of data to move. Optimizing purely for speed, a source controlled differentiated approach relying on bcp to export data to files, efficiently moving the files to Azure Blob storage, and using the Polybase engine to import from blob storage works best.
References: https://docs.microsoft.com/en-us/azure/sql-data-warehouse/sql-data-warehouse-migrate-data
NEW QUESTION 12
You have a data warehouse that contains the sales data of several customers.
You plan to deploy a Microsoft Azure data factory to move additional sales data to the data warehouse. You need to develop a data factory job that reads reference data from a table in the source data.
Which type of activity should you add to the control flow of the job?
- A. a ForEach activity
- B. a lookup activity
- C. a web activity
- D. a GetMetadata activity
Answer: B
Explanation: References:
https://docs.microsoft.com/en-us/azure/data-factory/control-flow-lookup-activity
NEW QUESTION 13
You need to create a new Microsoft Azure data factory by using Azure PowerShell. The data factory will have a pipeline that copies data to and from Azure Storage.
Which four cmdlets should you use in sequence? To answer, move the appropriate cmdlets from the list of cmdlets to the answer area and arrange them in the correct order.
Answer:
Explanation: Perform these operations in the following order:
Create a data factory.
Create linked services.
Create datasets.
Create a pipeline.
Step 1: New-AzureRMDataFactory Create a data factory
The New-AzureRmDataFactory cmdlet creates a data factory with the specified resource group name and location.
Step 2: New-AzureRMDataFactoryLinkedService
Create linked services in a data factory to link your data stores and compute services to the data factory. The New-AzureRmDataFactoryLinkedService cmdlet links a data store or a cloud service to Azure Data
Factory.
Step 3: New-AzureRMDataFactoryDataset
You define a dataset that represents the data to copy from a source to a sink. It refers to the Azure Storage linked service you created in the previous step.
The New-AzureRmDataFactoryDataset cmdlet creates a dataset in Azure Data Factory.
Step 4: New-AzureRMDataFactoryPipeline You create a pipeline.
The New-AzureRmDataFactoryPipeline cmdlet creates a pipeline in Azure Data Factory. References:
https://docs.microsoft.com/en-us/azure/data-factory/quickstart-create-data-factory-powershell https://docs.microsoft.com/en-us/powershell/module/azurerm.datafactories/new-azurermdatafactory
NEW QUESTION 14
You have an analytics solution in Microsoft Azure that must be operationalized.
You have the relevant data in Azure Blob storage. You use an Azure HDInsight Cluster to process the data. You plan to process the raw data files by using Azure HDInsight. Azure Data Factory will operationalize the
solution.
You need to create a data factory to orchestrate the data movement. Output data must be written back to Azure Blob storage.
Which four actions should you perform in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.
Answer:
Explanation: 
NEW QUESTION 15
You extend the dashboard of the health tracking application to summarize fields across several users. You need to recommend a file format for the activity data in Azure that meets the technical requirements.
What is the best recommendation to achieve the goal? More than one answer choice may achieve the goal. Select the BEST answer.
- A. ORC
- B. TSV
- C. CSV
- D. JSON
- E. XML
Answer: E
NEW QUESTION 16
You are automating the deployment of a Microsoft Azure Data Factory solution. The data factory will interact with a file stored in Azure Blob storage.
You need to use the REST API to create a linked service to interact with the file.
How should you complete the request body? To answer, drag the appropriate code elements to the correct locations. Each code may be used once, more than once, or not at all. You may need to drag the slit bar between panes or scroll to view content.
NOTE: Each correct selection is worth one point.
Answer:
Explanation: 
NEW QUESTION 17
Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while the others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You have a Microsoft Azure deployment that contains the following services:
Azure Data Lake
Azure Cosmos DB
Azure Data Factory
Azure SQL Database
You load several types of data to Azure Data Lake.
You need to load data from Azure SQL Database to Azure Data Lake. Solution: You use the Azure Import/Export service.
Does this meet the goal?
- A. Yes
- B. No
Answer: A
P.S. Easily pass 70-475 Exam with 102 Q&As Certleader Dumps & pdf Version, Welcome to Download the Newest Certleader 70-475 Dumps: https://www.certleader.com/70-475-dumps.html (102 New Questions)