{"id":820,"date":"2022-12-17T13:14:02","date_gmt":"2022-12-17T13:14:02","guid":{"rendered":"https:\/\/exampracticetests.com\/aws\/Data_Analytics-Specialty_DAS-C01\/aws-certified-data-analytics-specialty-das-c01-question134\/"},"modified":"2022-12-17T13:14:02","modified_gmt":"2022-12-17T13:14:02","slug":"aws-certified-data-analytics-specialty-das-c01-question134","status":"publish","type":"post","link":"https:\/\/exampracticetests.com\/aws\/Data_Analytics-Specialty_DAS-C01\/aws-certified-data-analytics-specialty-das-c01-question134\/","title":{"rendered":"AWS Certified Data Analytics &#8211; Specialty DAS-C01 &#8211; Question134"},"content":{"rendered":"<div class=\"question\">A data architect at a large financial institution is building a data platform on AWS with the intent of implementing fraud detection by identifying duplicate customer accounts. The fraud detection algorithm will run in a batch mode to identify when a newly created account matches one for a user that was previously fraudulent.<br \/>\nWhich approach MOST cost-effectively meets these requirements?<br \/><strong><br \/>A.<\/strong> Build a custom deduplication script by using Apache Spark on an Amazon EMR cluster. Use PySpark to compare the data frames that represent the new customers and the fraudulent customer set to identify matches.<br \/><strong>B.<\/strong> Load the data to an Amazon Redshift cluster. Use custom SQL to build deduplication logic.<br \/><strong>C.<\/strong> Load the data to Amazon S3 to form the basis of a data lake. Use Amazon Athena to build a deduplication script.<br \/><strong>D.<\/strong> Load the data to Amazon S3. Use the AWS Glue FindMatches transform to implement deduplication logic.<\/div>\n<p><\/p>\n<style> .hidden-div{ display:none } <\/style>\n<p>\t\t\t\t\t\t\t<button onclick=\"getElementById('hidden-div').style.display = 'block'\"> Show Answer <\/button> <button onclick=\"getElementById('hidden-div').style.display = 'none'\">Hide Answer<\/button><\/p>\n<div class=\"hidden-div\" id=\"hidden-div\"><span style=\"\"><\/p>\n<div class=\"answer\">Correct Answer: <strong>D<\/strong><\/div>\n<p><\/strong><\/span> <\/div>\n","protected":false},"excerpt":{"rendered":"<p>A data architect at a large financial institution is building a data platform on AWS with the intent of implementing fraud detection by identifying duplicate customer accounts. The fraud detection algorithm will run in a batch mode to identify when a newly created account matches one for a user that was previously fraudulent. Which approach [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2],"tags":[3,137],"class_list":["post-820","post","type-post","status-publish","format-standard","hentry","category-aws-certified-data-analytics-specialty-das-c01","tag-aws-certified-data-analytics-specialty-das-c01","tag-question-134"],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/exampracticetests.com\/aws\/Data_Analytics-Specialty_DAS-C01\/wp-json\/wp\/v2\/posts\/820","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/exampracticetests.com\/aws\/Data_Analytics-Specialty_DAS-C01\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/exampracticetests.com\/aws\/Data_Analytics-Specialty_DAS-C01\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/exampracticetests.com\/aws\/Data_Analytics-Specialty_DAS-C01\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/exampracticetests.com\/aws\/Data_Analytics-Specialty_DAS-C01\/wp-json\/wp\/v2\/comments?post=820"}],"version-history":[{"count":0,"href":"https:\/\/exampracticetests.com\/aws\/Data_Analytics-Specialty_DAS-C01\/wp-json\/wp\/v2\/posts\/820\/revisions"}],"wp:attachment":[{"href":"https:\/\/exampracticetests.com\/aws\/Data_Analytics-Specialty_DAS-C01\/wp-json\/wp\/v2\/media?parent=820"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/exampracticetests.com\/aws\/Data_Analytics-Specialty_DAS-C01\/wp-json\/wp\/v2\/categories?post=820"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/exampracticetests.com\/aws\/Data_Analytics-Specialty_DAS-C01\/wp-json\/wp\/v2\/tags?post=820"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}