{"id":726,"date":"2022-12-17T13:12:25","date_gmt":"2022-12-17T13:12:25","guid":{"rendered":"https:\/\/exampracticetests.com\/aws\/Data_Analytics-Specialty_DAS-C01\/aws-certified-data-analytics-specialty-das-c01-question040\/"},"modified":"2022-12-17T13:12:25","modified_gmt":"2022-12-17T13:12:25","slug":"aws-certified-data-analytics-specialty-das-c01-question040","status":"publish","type":"post","link":"https:\/\/exampracticetests.com\/aws\/Data_Analytics-Specialty_DAS-C01\/aws-certified-data-analytics-specialty-das-c01-question040\/","title":{"rendered":"AWS Certified Data Analytics &#8211; Specialty DAS-C01 &#8211; Question040"},"content":{"rendered":"<div class=\"question\">A gaming company is building a serverless data lake. The company is ingesting streaming data into Amazon Kinesis Data Streams and is writing the data to Amazon S3 through Amazon Kinesis Data Firehose. The company is using 10 MB as the S3 buffer size and is using 90 seconds as the buffer interval. The company runs an AWS Glue ETL job to merge and transform the data to a different format before writing the data back to Amazon S3.<br \/>\nRecently, the company has experienced substantial growth in its data volume. The AWS Glue ETL jobs are frequently showing an OutOfMemoryError error.<br \/>\nWhich solutions will resolve this issue without incurring additional costs? (Choose two.)<br \/><strong><br \/>A.<\/strong> Place the small files into one S3 folder. Define one single table for the small S3 files in AWS Glue Data Catalog. Rerun the AWS Glue ETL jobs against this AWS Glue table.<br \/><strong>B.<\/strong> Create an AWS Lambda function to merge small S3 files and invoke them periodically. Run the AWS Glue ETL jobs after successful completion of the Lambda function.<br \/><strong>C.<\/strong> Run the S3DistCp utility in Amazon EMR to merge a large number of small S3 files before running the AWS Glue ETL jobs.<br \/><strong>D.<\/strong> Use the groupFiles setting in the AWS Glue ETL job to merge small S3 files and rerun AWS Glue ETL jobs.<br \/><strong>E.<\/strong> Update the Kinesis Data Firehose S3 buffer size to 128 MB. Update the buffer interval to 900 seconds.<\/div>\n<p><\/p>\n<style> .hidden-div{ display:none } <\/style>\n<p>\t\t\t\t\t\t\t<button onclick=\"getElementById('hidden-div').style.display = 'block'\"> Show Answer <\/button> <button onclick=\"getElementById('hidden-div').style.display = 'none'\">Hide Answer<\/button><\/p>\n<div class=\"hidden-div\" id=\"hidden-div\"><span style=\"\"><\/p>\n<div class=\"answer\">Correct Answer: <strong>AC<\/strong><\/div>\n<p><strong>Explanation:<\/strong> <\/p>\n<div class=\"explanation\">\nReference: <a href=\"https:\/\/docs.aws.amazon.com\/glue\/latest\/dg\/grouping-input-files.html\" title=\"External link\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/docs.aws.amazon.com\/glue\/latest\/dg\/grouping-input-files.html<\/a><br \/>\n<a href=\"https:\/\/docs.aws.amazon.com\/glue\/latest\/dg\/grouping-input-files.html\" title=\"External link\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/docs.aws.amazon.com\/glue\/latest\/dg\/grouping-input-files.html<\/a><\/div>\n<p><\/strong><\/span> <\/div>\n","protected":false},"excerpt":{"rendered":"<p>A gaming company is building a serverless data lake. The company is ingesting streaming data into Amazon Kinesis Data Streams and is writing the data to Amazon S3 through Amazon Kinesis Data Firehose. The company is using 10 MB as the S3 buffer size and is using 90 seconds as the buffer interval. The company [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2],"tags":[3,207],"class_list":["post-726","post","type-post","status-publish","format-standard","hentry","category-aws-certified-data-analytics-specialty-das-c01","tag-aws-certified-data-analytics-specialty-das-c01","tag-question-040"],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/exampracticetests.com\/aws\/Data_Analytics-Specialty_DAS-C01\/wp-json\/wp\/v2\/posts\/726","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/exampracticetests.com\/aws\/Data_Analytics-Specialty_DAS-C01\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/exampracticetests.com\/aws\/Data_Analytics-Specialty_DAS-C01\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/exampracticetests.com\/aws\/Data_Analytics-Specialty_DAS-C01\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/exampracticetests.com\/aws\/Data_Analytics-Specialty_DAS-C01\/wp-json\/wp\/v2\/comments?post=726"}],"version-history":[{"count":0,"href":"https:\/\/exampracticetests.com\/aws\/Data_Analytics-Specialty_DAS-C01\/wp-json\/wp\/v2\/posts\/726\/revisions"}],"wp:attachment":[{"href":"https:\/\/exampracticetests.com\/aws\/Data_Analytics-Specialty_DAS-C01\/wp-json\/wp\/v2\/media?parent=726"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/exampracticetests.com\/aws\/Data_Analytics-Specialty_DAS-C01\/wp-json\/wp\/v2\/categories?post=726"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/exampracticetests.com\/aws\/Data_Analytics-Specialty_DAS-C01\/wp-json\/wp\/v2\/tags?post=726"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}