Managing Big Data in Clusters and Cloud Storage Quiz
Managing Big Data in Clusters and Cloud Storage Quiz Answer. This course is offered by “Coursera”. In this post you will get Managing Big Data in Clusters and Cloud Storage Coursera Quiz Answer | 100% Correct Answer
Managing Big Data in Clusters and Cloud Storage Quiz Coursera Quiz Answer
1.
Question 1
Use the Table Browser in Hue to view the tables in the fun database. (See the environment installation instructions for how to log in.) Which of the following tables are in the fun database? Check all that apply.
2.
Question 2
Use the Table Browser in Hue to view the columns in the employees table in the default database. Which of the following columns are in the employees table? Check all that apply.
6.
Question 6
What delimiter is used to separate the values in the lines of the text file containing the data in the orders table in the default database?
2.
Question 2
A new table is created using the following statement. The database used is in the default storage location in the Hive warehouse. Which statements describe the expected outcomes of this statement? Check all that apply.
CREATE TABLE thisdb.thistable (id TINYINT, name STRING);
1 point
The table is configured to store data in a directory named thistable
The table is configured to store data in /user/hive/warehouse/thistable
The table’s storage directory is a subdirectory of /user/hive/warehouse/thisdb.db
The table’s name is thisdb
The table is in the database thisdb
The table has four columns called id, TINYINT, name, and STRING
3.
Question 3
A data file specifies a song (such as “Bohemian Rhapsody”) on an album (in this case A Night At The Opera) by an artist or group (Queen). The file uses the pipe character (|) to separate values, so the example would look like this row:
Bohemian Rhapsody|A Night At The Opera|Queen
Which statement is appropriate to define a table using data in this format?
1 point
CREATE TABLE songs (song STRING, album STRING, artist STRING)
ROW FORMAT DELIMITED
FIELDS TERMINATED BY ‘\|’;
CREATE TABLE songs (song STRING, album STRING, artist STRING)
ROW FORMAT DELIMITED
FIELDS TERMINATED BY |;
CREATE TABLE songs (song STRING, album STRING, artist STRING)
ROW FORMAT DELIMITED BY ‘\|’;
CREATE TABLE songs (song STRING, album STRING, artist STRING)
ROW FORMAT DELIMITED BY |;
CREATE TABLE songs (song STRING, album STRING, artist STRING)
ROW FORMAT DELIMITED
FIELDS TERMINATED BY ‘|’;
CREATE TABLE songs (song STRING, album STRING, artist STRING)
4.
Question 4
The data for a table to be called weblogs is provided in Parquet files (with format PARQUET) and are placed in S3 in a directory named weblogs in the bucket named training-coursera1. Which statement correctly creates this table? (Assume the column list is correct.)
6.
Question 6
An alternative to using CREATE EXTERNAL TABLE when creating an externally managed table is to set the table property EXTERNAL to TRUE. Which of the following would correctly do this?
8.
Question 8
Which statements describe the differences between dropping int_table, which was created using CREATE TABLE (with no later alterations), and ext_table, which was created using CREATE EXTERNAL TABLE (with no later alterations)?
1 point
Dropping int_table might delete the directory in which its data is stored, but dropping ext_table will not.
Dropping int_table will not delete the directory in which its data is stored, but dropping ext_table might delete the directory for its table.
Dropping int_table will not delete the data for int_table, but dropping ext_table might drop the data for that table.
Dropping int_table might delete the data for the table, but dropping ext_table will not drop the data for that table.
10.
Question 10
Suppose you have been querying a table named mytable using Impala. You then added data to mytable using an hdfs dfs command, and you want to query the table in Impala again, with the new data. What is your best course of action?
1 point
Run REFRESH mytable; then query as usual
Run REFRESH; then query as usual
Run INVALIDATE METADATA; then query as usual
Query as usual, no other action needed
Run REFRESH METADATA mytable; then query as usual
Week 3 Graded Quiz
1.
Question 1
Which of these data types allows the greatest precision?
6.
Question 6
You have a column of integer values that you expect to range from -10 to 127. Using Impala, what’s the smallest integer data type you can use that allows you to identify which values are out of this range? The ranges of the integer data types are provided below as a reminder.
Integer Type Range
TINYINT -128 to 127
SMALLINT -32,768 to 32,767
INT -2,147,483,648 to 2,147,483,647 (approximately 2.1 billion)
1 point
4.
Question 4
On the course VM, which command could you use to upload the local file /home/training/training_materials/analyst/data/games.csv to an S3 bucket named bucket-o-games? For purposes of this question, assume you have write access to this bucket.
1 point
aws s3 put /home/training/training_materials/analyst/data/games.csv s3://bucket-o-games/