0% found this document useful (0 votes)
330 views5 pages

Hive Interview Questions

This document contains 93 questions about Hive architecture, concepts, and usage. It covers topics like Hive architecture, map-side joins, schema on read vs write, data types, indexes, performance optimization, joins, user defined functions, views, partitioning, bucketing, multiple inserts, renaming/altering tables, metastore, SerDes, loading data, querying performance, and security. Many questions focus on understanding Hive concepts, usage best practices, and troubleshooting query errors.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
330 views5 pages

Hive Interview Questions

This document contains 93 questions about Hive architecture, concepts, and usage. It covers topics like Hive architecture, map-side joins, schema on read vs write, data types, indexes, performance optimization, joins, user defined functions, views, partitioning, bucketing, multiple inserts, renaming/altering tables, metastore, SerDes, loading data, querying performance, and security. Many questions focus on understanding Hive concepts, usage best practices, and troubleshooting query errors.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 5

1.

Explain HIVE architecture

2. What is a map side join?

3. Explain Schema on Read Versus Schema on Write?

4. Difference between "Sort By" and "Group by" in Hive. How they work?

6. Explain various data types in HIVE

7. Explain type of index in HIVE?

8. What is the main difference between Hive and SQL

9. How to optimize Hive Performance?

10. Explain the joins different from RDBMS

11. Can we write map reduce scripts in HIVE?

12. What type of user defined functions exists in HIVE?

13. How HIVE views are different from RDBMS views?

14. How do you check the performance of a HIVE query?

15. Assume there is a array column in a table, how will you show the data in row format?

16. Explain partition and bucket concept in HIVE?

17. Explain multiple inserts in HIVE table.

18. Can you create data structure (columns and data type) of a new table which is similar to
existing table?

19. What is the difference between Drop table verses TRUNCATE?

20. Is LIMIT clause in HIVE really random?

21. Is Hive suitable to be used for OLTP systems? Why?

22. Can a table be renamed in Hive?


23. Can we change the data type of a column in a hive table?

24. What is a metastore in Hive?

25. What is the need for custom Serde?

26. Why do we need Hive?

27. What is the default location where hive stores table data?

28. What are the three different modes in which hive can be run?

29. Is there a date data type in Hive?

30. What are collection data types in Hive?

31. What is a Hive variable? What for we use it?

32. What is the importance of .hiverc file?

33. What are the default record and field delimiter used for hive text files?

34. What do you mean by schema on read?

35. How do you list all databases whose name starts with p?

36. What does the “USE” command in hive do?

37. How can you delete the DBPROPERTY in Hive?

38. What is the significance of the line set hive. mapred . mode =strict;

39. How do you check if a particular partition exists?

40. Which java class handles the Input record encoding into files which store the tables in
Hive?

41. Which java class handles the output record encoding into files which result from Hive
queries?

42. What is the significance of ‘IF EXISTS” clause while dropping a table?

43. When you point a partition of a hive table to a new directory, what happens to the data?
44. Write a query to insert a new column? Can you add a column with a default value in Hive?

45. Since there's no IFNULL, ISNULL, or NVL function supported on Hive, how to convert
NULL to 0?

46. Does the archiving of Hive tables give any space saving in HDFS?

47. How can you stop a partition form being queried?

48. While loading data into a hive table using the LOAD DATA clause, how do you specify it is
a hdfs file and not a local file ?

49. If you omit the OVERWRITE clause while creating a hive table, what happens to file which
are new and files which already exist?

50. What does the following query do? INSERT OVERWRITE TABLE employees PARTITION
(country, state) SELECT .., FROM staged_employees;

51. What is a Table generating Function on hive?

52. How can Hive avoid mapreduce?

53. What is the difference between LIKE and RLIKE operators in Hive?

54. Is it possible to create Cartesian join between 2 tables, using Hive?

55. What should be the order of table size in a join query?

56. What is the usefulness of the DISTRIBUTED BY clause in Hive?

57. What is the main difference between dynamic and static partitioning in hive?

58. How do you convert string to int or bigint in hive?

59. Can the name of a view be same as the name of a hive table?

60. Can we LOAD data into a view?

61. What types of costs are associated in creating index on hive tables?

62. Give the command to see the indexes on a table.


63. What is Hadoop Buffering vs Streaming?

64. What is the importance of STREAM TABLE?

65. Can a partition be archived? What are the advantages and Disadvantages?

66. What is a generic UDF in hive?

67. The following statement failed to execute. What can be the cause? LOAD DATA LOCAL
INPATH ‘${env:HOME}/country/state/’ OVERWRITE INTO TABLE address;

68. How do you specify the table creator name when creating a table in Hive?

69. What is SQL INTERFACE FOR HADOOP? HCATALOG

70. How does impala provide faster query response compared to hive?

71. What is the Hive configuration precedence order?

72. How do change settings within Hive Session?

73. How to print header on Hive query results?

74. How to get detailed description of a table in Hive?

75. How to access sub directories recursively in Hive queries?

76. How to skip header rows from a table in Hive?

77. Is it possible to create multiple table in hive for same data?

78. What is the maximum size of string data type supported by Hive?

79. What are the Binary Storage formats supported in Hive?

80. Is HQL case sensitive?

81. Describe CONCAT function in Hive with Example?

82. Describe REPEAT function in Hive with example?

83. Describe REVERSE function in Hive with example?


84. Describe TRIM function in Hive with example?

85. Describe RLIKE in Hive with an example?

86. How do you import and load .csv file into and out of HIVE?

87. Explain what happens when you run a query in HIVE?

88. How jason files are loaded and queried in HIVE?

89. Explain Data Serialization with Avro in Hive.

90. What are different types of meta store in Hive?

91. Which classes are used by the hive to read and write hdfs files?

92. What is the best way to implement row-based security in Hive?

93. Can you mention some lessons learned based on your experience?

You might also like