KIRTHIGA M
An enthusiastic, high-energy professional looking for a challenging role in an organization
that provides opportunities to enhance my skills and expand my knowledge for the growth of
the company.
+919600997520 kirthigam95@gmail.com
Professional Summary:
2 years of experience in the IT industry, including Big Data technologies in the Hadoop
ecosystem (HDFS, Hive, Sqoop, Apache Spark) and AWS.
Good understanding of Hadoop architecture and its components such as HDFS, MapReduce,
Partitioning, Bucketing, Controlling.
Experience in importing and exporting data between HDFS and RDBMS using Sqoop.
Experience in using Sqoop to import and export data from and to cloud-based data storage services
such as Amazon S3.
Worked with different file formats such as JSON, XML, Avro, and text files.
Experience in developing Spark applications using RDD transformations (Spark Core) and
DataFrames (Spark SQL) with Spark DSL and SQL functions.
Worked on AWS Components like S3, EMR and EC2.
Good knowledge of incremental imports, and of the partitioning and bucketing concepts in Hive
and Spark SQL needed for optimization.
Strong experience in configuring Sqoop to handle complex data structures such as nested and
hierarchical data.
Knowledge of Hive table formats, including ORC, Parquet, and Avro, and their advantages and
disadvantages for different use cases.
Proficient in using Sqoop to import and export data in various file formats such as CSV, Avro, and
Parquet.
Expertise in writing Hadoop Jobs for analyzing data using Hive.
Designed and implemented ETL processes using Spark.
Database experience in SQL Server and MySQL.
Good problem-solving and analytical skills, and ready to innovate in order to perform better.
Strong interpersonal and communication skills.
Technical Skills:
Data Eco System : Hadoop, Sqoop, Hive, Apache Spark
Cloud Skills : GCP, AWS
Distribution : Cloudera
Databases : SQL Server, MySQL
Languages : C, C++, Scala, Python, SQL
Operating Systems : Linux and Windows
Professional Experience:
Zeyobron Analytics - September 2023 to Present (0.6)
Straive SPI Global - March 2022 to August 2023 (1.5)
Shivasakthi Textile Processing Mills and Private Limited - April 2018 to Feb 2022 (4)
Work Experience:
August 2023 to February 2024: Big Data Developer in Zeyobron Analytics
Key Result Areas:
Maintained and monitored Spark clusters on AWS EMR, ensuring high availability and fault
tolerance.
Knowledge of Hive query tuning best practices, such as minimizing data transfers, avoiding
unnecessary data conversions, and using appropriate data formats and compression codecs.
Strong understanding of Hive integration with other big data technologies, such as Hadoop,
Spark, and Impala, and their impact on query performance and resource utilization.
Familiarity with Hive performance tuning tools, such as Hive Query Profiler, Hive Query Plan
Visualization, and Hive Load Testing Tools, and their features and limitations.
Experience in identifying and resolving performance bottlenecks in Hive, such as data skew,
inefficient joins and excessive shuffling.
Proficient in designing Avro schema for Hive tables and managing schema evolution to
accommodate changes in data structure and format.
Expertise in using Avro tools and libraries, such as the Avro command-line interface,
Avro IDL, and Avro schema resolution rules, to manage schema evolution in Hive.
Skilled in configuring Hive Avro serialization and deserialization settings, such as the
schema registry URL, the schema file path, and the schema versioning strategy.
Skilled in leveraging Hive partitioning, bucketing, indexing, and caching features to improve
query performance and reduce data processing overhead.
Knowledge of Hive serialized data processing best practices, such as choosing
appropriate serialization formats and codecs, optimizing data compression and encoding,
and avoiding serialization overhead in data processing.
Strong understanding of Hive serialized data processing performance optimization
techniques, such as using columnar storage, data partitioning, and indexing, and their
trade-offs in terms of query performance and resource utilization.
Experience working with other big data technologies, such as Hadoop, Spark, and Impala,
and integrating serialized data processing workflows with other data processing and
analytics tools.
Worked with Spark's data serialization formats (Avro, Parquet, JSON, etc.).
Implemented data lineage and tracking in Spark applications.
Ensured data security and access control in Spark applications.
Documented Spark workflows and best practices.
March 2022 to August 2023: ESG - Data Analyst in Straive SPI Global
Key Result Areas:
Calculating the source of energy used by the organization and grouping it under
renewable or non-renewable sources.
Analyzing data related to the various types of waste generated by the company and
segregating it into hazardous and non-hazardous categories.
Collecting data related to the societal policies offered by the company to ensure the
wellness of its employees, such as retirement programs, medical insurance, schemes for
disabled persons, and maternity plans.
Preparing the organizational structure of the company from the given data file and
capturing details of the work done by each member of that structure.
Analyzing company-related financial data such as revenue, debt, and cash and cash
equivalents.
April 2018 to Feb 2022: Accounts Manager in Shivasakthi Textile Processing Mills and Private Limited
Key Result Areas:
Managing accounting-related documents such as invoices, bills, purchases, taxation, GST,
accounts payable, and accounts receivable, with excellent knowledge of these concepts.
Academic Details:
2016 - 2018: M.E Power Electronics and Drives from Kongu Engineering College with a CGPA of 8.49
2012 - 2016: B.E Electrical & Electronics Engineering from Velalar College of Engineering and Technology with a CGPA of 7.37
2011: HSC from URC Palaniammal Matric Hr. Sec School with 79.8%
2009 - 2010: SSLC from URC Palaniammal Matric Hr. Sec School with 80%
I solemnly declare that all the information furnished is true to the best of my knowledge,
and I undertake responsibility for the correctness of the mentioned particulars.
Kirthiga M