Job description
JD GCP Data Engineering
2 to 4 years' experience in GCP Data Engineering.
Design, develop, and maintain data pipelines using GCP services.
Strong data engineering experience using Python, PySpark, or Spark on Google Cloud.
Should have worked on handling big data.
Strong communication skills.
Experience in Agile methodologies; ETL, ELT, data movement, and data processing skills.
Google Cloud Professional Data Engineer certification will be an added advantage.
Proven analytical skills and a problem-solving attitude.
Ability to effectively function in a cross-team environment.
Primary Skill
GCP data engineering
Programming experience in Python/PySpark, SQL, and Spark on GCP
GCS (Cloud Storage), Composer (Airflow), and BigQuery experience
Experience building data pipelines using the above skills
Pipeline development experience using Dataflow or Dataproc (Apache Beam, etc.); see the sketch after this list
Experience with GCP services and databases such as Cloud SQL, Datastore, Bigtable, Spanner, Cloud Run, Cloud Functions, etc.
Proven analytical skills and a problem-solving attitude.
Excellent Communication Skills
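To make the Dataflow/Dataproc item above concrete, here is a minimal Apache Beam (Python SDK) batch pipeline sketch. The bucket paths are placeholders, and running it on Dataflow would additionally require runner, project, region, and temp-location pipeline options.

```python
# Minimal Apache Beam batch pipeline sketch: read text from GCS,
# drop empty lines, uppercase the rest, and write the result back to GCS.
# Bucket names and paths below are placeholders.
import apache_beam as beam
from apache_beam.options.pipeline_options import PipelineOptions


def run():
    # Pass --runner=DataflowRunner plus project/region/temp_location flags
    # on the command line to execute on Dataflow instead of locally.
    options = PipelineOptions()
    with beam.Pipeline(options=options) as p:
        (
            p
            | "Read" >> beam.io.ReadFromText("gs://example-bucket/input/*.txt")
            | "DropEmpty" >> beam.Filter(lambda line: line.strip() != "")
            | "Upper" >> beam.Map(str.upper)
            | "Write" >> beam.io.WriteToText("gs://example-bucket/output/result")
        )


if __name__ == "__main__":
    run()
```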
Job description
Responsibilities:
• Strong development skills in Python.
• Writing effective and scalable Python code.
• Strong experience in processing data and drawing insights from large data sets
• Good familiarity with one or more libraries: pandas, NumPy, SciPy etc.
• In-depth knowledge of spaCy and similar NLP libraries like NLTK, textacy, etc. (see the sketch after this list).
• Experience with Python development environments, including but not limited to Jupyter and Google Colab notebooks, and visualization libraries such as Matplotlib, Plotly, and geoplotlib.
• Advanced working knowledge of SQL, experience with relational databases and query authoring, and working familiarity with a variety of databases.
• Experience performing root cause analysis on internal and external data and processes to answer specific
business questions and identify opportunities for improvement.
• Strong analytic skills related to working with unstructured datasets.
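As a small illustration of the pandas and spaCy skills listed above, the sketch below loads a few rows into a DataFrame and extracts named entities. The model name and example data are assumptions for the example.

```python
# Illustrative pandas + spaCy sketch: run named-entity recognition over a
# text column of a DataFrame. Requires the en_core_web_sm model to be
# installed separately (python -m spacy download en_core_web_sm).
import pandas as pd
import spacy

nlp = spacy.load("en_core_web_sm")  # small English pipeline

df = pd.DataFrame({"text": [
    "Google Cloud announced new BigQuery features in London.",
    "The analytics team in Chennai processes sales data daily.",
]})


def extract_entities(text):
    """Return (entity text, entity label) pairs found by spaCy's NER."""
    doc = nlp(text)
    return [(ent.text, ent.label_) for ent in doc.ents]


df["entities"] = df["text"].apply(extract_entities)
print(df[["text", "entities"]])
```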
Good to have some exposure to:
• Experience with setting up and maintaining a data warehouse (Google BigQuery, Redshift, Snowflake) and data lakes (GCS, AWS S3, etc.) for an organization
• Experience with relational SQL and NoSQL databases, including Postgres and Cassandra / MongoDB.
• Experience with data pipeline and workflow management tools: Airflow, Dataflow, Dataproc, etc.
• Exposure to any Business Intelligence (BI) tools like Tableau, Dundas, Power BI etc.
• Agile software development methodologies.
• Working in multi-functional, multi-location teams
4+ years' experience developing Big Data & Analytics solutions
Experience building data lake solutions leveraging Google Data Products (e.g. Dataproc, AI
Building Blocks, Looker, Cloud Data Fusion, Dataprep, etc.), Hive, Spark
Experience with relational SQL and NoSQL databases
Experience with Spark (Scala/Python/Java) and Kafka
Work experience using Databricks (Data Engineering and Delta Lake components)
Experience with source control tools such as GitHub and related dev process
Experience with workflow scheduling tools such as Airflow
In-depth knowledge of any scalable cloud vendor (GCP preferred)
Has a passion for data solutions
Strong understanding of data structures and algorithms
Strong understanding of solution and technical design
Has a strong problem-solving and analytical mindset
Experience working with Agile Teams.
Able to influence and communicate effectively, both verbally and in writing, with team members and business stakeholders
Able to quickly pick up new programming languages, technologies, and frameworks
Bachelor's degree in Computer Science
4+ Years of Experience in Data Engineering and building and maintaining large-scale data
pipelines.
Experience with designing and implementing a large-scale Data-Lake on Cloud Infrastructure
Strong technical expertise in Python, SQL, and shell scripting.
Extremely well-versed in Google Cloud Platform, including BigQuery, Cloud Storage, Cloud Composer, Dataproc, Dataflow, and Pub/Sub.
Experience with Big Data Tools such as Hadoop and Apache Spark (Pyspark)
Experience developing DAGs in Apache Airflow 1.10.x or 2.x (see the sketch after this list)
Good problem-solving skills
Detail-oriented
Strong analytical skills working with a large store of databases and tables
Ability to work with geographically diverse teams.
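As an illustration of the Airflow DAG development listed above, here is a minimal Airflow 2.x DAG sketch that loads daily files from Cloud Storage into BigQuery. The bucket, dataset, and table names are placeholders, and the operator assumes the apache-airflow-providers-google package is installed.

```python
# Minimal Airflow 2.x DAG sketch: one daily task that loads newline-delimited
# JSON from GCS into a BigQuery table. All resource names are placeholders.
from datetime import datetime

from airflow import DAG
from airflow.providers.google.cloud.transfers.gcs_to_bigquery import GCSToBigQueryOperator

with DAG(
    dag_id="gcs_to_bigquery_daily",
    start_date=datetime(2024, 1, 1),
    schedule_interval="@daily",
    catchup=False,
) as dag:
    load_events = GCSToBigQueryOperator(
        task_id="load_events",
        bucket="example-bucket",
        source_objects=["events/{{ ds }}/*.json"],  # partition folder per execution date
        destination_project_dataset_table="example_project.analytics.events",
        source_format="NEWLINE_DELIMITED_JSON",
        write_disposition="WRITE_APPEND",
    )
```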
Good to Have:
Certification in a GCP service.
Experience with Kubernetes.
Experience with Docker
Experience with CircleCI for deployment
Experience with Great Expectations.
Responsibilities:
Build data and ETL pipelines in GCP.
Support migration of data to the cloud using big data technologies like Spark, Hive, Talend, and Java (a PySpark sketch follows this list).
Interact with customers on a daily basis to ensure smooth engagement.
Responsible for timely and quality deliveries.
Fulfill organizational responsibilities: share knowledge and experience with other groups in the organization and conduct various technical training sessions.
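As a hedged illustration of the Spark-based migration work described above, the PySpark sketch below reads a Hive table, aggregates it, and writes the result to Cloud Storage as Parquet. The table and bucket names are placeholders, and it assumes a Hive metastore and GCS connector are available (as on Dataproc).

```python
# PySpark sketch: read a Hive table, compute a simple daily aggregate,
# and write the curated output to GCS. Resource names are placeholders.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = (
    SparkSession.builder
    .appName("hive-to-gcs-migration")
    .enableHiveSupport()  # requires a configured Hive metastore, e.g. on Dataproc
    .getOrCreate()
)

orders = spark.table("sales_db.orders")  # source Hive table (placeholder)

daily = (
    orders
    .groupBy("order_date")
    .agg(F.sum("amount").alias("total_amount"))  # simple aggregation example
)

# GCS output path is a placeholder; the GCS connector is preinstalled on Dataproc.
daily.write.mode("overwrite").parquet("gs://example-bucket/curated/daily_orders/")
```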
Job description
Greetings from HCL!
We are looking for a GCP Data Engineer for the Chennai location.
Experience: 4+ years
Skills:
Hands-on experience in Google Cloud (BigQuery)
Strong SQL programming knowledge and hands-on experience on real-time projects.
Good data analysis and problem-solving skills
Good communication skills and a quick learner
If you are interested, please share your resume with jyothiveerabh.akula@hcl.com
Roles and Responsibilities
In this role, the GCP Data Engineer is responsible for the following:
Design, develop, test, and implement technical solutions using GCP data technologies/tools.
Develop data solutions in distributed microservices and full stack systems.
Utilize programming languages like Python and Java, and GCP technologies like BigQuery, Dataproc, Dataflow, Cloud SQL, Cloud Functions, Cloud Run, Cloud Composer, Pub/Sub, and APIs (a Pub/Sub sketch follows this list).
Lead performance engineering and ensure the systems are scalable.
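As a small illustration of the Pub/Sub work referenced above, the sketch below publishes a JSON event with the Python client library. The project ID, topic ID, and message attribute are placeholders.

```python
# Minimal Cloud Pub/Sub publishing sketch using the official Python client.
# Project and topic IDs below are placeholders.
import json

from google.cloud import pubsub_v1

publisher = pubsub_v1.PublisherClient()
topic_path = publisher.topic_path("example-project", "orders-events")

event = {"order_id": 123, "status": "CREATED"}
future = publisher.publish(
    topic_path,
    data=json.dumps(event).encode("utf-8"),  # payload must be bytes
    source="checkout-service",               # optional message attribute
)
print("Published message id:", future.result())  # blocks until the publish completes
```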
Desired Candidate Profile
Technology & Engineering Expertise
Overall 5+ years of experience implementing data solutions using cloud/on-prem technologies.
At least 3 years of experience in data pipeline development using GCP cloud technologies.
Proficient in data ingestion, storage, and processing using GCP technologies like BigQuery, Dataproc, Dataflow, Cloud SQL, Cloud Functions, Cloud Run, Cloud Composer, Pub/Sub, and APIs
Proficient in pipeline development using ELT and ETL approaches.
Experience in Microservices implementations on GCP
Knowledge of master data management
Knowledge of Data Catalog, data governance, and data security
Excellent SQL skills
Must be Google certified.
Experience with different development methodologies (RUP | Scrum | XP)
Soft skills
Desired Candidate Profile
We are looking for a GCP Data Engineer, full time or part time.
Needs to be very strong in Python, GCP, Dataflow, BigQuery, data processing, ETL, and APIs.
Candidates with BE/BTech/MCA/MSc having the required experience. Tech stack: BigQuery, any ETL tool (Informatica, Talend, DataStage), Dataflow, Dataproc.
• 3-5 years' experience in data warehouse and data lake implementation
• 1-2 years of experience in Google Cloud Platform (especially BigQuery).
• 1-2 years of working experience converting ETL jobs (in Informatica/Talend/DataStage) into Dataflow or Dataproc and migrating them into a CI/CD pipeline
• Design, develop and deliver data integration/data extraction solutions using IBM DataStage or other ETL
tools and Data Warehouse platforms like Teradata, BigQuery.
• Proficiency in Linux/Unix shell scripting and SQL.
• Knowledge of data modelling, database design, and the data warehousing ecosystem.
• Ability to troubleshoot and solve complex technical problems.
• Excellent analytical and problem-solving skills.
• Knowledge of working in Agile environments.
Essential Skills
3+ years' experience developing large-scale data pipelines in at least one cloud: Azure, AWS, or GCP
Expertise in one or more of the following skills (database + ETL/pipeline + visualization/reporting). Azure: Synapse, ADF, HDInsight; AWS: Redshift, Glue, EMR
Highly proficient in one or more market-leading ETL tools like Informatica, DataStage, SSIS, Talend, etc.
Fundamental knowledge of data warehouse/data mart architecture and modelling
Define and develop data ingest, validation, and transform pipelines.
Fundamental knowledge of distributed data processing and storage
Fundamental knowledge of working with structured, unstructured, and semi-structured data
For the cloud data engineer role, experience with ETL/ELT patterns, preferably using Azure Data Factory and Databricks jobs
Nice to have: on-premise platform understanding covering one or more of the following: Teradata, Cloudera, Netezza, Informatica, DataStage, SSIS, BODS, SAS, Business Objects, Cognos, MicroStrategy, WebFocus, Crystal
Essential Qualification
BE/BTech in Computer Science, Engineering, or a relevant field
Responsibilities:
1. Data Migration: Collaborate with cross-functional teams to migrate data from various sources to
GCP. Develop and implement efficient data migration strategies, ensuring data integrity and security
throughout the process.
2. Data Pipeline Development: Design, develop, and maintain robust data pipelines that extract, transform, and load (ETL) data from different sources into GCP. Implement data quality checks (a minimal sketch follows this list) and ensure scalability, reliability, and performance of the pipelines.
3. Data Management: Build and maintain data models and schemas in GCP, ensuring optimal storage,
organization, and accessibility of data. Collaborate with data scientists and analysts to understand
their data requirements and provide solutions to meet their needs.
4. GCP Data Service Expertise: Utilize your deep understanding of GCP data services, including BigQuery, Dataproc, and other relevant big data services, to architect and implement efficient and scalable data solutions. Stay up to date with the latest advancements in GCP data services and recommend innovative approaches to leverage them.
5. Performance Optimization: Identify and resolve performance bottlenecks within the data pipelines
and GCP data services. Optimize queries, job configurations, and data processing techniques to
improve overall system efficiency.
6. Data Governance and Security: Implement data governance policies, access controls, and data
security measures to ensure compliance with regulatory requirements and protect sensitive data.
Monitor and troubleshoot data-related issues, ensuring high availability and reliability of data
systems.
7. Documentation and Collaboration: Create comprehensive technical documentation, including data flow diagrams, system architecture, and standard operating procedures. Collaborate with cross-functional teams, including data scientists, analysts, and software engineers, to understand their requirements and provide technical expertise.
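As a minimal illustration of the data quality checks mentioned in responsibility 2, the sketch below uses the BigQuery Python client to validate a loaded table after an ETL run. The project, dataset, table, and column names are placeholders.

```python
# Simple post-load data quality check sketch with the BigQuery client:
# fail the run if the target table is empty or contains NULL join keys.
# All resource names below are placeholders.
from google.cloud import bigquery

client = bigquery.Client(project="example-project")

query = """
    SELECT COUNTIF(customer_id IS NULL) AS null_keys,
           COUNT(*) AS row_count
    FROM `example-project.analytics.orders`
"""
row = next(iter(client.query(query).result()))  # single summary row

if row.row_count == 0:
    raise ValueError("Quality check failed: target table is empty")
if row.null_keys > 0:
    raise ValueError(f"Quality check failed: {row.null_keys} rows with NULL customer_id")
print(f"Quality check passed for {row.row_count} rows")
```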