
102 Hadoop Developer Jobs in Pune

posted 1 month ago

Data Engineer

CAPGEMINI TECHNOLOGY SERVICES INDIA LIMITED
experience: 6 to 11 Yrs
location
Pune, Bangalore, Chennai

skills
  • sql
  • scala
  • cloud
  • django
  • hadoop
  • python
  • flask
  • devops
  • pyspark
Job Description
Data Engineer - total experience should be 6+ years, with at least 3 years in PySpark.
- Strong programming experience; Python, PySpark, and Scala are preferred.
- Experience in designing and implementing CI/CD, build management, and development strategy.
- Experience with SQL and SQL analytical functions; experience participating in key business, architectural, and technical decisions.
- Scope to get trained on AWS cloud technology.
- Proficient in leveraging Spark for distributed data processing and transformation.
- Skilled in optimizing data pipelines for efficiency and scalability.
- Experience with real-time data processing and integration.
- Familiarity with Apache Hadoop ecosystem components.
- Strong problem-solving abilities in handling large-scale datasets.
- Ability to collaborate with cross-functional teams and communicate effectively with stakeholders.
Primary skills: PySpark, SQL. Secondary skill: experience on AWS/Azure/GCP would be an added advantage.
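To make the Spark and pipeline-optimization requirements above concrete, here is a minimal PySpark sketch of the kind of batch transformation this listing describes; the paths, column names, and partitioning scheme are hypothetical.

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

# Hypothetical pipeline: clean raw orders, then aggregate daily revenue.
spark = SparkSession.builder.appName("orders-etl").getOrCreate()

orders = (
    spark.read.option("header", True).csv("s3://example-bucket/raw/orders/")  # assumed input
    .withColumn("amount", F.col("amount").cast("double"))
    .filter(F.col("amount").isNotNull())
)

daily_revenue = (
    orders.groupBy(F.to_date("order_ts").alias("order_date"))
    .agg(F.sum("amount").alias("revenue"), F.count("*").alias("order_count"))
)

# Writing partitioned Parquet keeps downstream scans cheap, one common answer
# to the "optimizing data pipelines for efficiency and scalability" bullet.
daily_revenue.write.mode("overwrite").partitionBy("order_date").parquet(
    "s3://example-bucket/curated/daily_revenue/"
)
```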

posted 3 weeks ago
experience: 1 to 5 Yrs
location
Pune, All India
skills
  • Java
  • Spring boot
  • J2EE
  • Spring Framework
  • REST
  • SOAP
  • Tomcat
  • JETTY
  • JBOSS
  • Relational Databases
  • Oracle
  • Informix
  • MySQL
  • Kafka
  • WMQ
  • Active MQ
  • RabbitMQ
  • JUnit
  • TestNG
  • Selenium
  • XML
  • JSON
  • Cloud
  • Azure
  • AWS
  • Data structures
  • Micro services
  • Java Application Servers
  • TOMEE
  • Messaging Systems
  • Testing frameworks
  • Web Services APIs
  • CICD
  • Algorithm
Job Description
As a Java Developer at our company, you will be responsible for the following:
- Developing REST and SOAP web services and microservices using Java and the Spring framework.
- Having a good understanding and working knowledge of one or more Java application servers such as Tomcat, TomEE, Jetty, JBoss, or WAS.
- Working with relational databases like Oracle, Informix, and MySQL.
- Using messaging systems like Kafka, WMQ, ActiveMQ, and RabbitMQ.
- Implementing testing frameworks like JUnit, TestNG, and Selenium.
- Working with web services APIs including REST, SOAP, XML, and JSON.
Qualifications required for this role include:
- Strong background in Java, J2EE, Spring Framework, and Spring Boot development.
- Minimum one year of experience with cloud platforms such as Azure or AWS.
- Exposure to modern deployment and CI/CD processes.
- Excellent programming skills, including data structures and algorithms.
If you join our team, you will have the opportunity to work with cutting-edge technologies and contribute to the development of innovative solutions.
posted 2 months ago
experience: 3 to 8 Yrs
location
Pune, Maharashtra
skills
  • Apache Spark
  • Scala
  • Big Data Hadoop Ecosystem
  • SparkSQL
Job Description
Role Overview: As a Scala Spark Developer, you will be responsible for leveraging your expertise in Apache Spark, the Big Data Hadoop ecosystem, SparkSQL, and Scala to design and develop big data platforms. Your deep understanding of modern data processing technology stacks and streaming data architectures will be pivotal in enabling real-time and low-latency data processing.
Key Responsibilities:
- 5+ years of Scala and Spark experience, with a focus on data engineering
- Hands-on experience in the design and development of big data platforms
- Expertise in Spark, HBase, Hive, and other Hadoop ecosystem technologies for development using Scala
- Understanding and implementing streaming data architectures and technologies for real-time and low-latency data processing
- Applying agile development methods and principles, including Continuous Integration/Delivery
- Experience with NoSQL technologies and a passion for software craftsmanship
- Strong knowledge of Spark internals, configuration, memory management, Scala, and Databricks
Qualifications Required:
- 5+ years of experience in Scala and Spark
- Expertise in data engineering
- Deep understanding of modern data processing technology stacks
- Hands-on experience with Spark, HBase, Hive, and other Hadoop ecosystem technologies
- Proficiency in Scala and SparkSQL
- Experience with agile development methods
- Understanding of Continuous Integration/Delivery principles
- Familiarity with NoSQL technologies is a plus
- Passion for software craftsmanship
- Experience in the financial industry is a plus
Please share your updated resume to preethi.r@ltimindtree.com to apply for this role.
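The real-time and low-latency requirement above usually maps to Spark Structured Streaming. Below is a minimal PySpark sketch of that pattern (the role itself is Scala-centric); the broker, topic, and event schema are assumptions, and the Kafka source requires the spark-sql-kafka connector on the classpath.

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("events-stream").getOrCreate()

# Read a hypothetical Kafka topic as an unbounded DataFrame.
events = (
    spark.readStream.format("kafka")
    .option("kafka.bootstrap.servers", "broker:9092")  # assumed broker
    .option("subscribe", "events")                     # assumed topic
    .load()
)

# Count events per 1-minute window; the watermark bounds state for late data.
counts = (
    events.select(F.col("timestamp"))
    .withWatermark("timestamp", "5 minutes")
    .groupBy(F.window("timestamp", "1 minute"))
    .count()
)

query = counts.writeStream.outputMode("update").format("console").start()
query.awaitTermination()
```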

posted 1 week ago
experience: 5 to 9 Yrs
location
Pune, Maharashtra
skills
  • Hive
  • Hadoop
  • SQL
  • Excel
  • Tableau
  • Power BI
  • Google Analytics
  • Adobe Analytics
  • Data visualization
  • Stakeholder management
  • Project management
  • Mentoring
  • Data analysis
  • Communication
  • Big Data systems
  • Spark (Python)
Job Description
Role Overview: You will be a part of Citi Analytics Information Management, a global community that connects and analyzes information to create actionable intelligence for business leaders. As a member of this fast-growing organization, you will be responsible for developing and maintaining reporting systems, collaborating with cross-functional teams, interpreting data to provide insights, and managing end-to-end project communications.
Key Responsibilities:
- Develop and maintain reporting systems to track key performance metrics, collaborating with cross-functional teams for accurate and timely delivery of reports and dashboards.
- Rationalize, enhance, transform, and automate reports as required, performing ad hoc and root cause analysis to address specific challenges.
- Interpret data to identify trends, patterns, and anomalies, providing insights to stakeholders for informed decision-making.
- Translate data into customer behavioral insights for targeting and segmentation strategies, effectively communicating with business partners and senior leaders.
- Collaborate and manage project communication with onsite business partners and the team in India, leading projects and mentoring a team of analysts.
- Ensure data accuracy and consistency by following standard control procedures and adhering to Citi's Risk and Control guidelines.
Qualifications Required:
- 5+ years of experience in Big Data systems, Hive, Hadoop, Spark (Python), and cloud-based data management technologies.
- Proficiency in SQL, Excel, and data visualization tools like Tableau, Power BI, or similar software.
- Knowledge of digital channels, marketing, and tools used for audience engagement.
- Expertise in Google Analytics/Adobe Analytics for tracking and reporting website traffic and journey analytics.
- Strong background in reporting and data analysis, with excellent communication and stakeholder management skills.
- Ability to create presentations and present reports, findings, and recommendations to diverse audiences.
- Proven ability to manage projects, mentor teams, and contribute to organizational initiatives.
- Bachelor's degree in computer science, engineering, or a related field.
Additional Company Details: Citi Analytics Information Management was established in 2003, with locations across multiple cities in India including Bengaluru, Chennai, Gurgaon, Mumbai, and Pune. The function aims to balance customer needs, business strategy, and profit objectives using best-in-class analytic methodologies.
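For flavor, here is a minimal PySpark sketch of the kind of Hive-backed reporting query this role describes; the database, table, and column names are hypothetical.

```python
from pyspark.sql import SparkSession

# Hive-backed reporting: requires a Spark session with Hive support enabled.
spark = SparkSession.builder.appName("kpi-report").enableHiveSupport().getOrCreate()

# Hypothetical table: analytics.web_events(visit_date, channel, user_id, converted).
weekly_kpis = spark.sql("""
    SELECT channel,
           COUNT(DISTINCT user_id)                    AS visitors,
           SUM(CASE WHEN converted THEN 1 ELSE 0 END) AS conversions
    FROM analytics.web_events
    WHERE visit_date >= date_sub(current_date(), 7)
    GROUP BY channel
    ORDER BY visitors DESC
""")

# Hand the aggregate off to a BI tool (Tableau/Power BI) as a small extract.
weekly_kpis.toPandas().to_csv("weekly_channel_kpis.csv", index=False)
```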
posted 1 week ago
experience: 5 to 9 Yrs
location
Pune, Maharashtra
skills
  • Hive
  • Hadoop
  • SQL
  • Excel
  • Tableau
  • Power BI
  • Google Analytics
  • Adobe Analytics
  • Data visualization
  • Stakeholder management
  • Project management
  • Mentoring
  • Data analysis
  • Communication
  • Big Data systems
  • Spark (Python)
Job Description
As a member of Citi Analytics Information Management, you will play a crucial role in developing and maintaining reporting systems to track key performance metrics aligned with the organization's goals. Your responsibilities will include collaborating with cross-functional teams to ensure accurate and timely delivery of reports and dashboards. Additionally, you will rationalize, enhance, transform, and automate reports as required, while performing ad hoc analysis and root cause analysis to address specific reporting challenges.
You will be expected to interpret data to identify trends, patterns, and anomalies, providing valuable insights to stakeholders to support informed decision-making. Your role will involve translating data into customer behavioral insights to drive targeting and segmentation strategies. Effective communication skills will be essential, as you will be required to communicate clearly with business partners and senior leaders.
Furthermore, you will collaborate individually and manage end-to-end project communication with onsite business partners and the team in India. Your leadership skills will be put to the test as you lead projects and mentor a team of analysts to maintain a high standard of work. It will be crucial to ensure data accuracy and consistency by following standard control procedures and adhering to Citi's Risk and Control guidelines.
To excel in this role, you should possess 5+ years of experience in Big Data systems, Hive, Hadoop, Spark (Python), and cloud-based data management technologies. Proficiency in SQL, Excel, and data visualization tools such as Tableau, Power BI, or similar software is required. Knowledge of digital channels, marketing methods, and tools used by businesses to engage with their audience is essential. Additionally, expertise in using Google Analytics or Adobe Analytics to track and report website traffic, funnel performance, and journey analytics is preferred.
Your educational background should include a Bachelor's degree in computer science, engineering, or a related field. Strong communication and stakeholder management skills are a must, as well as the ability to create presentations and present reports, findings, and recommendations to diverse audiences. Your proven ability to manage projects and mentor a team will be a valuable asset in this role.
This job description provides an overview of the responsibilities and expertise required for the position. Other duties may be assigned as needed.
Position: C11
Job Family Group: Decision Management
Job Family: Data/Information Management
Time Type: Full time
Most Relevant Skills:
- 5+ years of experience in Big Data systems, Hive, Hadoop, Spark (Python), and cloud-based data management technologies
- Proficiency in SQL, Excel, and data visualization tools such as Tableau, Power BI, or similar software
- Knowledge of digital channels, marketing methods, and tools used by businesses to engage with their audience
- Expertise in using Google Analytics or Adobe Analytics
- Strong communication and stakeholder management skills
- Project management and team mentoring experience
Preferred Qualifications:
- Exposure to digital business and expertise in Adobe Site Catalyst and clickstream data
If you require a reasonable accommodation to use our search tools or apply for a career opportunity due to a disability, please review Accessibility at Citi.
posted 2 weeks ago
experience: 7 to 11 Yrs
location
Pune, Maharashtra
skills
  • Big Data
  • Hadoop
  • Spark
  • GCP
  • AWS
  • Azure
  • Snowflake
  • Cloud technologies
  • Databricks
Job Description
At Atgeir Solutions, you have the opportunity to join as a dynamic Technical Lead specializing in Big Data and Cloud technologies, with a clear growth path towards becoming a Technical Architect.
**Key Responsibilities:**
- **Technical Expertise:** Utilize your deep knowledge and hands-on experience in Big Data and Cloud technologies to contribute to system design, development, and implementation.
- **Leadership:** Lead and inspire a team of professionals, offering technical guidance and mentorship to create a collaborative and innovative project environment.
- **Problem Solving:** Address complex technical challenges strategically, providing innovative solutions and guiding the team in overcoming obstacles in Big Data and Cloud environments.
- **Team Development:** Invest in the growth of team members by identifying training needs, conducting knowledge-sharing sessions, and fostering a culture of continuous learning.
- **Collaboration:** Work closely with stakeholders, including clients, to understand requirements and translate them into technical solutions. Align technology strategies with business goals in the realm of Big Data and Cloud technologies.
**Qualifications:**
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
- 7-10 years of software development experience with a proven track record of technical leadership in Big Data and Cloud environments.
- Strong proficiency in technologies, frameworks, and programming languages related to Big Data (e.g., Hadoop, Spark) and Cloud platforms (e.g., GCP, AWS, Azure).
- Proficiency with Databricks/Snowflake will be an added advantage.
- Excellent communication and interpersonal skills to convey complex technical concepts to both technical and non-technical stakeholders.
- Demonstrated ability to lead and mentor teams, fostering a positive and collaborative work environment.
In addition, at Atgeir Solutions we are dedicated to achieving excellence in every endeavor, making us pioneers in technological innovation.
posted 2 weeks ago
experience: 5 to 9 Yrs
location
Pune, All India
skills
  • ETL
  • APIs
  • JSON
  • Avro
  • Glue
  • Snowflake
  • data modeling
  • data quality
  • data integration
  • data governance
  • application design
  • architecture modeling
  • data analysis
  • distributed systems
  • DBT
  • PCI DSS
  • tokenization
  • encryption
  • Parquet
  • AWS services
  • S3
  • Databricks
  • Hadoop ecosystem
  • Delta Lake
  • Medallion architecture
  • data design patterns
  • database technologies
  • RDBMS
  • NoSQL databases
  • PA DSS
Job Description
As a Sr. Data Engineer/Architect at Barclays, you will play a vital role in driving innovation and excellence in the digital landscape. You will utilize cutting-edge technology to enhance digital offerings, ensuring exceptional customer experiences. Working alongside a team of engineers, business analysts, and stakeholders, you will tackle complex technical challenges that require strong analytical skills and problem-solving abilities.
**Key Responsibilities:**
- Experience and understanding of ETL, APIs, and various data formats (JSON, Avro, Parquet), plus experience documenting and maintaining interface inventories.
- Deep understanding of AWS services (e.g., Glue, S3), related platforms (Databricks, Snowflake), and the Hadoop ecosystem for data processing and storage.
- Familiarity with Databricks, Delta Lake, and the Medallion architecture for advanced analytics and fraud detection use cases.
- Build logical and physical data models, enforce data quality, and integrate data across multiple systems.
- Data design and requirements analysis: able to apply data design patterns and frameworks, with working knowledge of schemas and normalization.
- Experience preparing architecture vision documents and data flow diagrams, and maintaining auditable governance documentation.
- Understands user requirement gathering to define data flow, model, and design.
- Knowledge of the basic activities and deliverables of application design; ability to utilize application design methodologies, tools, and techniques to convert business requirements and logical models into a technical application design.
- Knowledge of architecture modelling; ability to develop and modify enterprise architecture through conceptual, logical, and physical approaches.
- Knowledge of data, processes, and events; ability to use tools and techniques for analyzing and documenting logical relationships among data, processes, or events.
- Knows the tools and techniques used for data governance. Understands the relevance of following, creating, and improving policies to ensure data is secure, including data privacy (e.g., token generation).
- Knowledge of the right platform for data transmission, ensuring cloud and on-prem servers are appropriately used and that cost is considered when choosing between cloud and on-prem platforms.
- Knowledge of databases and the latest updates, to help provide the right tools and design.
- Proficient in communicating data standards and demonstrating their value to the wider audience.
**Qualifications Required:**
- Educated to degree or MBA level to meet the intellectual demands of the job, or able to demonstrate equivalent experience.
- Good understanding of distributed systems and databases.
- Good understanding of DBT (Data Build Tool).
- Good understanding of AWS database technologies, e.g., Databricks, Snowflake.
- Knowledge of PCI DSS and PA DSS, tokenization, and encryption.
- Understands basic features of RDBMS and NoSQL databases.
The role is based in Pune. In this role, you will build and maintain data architecture pipelines, design and implement data warehouses and data lakes, develop processing and analysis algorithms, and collaborate with data scientists to deploy machine learning models. Your responsibilities also include advising on decision making, contributing to policy development, and ensuring operational effectiveness. All colleagues at Barclays are expected to demonstrate the Barclays Values of Respect, Integrity, Service, Excellence, and Stewardship. Additionally, adherence to the Barclays Mindset to Empower, Challenge and Drive is crucial for creating a culture of excellence and integrity within the organization.
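As a rough illustration of the Medallion pattern mentioned above, here is a minimal PySpark sketch promoting raw JSON into a cleaned Delta table; the paths, schema, and the Delta-enabled session are assumptions (typical on Databricks, or with delta-spark installed locally).

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

# Assumes a session with the Delta Lake extensions configured.
spark = SparkSession.builder.appName("medallion-demo").getOrCreate()

# Bronze: land raw JSON events as-is.
bronze = spark.read.json("s3://example-bucket/landing/transactions/")
bronze.write.format("delta").mode("append").save("s3://example-bucket/bronze/transactions/")

# Silver: typed, deduplicated, quality-filtered records.
silver = (
    spark.read.format("delta").load("s3://example-bucket/bronze/transactions/")
    .withColumn("amount", F.col("amount").cast("decimal(18,2)"))
    .filter(F.col("transaction_id").isNotNull())
    .dropDuplicates(["transaction_id"])
)
silver.write.format("delta").mode("overwrite").save("s3://example-bucket/silver/transactions/")
```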
posted 2 months ago
experience: 3 to 8 Yrs
location
Pune, Maharashtra
skills
  • Apache Spark
  • Python
  • AWS
  • Snowflake
  • System Design
  • Big Data Hadoop Ecosystem
  • SparkSQL
  • CICD
Job Description
Role Overview: As a PySpark Developer, you will be responsible for developing and maintaining PySpark applications. You will work with various technologies within the Big Data ecosystem, such as Apache Spark, Hadoop, and SparkSQL. Additionally, you will be expected to have a good understanding of AWS and Snowflake, as well as experience with CI/CD and system design. Candidates with prior experience in technologies related to fund transfer AML will have an added advantage. Strong communication skills, the ability to work under pressure, and a proactive attitude towards learning are essential for this role.
Key Responsibilities:
- Develop and maintain PySpark applications using Apache Spark and Hadoop
- Work with technologies like SparkSQL, Python, and PL/SQL
- Have a good understanding of AWS and Snowflake
- Implement CI/CD practices and contribute to system design
- Utilize your knowledge of fund transfer AML technologies, if applicable
- Communicate effectively with team members and stakeholders
- Adapt to multitasking and working under strict deadlines
- Collaborate with various internal systems and stakeholders
Qualifications Required:
- 3-8 years of experience in Big Data technologies, specifically PySpark, Hadoop, and SparkSQL
- Proficiency in Apache Spark, the Big Data Hadoop ecosystem, SparkSQL, and Python
- Familiarity with AWS and Snowflake
- Strong understanding of CI/CD practices and system design
- Prior experience in fund transfer AML technologies is a plus
- Excellent written and oral communication skills
- Proactive and self-motivated with quick learning abilities
- Ability to multitask and thrive in a fast-paced environment
- Capability to collaborate with multiple stakeholders
Please note that interested candidates are requested to share their updated resume to preethi.r@ltimindtree.com.
posted 1 day ago
experience: 8 to 14 Yrs
location
Pune, Maharashtra
skills
  • Python
  • Apache Spark
  • Hadoop
  • AWS
  • Azure
  • GCP
  • SQL
  • Oracle
  • PostgreSQL
  • Docker
  • Kubernetes
  • Data Pipeline Development
  • Big Data Infrastructure
  • Apache Iceberg
  • Apache Hudi
  • Trino
  • Apache Airflow
  • Prefect
Job Description
As an Applications Development Senior Programmer Analyst at our company, you will play a crucial role in establishing and implementing new or revised application systems and programs in coordination with the Technology team. Your main responsibilities will involve conducting feasibility studies, providing IT planning, developing new applications, and offering user support. You will need to leverage your specialty knowledge to analyze complex problems, recommend security measures, and consult with users on advanced programming solutions. Additionally, you will be responsible for ensuring operating standards are followed and acting as an advisor to junior analysts.
**Key Responsibilities:**
- Conduct feasibility studies, time and cost estimates, and risk analysis for applications development
- Monitor and control all phases of the development process including analysis, design, testing, and implementation
- Provide user support and operational assistance on applications
- Analyze complex problems and provide evaluation of business and system processes
- Recommend and develop security measures for successful system design
- Consult with users on advanced programming solutions and assist in system installations
- Define operating standards and processes and serve as an advisor to junior analysts
**Qualifications:**
- 8-14 years of relevant experience
- Strong experience in systems analysis and programming
- Proven track record in managing and implementing successful projects
- Working knowledge of consulting and project management techniques
- Ability to work under pressure and manage deadlines effectively
- Proficiency in the Python programming language
- Expertise in data processing frameworks like Apache Spark and Hadoop
- Experience with cloud data platforms such as AWS, Azure, or GCP
- Strong knowledge of SQL and database technologies
- Familiarity with data orchestration tools like Apache Airflow or Prefect
- Experience with containerization technologies like Docker and Kubernetes would be a plus
In addition to the above responsibilities and qualifications, please note that this job description provides a high-level overview of the work performed. Additional duties may be assigned as required. If you are a person with a disability and require a reasonable accommodation to apply for this role, please review the Accessibility at Citi information.
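Since the listing calls out orchestration tools like Apache Airflow, here is a minimal sketch of a daily DAG wiring an extract step to a transform step, assuming Airflow 2.x; the task logic and schedule are hypothetical.

```python
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator


def extract(**context):
    # Placeholder: pull the day's records from a source system.
    print("extracting for", context["ds"])


def run_spark_job(**context):
    # Placeholder: a real DAG might use SparkSubmitOperator here instead.
    print("transforming partition", context["ds"])


with DAG(
    dag_id="daily_etl",
    start_date=datetime(2024, 1, 1),
    schedule_interval="@daily",
    catchup=False,
) as dag:
    extract_task = PythonOperator(task_id="extract", python_callable=extract)
    transform_task = PythonOperator(task_id="transform", python_callable=run_spark_job)

    extract_task >> transform_task
```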
posted 2 months ago
experience: 3 to 7 Yrs
location
Pune, Maharashtra
skills
  • Python
  • ETL
  • Hadoop
  • EMR
  • SQL
  • Git
  • Airflow
  • Oozie
  • DevOps
  • Kafka
  • Docker
  • Kubernetes
  • PySpark
  • Databricks
  • Luigi
  • AWS EMR
  • Azure Databricks
  • GCP DataProc
  • CICD
  • Spark Streaming
Job Description
As a skilled and proactive Python/PySpark Developer at our company, you will join our data engineering or analytics team. Your primary responsibility will be to build scalable data pipelines, perform large-scale data processing, and collaborate with data scientists, analysts, and business stakeholders.
Key Responsibilities:
- Design, develop, and optimize ETL data pipelines using PySpark on big data platforms (e.g., Hadoop, Databricks, EMR).
- Write clean, efficient, and modular code in Python for data processing and integration tasks.
- Work with large datasets to extract insights, transform raw data, and ensure data quality.
- Collaborate with cross-functional teams to understand business requirements and translate them into technical solutions.
- Implement performance tuning and debugging of PySpark jobs.
- Monitor and troubleshoot data workflows and batch jobs in production environments.
- Document solutions and maintain code repositories (e.g., Git).
Required Skills & Qualifications:
- Proficient in Python with experience in building data-centric applications.
- Strong experience with PySpark and understanding of Spark internals (RDDs, DataFrames, Spark SQL).
- Hands-on experience with the Hadoop ecosystem, Hive, or cloud-based big data platforms like AWS EMR, Azure Databricks, or GCP Dataproc.
- Familiarity with workflow orchestration tools like Airflow, Oozie, or Luigi.
- Good understanding of SQL and relational databases.
- Experience with version control systems like Git.
- Strong problem-solving skills and ability to work independently or in a team.
- Bachelor's degree in Computer Science, Engineering, or a related field.
Preferred Qualifications:
- Experience with CI/CD pipelines and DevOps practices.
- Knowledge of data warehousing and data modeling.
- Exposure to streaming technologies such as Kafka and Spark Streaming.
- Familiarity with containerization tools like Docker or Kubernetes.
At Virtusa, we embody values like teamwork, quality of life, and professional and personal development. When you join our team of 27,000 people globally, you become part of a community that cares about your growth and provides you with exciting projects and opportunities to work with state-of-the-art technologies throughout your career. We value collaboration and the team environment, and seek to provide a dynamic place for great minds to nurture new ideas and foster excellence.
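The "performance tuning of PySpark jobs" bullet above often comes down to join strategy and partitioning. The following is a small, hypothetical sketch of two common techniques; the tables, key column, and partition count are assumptions.

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("tuning-demo").getOrCreate()

facts = spark.read.parquet("s3://example-bucket/facts/")       # large table (assumed)
dims = spark.read.parquet("s3://example-bucket/dim_country/")  # small lookup (assumed)

# 1. Broadcast the small dimension so the join avoids a full shuffle.
joined = facts.join(F.broadcast(dims), on="country_code", how="left")

# 2. Repartition by the aggregation key before a heavy groupBy to
#    balance work across executors and reduce skew.
result = (
    joined.repartition(200, "country_code")
    .groupBy("country_code")
    .agg(F.sum("amount").alias("total_amount"))
)

# Inspect the physical plan to confirm the broadcast join was chosen.
result.explain()
```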
posted 3 days ago

Cloud Data Engineer

Hitachi Careers
experience: 6 to 10 Yrs
location
Pune, Maharashtra
skills
  • SQL
  • Python
  • Hadoop
  • Spark
  • AWS
  • Azure
  • GCP
  • Data governance
  • Data security
  • Compliance
  • ETL processes
Job Description
As a Data Engineer at the company, you will be responsible for designing, implementing, and maintaining the data infrastructure and pipelines necessary for AI/ML model training and deployment. Working closely with data scientists and engineers, you will ensure that data is clean, accessible, and efficiently processed.
Key Responsibilities:
- Build and maintain scalable data pipelines for data collection, processing, and analysis.
- Ensure data quality and consistency for training and testing AI models.
- Collaborate with data scientists and AI engineers to provide the required data for model development.
- Optimize data storage and retrieval to support AI-driven applications.
- Implement data governance practices to ensure compliance and security.
Qualifications Required:
- 6-8 years of experience in data engineering, preferably in financial services.
- Strong proficiency in SQL, Python, and big data technologies (e.g., Hadoop, Spark).
- Experience with cloud platforms (e.g., AWS, Azure, GCP) and data warehousing solutions.
- Familiarity with ETL processes and tools, as well as knowledge of data governance, security, and compliance best practices.
At GlobalLogic, you will experience a culture of caring where people come first. You will be part of an inclusive culture of acceptance and belonging, building meaningful connections with collaborative teammates, supportive managers, and compassionate leaders. The company is committed to your continuous learning and development, offering opportunities to try new things, sharpen your skills, and advance your career. You will work on projects that matter, engage your curiosity and problem-solving skills, and contribute to cutting-edge solutions shaping the world today. GlobalLogic supports balance and flexibility in work and life, providing various career areas, roles, and work arrangements. Joining GlobalLogic means being part of a high-trust organization where integrity is key, and trust is fundamental to relationships with employees and clients.
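As a small illustration of the "data quality and consistency" responsibility, here is a hypothetical PySpark validation pass that gates a training dataset; the thresholds and column names are assumptions.

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("dq-checks").getOrCreate()

df = spark.read.parquet("s3://example-bucket/features/")  # assumed input

total = df.count()
checks = {
    # No more than 1% of label values may be null.
    "label_nulls_ok": df.filter(F.col("label").isNull()).count() <= total * 0.01,
    # Primary key must be unique.
    "ids_unique": df.select("customer_id").distinct().count() == total,
    # Amounts must be non-negative.
    "amounts_non_negative": df.filter(F.col("amount") < 0).count() == 0,
}

failed = [name for name, ok in checks.items() if not ok]
if failed:
    raise ValueError(f"Data quality checks failed: {failed}")
print("All data quality checks passed; dataset cleared for training.")
```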
posted 2 weeks ago
experience: 8 to 12 Yrs
location
Pune, Maharashtra
skills
  • ETL development
  • ETL tools (Informatica, Talend, SSIS, DataStage)
  • Data pipeline orchestration tools (Apache Airflow, Azure Data Factory)
  • Cloud platforms (AWS, Azure, GCP)
  • Databases (SQL, NoSQL)
  • Big data ecosystems (Hadoop, Spark)
  • Monitoring tools (Splunk, Dynatrace, Prometheus, Grafana)
  • ITSM platforms (ServiceNow, JIRA)
  • CICD
  • DevOps practices
  • Process automation
  • Performance tuning
  • Communication skills
  • Stakeholder management
  • Team leadership
  • Mentoring
  • Compliance management
  • Automation
  • Scripting
  • Metadata management
  • Master data management
  • Technical solutioning
  • Leading large-scale data initiatives
  • RFP solutioning
  • Client proposals
  • Technical strategy alignment
  • Data management standards
  • Emerging technologies evaluation
  • Legacy systems modernization
  • Security standards
  • Audit standards
  • Regulatory standards
  • Self-healing mechanisms
  • Workflow orchestration
  • Lineage tracking
Job Description
As a candidate for this position at Nagarro, you will be responsible for leading large-scale data initiatives and RFP responses, with over 7.5 years of experience in data operations, ETL development, and technical solutioning. Your role will involve hands-on experience with ETL tools such as Informatica, Talend, and SSIS, and data pipeline orchestration tools like Apache Airflow and Azure Data Factory. Exposure to cloud platforms (AWS, Azure, GCP), databases (SQL, NoSQL), and big data ecosystems (Hadoop, Spark) is essential. Additionally, familiarity with monitoring tools (e.g., Splunk, Dynatrace, Prometheus, Grafana), ITSM platforms (e.g., ServiceNow, JIRA), CI/CD, DevOps practices, and monitoring tools for data environments is required.
Key Responsibilities:
- Lead the technical and business solutioning of RFPs and client proposals related to data engineering, data operations, and platform modernization.
- Collaborate with architecture, governance, and business teams to align on technical strategy and data management standards.
- Evaluate emerging technologies and frameworks to modernize legacy systems and improve efficiency.
- Mentor and guide a team of data engineers and operations analysts, fostering a culture of technical excellence and continuous improvement.
- Act as a primary liaison with business, support, and governance teams for operational matters.
- Drive automation, self-healing mechanisms, and process automation across data operations to reduce manual interventions and improve system resilience.
- Implement and enforce best practices including metadata management, lineage tracking, data quality monitoring, and master data management.
Qualifications:
- Bachelor's or master's degree in Computer Science, Information Technology, or a related field.
Please note that Nagarro is a Digital Product Engineering company with a dynamic and non-hierarchical work culture, comprising over 17,500 experts across 39 countries. Join us in building products, services, and experiences that inspire, excite, and delight.
posted 2 weeks ago

Sr. Big Data Engineer

Facile Services
experience: 5 to 9 Yrs
location
Pune, Maharashtra
skills
  • Hadoop
  • Apache Spark
  • EMR
  • Athena
  • Glue
  • Python
  • JSON
  • NoSQL
  • Databricks
  • Delta Tables
  • PySpark
  • AWS data analytics services
  • Parquet file format
  • RDBMS databases
Job Description
As a Big Data Engineer, you will play a crucial role in developing and managing the Big Data solutions for our company. Your responsibilities will include designing and implementing Big Data tools and frameworks, implementing ELT processes, collaborating with development teams, building cloud platforms, and maintaining the production system.
Key Responsibilities:
- Meet with managers to determine the company's Big Data needs.
- Develop Big Data solutions on AWS using tools such as Apache Spark, Databricks, Delta Tables, EMR, Athena, Glue, and Hadoop.
- Load disparate data sets and conduct pre-processing services using Athena, Glue, Spark, etc.
- Collaborate with software research and development teams.
- Build cloud platforms for the development of company applications.
- Maintain production systems.
Qualifications Required:
- 5+ years of experience as a Big Data Engineer.
- Proficiency in Python and PySpark.
- In-depth knowledge of Hadoop, Apache Spark, Databricks, Delta Tables, and AWS data analytics services.
- Extensive experience with Delta Tables and the JSON and Parquet file formats.
- Experience with AWS data analytics services like Athena, Glue, Redshift, and EMR.
- Familiarity with data warehousing is a plus.
- Knowledge of NoSQL and RDBMS databases.
- Good communication skills.
- Ability to solve complex data processing and transformation related problems.
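To give a feel for the AWS analytics stack this role leans on, here is a minimal boto3 sketch that runs an Athena query over a Glue-catalogued table; the region, database, table, and S3 locations are hypothetical.

```python
import time

import boto3

athena = boto3.client("athena", region_name="ap-south-1")  # assumed region

# Query a hypothetical Glue-catalogued table backed by Parquet files on S3.
execution = athena.start_query_execution(
    QueryString="SELECT event_type, COUNT(*) AS n FROM events GROUP BY event_type",
    QueryExecutionContext={"Database": "analytics"},
    ResultConfiguration={"OutputLocation": "s3://example-bucket/athena-results/"},
)
query_id = execution["QueryExecutionId"]

# Poll until the query finishes (a production job would add timeouts/backoff).
while True:
    status = athena.get_query_execution(QueryExecutionId=query_id)
    state = status["QueryExecution"]["Status"]["State"]
    if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
        break
    time.sleep(1)

if state == "SUCCEEDED":
    rows = athena.get_query_results(QueryExecutionId=query_id)["ResultSet"]["Rows"]
    for row in rows[1:]:  # the first row is the header
        print([col.get("VarCharValue") for col in row["Data"]])
```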
posted 2 weeks ago
experience: 8 to 12 Yrs
location
Pune, Maharashtra
skills
  • Cost Optimization
  • Hadoop Ecosystem Expertise
  • DevOps & Cloud (Preferably Azure)
  • Architectural Leadership
  • End-to-End Big Data Delivery
  • Project Discovery / Due Diligence
Job Description
Role Overview: As a Big Data Architect with over 8 years of experience, you will be responsible for demonstrating expertise in the Hadoop ecosystem, DevOps and cloud (preferably Azure), architectural leadership, end-to-end big data delivery, cost optimization, and project discovery/due diligence. Your role will involve leading end-to-end projects, optimizing cloud costs, and defining solution strategies.
Key Responsibilities:
- Strong hands-on experience with core Hadoop components and related big data technologies
- Utilize DevOps tools and CI/CD processes on cloud platforms, with a preference for Microsoft Azure
- Lead at least three end-to-end projects as a Big Data Architect
- Deliver complete big data solutions from requirement gathering to post-production support
- Optimize cloud and infrastructure costs for data platforms
- Participate in project discovery, assessment, or due diligence phases to define scope and solution strategy
Qualifications Required:
- 8+ years of experience in big data architecture
- Expertise in the Hadoop ecosystem
- Experience with DevOps tools and cloud platforms, preferably Microsoft Azure
- Proven track record of leading end-to-end projects
- Strong ability to optimize cloud and infrastructure costs
- Previous involvement in project discovery or due diligence processes
If you find that your skills match the requirements of this role, we encourage you to apply directly. Feel free to refer or share this opportunity with someone who you believe would be a strong fit.
posted 1 week ago
experience: 5 to 9 Yrs
location
Pune, Maharashtra
skills
  • Hive
  • Hadoop
  • SQL
  • Excel
  • Tableau
  • Power BI
  • Google Analytics
  • Adobe Analytics
  • Data visualization
  • Stakeholder management
  • Project management
  • Mentoring
  • Communication skills
  • Big Data systems
  • Spark (Python)
Job Description
You will be working with Citi Analytics Information Management, a global community that connects and analyzes information to create actionable intelligence for business leaders. As part of your role, you will have the following responsibilities:
- Develop and maintain reporting systems to track key performance metrics, collaborating with cross-functional teams for accurate and timely delivery.
- Rationalize, enhance, transform, and automate reports as required, performing ad hoc and root cause analysis.
- Interpret data to identify trends, patterns, and anomalies, providing insights to stakeholders for informed decision-making.
- Translate data into customer behavioral insights to drive targeting and segmentation strategies, communicating effectively to business partners and senior leaders.
- Collaborate and manage end-to-end project communication with onsite business partners and the team in India.
- Lead projects and mentor a team of analysts, ensuring high-quality work.
- Ensure data accuracy and consistency by following standard control procedures and adhering to Citi's Risk and Control guidelines.
To excel in this role, you should have:
- 5+ years of experience in Big Data systems, Hive, Hadoop, Spark (Python), and cloud-based data management technologies.
- Proficiency in SQL, Excel, and data visualization tools like Tableau, Power BI, or similar software.
- Knowledge of digital channels, marketing, and the various methods/tools businesses use to engage with their audience.
- Expertise in using Google Analytics/Adobe Analytics for tracking and reporting website traffic and journey analytics.
- Strong background in reporting and data analysis, with excellent communication and stakeholder management skills.
- Ability to manage projects, mentor a team, and contribute to organizational initiatives.
Preferred qualifications include exposure to digital business and expertise in Adobe Site Catalyst and clickstream data.
Educational Requirement: Bachelor's degree in computer science, engineering, or a related field.
This job description provides an overview of the work performed, and other job-related duties may be assigned as required.
posted 2 weeks ago

Big Data/Cloud Architect

Atgeir Solutions
experience: 10 to 15 Yrs
location
Pune, Maharashtra
skills
  • Big Data
  • ETL
  • Hadoop
  • Spark
  • Presto
  • Hive
  • Database Management
  • Networking
  • Storage
  • GCP
  • Azure
  • AWS
  • Snowflake
  • Python
  • Java
  • Go
  • Scala
  • Cloud Architect
  • Analytics Systems Development
  • Data Warehouses
  • Hardware Optimization
  • Databricks
Job Description
You are invited to join Atgeir's Advanced Data Analytics team as a Big Data / Cloud Architect. Your role will involve collaborating with customers to understand their requirements and translate them into architectural models for large-scale and high-performance operations. You will provide guidance on running these models on both traditional data platforms (such as Hadoop-based) and modern data platforms (such as cloud-based). Additionally, you will work with customers to develop data management platforms using open source technologies and cloud native services.
Key Responsibilities:
- Collaborate closely with customers to translate requirements into architectural models
- Build data management platforms using open source technologies and cloud native services
- Extract best-practice knowledge and reference architectures for sharing with the Advanced Analytics Centre of Excellence team
- Utilize your technical and analytical skills, with over 10 years of experience in ETL and analytics systems development
- Demonstrate strong verbal and written communication skills to work effectively across internal and external organizations
- Prototype systems based on complex business requirements with quick turnaround time
- Implement and optimize solutions in the big data ecosystem, databases, and data warehouses
- Apply knowledge of foundation infrastructure requirements, with hands-on experience on cloud platforms and data cloud platforms
- Utilize at least one programming language among Python, Java, Go, or Scala
- Be willing to work hands-on on projects and lead large teams
- An architect-level certification on one of the cloud platforms (GCP/Azure/AWS) will be an added advantage
Qualifications Required:
- Minimum 10 years of experience in ETL and analytics systems development
- Strong technical skills and experience in the big data ecosystem, databases, and data warehouses
- Hands-on experience with cloud platforms (GCP/Azure/AWS) and data cloud platforms (Databricks/Snowflake)
- Proficiency in at least one programming language among Python, Java, Go, or Scala
- Strong communication skills and ability to work effectively across teams
Join Atgeir's team and contribute your expertise as a Big Data / Cloud Architect to drive innovation in Advanced Data Analytics.
posted 2 weeks ago
experience: 8 to 12 Yrs
location
Pune, Maharashtra
skills
  • Java
  • Spring Boot
  • Hadoop
  • Snowflake
  • API development
  • Microservices
  • Azure
  • AWS
  • Security
  • Performance tuning
  • Debugging
  • Troubleshooting
  • Mentoring
  • Software Architecture
  • React
  • PostgresDB
  • Apache NiFi
  • PCF
  • Agile/Scrum/SAFe
  • Design Principles
Job Description
Role Overview: As a Lead Software Engineer at our company, you will have the opportunity to lead a talented team and influence the technical direction of a critical security product called Crypto Secure. You will be responsible for designing, developing, and maintaining scalable applications using a variety of modern technologies. Additionally, you will play a key role in mentoring and guiding engineers, collaborating across teams, and championing engineering excellence.
Key Responsibilities:
- Design, develop, and maintain scalable applications using Java, Spring Boot, React, PostgresDB, Apache NiFi, Hadoop, Snowflake, and other modern technologies.
- Break down high-level requirements into well-defined technical solutions and estimates.
- Drive technical decision-making, ensuring alignment with architecture and security best practices.
- Lead technical refinements, provide accurate work estimates, and manage technical dependencies.
- Identify and remove technical roadblocks, ensuring smooth team execution.
- Take ownership of non-functional requirements (performance, security, scalability, etc.).
- Provide technical leadership, mentoring, and coaching to engineers of all levels.
- Conduct code reviews and ensure adherence to best practices.
- Foster a culture of continuous learning and innovation within the team.
- Partner with Product Managers and System Architects to align technical and business priorities.
- Work closely with Quality Engineers, Business Analysts, and other stakeholders to ensure well-defined and actionable backlog items.
- Support project managers in identifying and managing technical dependencies.
- Lead demos and contribute to stakeholder presentations.
- Advocate for clean, maintainable, and testable code.
- Stay up to date with industry trends and emerging technologies, continuously improving engineering practices.
- Promote DevOps and CI/CD best practices for efficient and reliable software delivery.
- Drive adoption of accessibility (a11y), internationalization (i18n), and performance best practices.
Qualifications Required:
- 8+ years of experience developing enterprise-grade applications.
- Strong full-stack experience with Java, Spring Boot, React, and relational databases (PostgresDB preferred).
- Knowledge of big data technologies like Hadoop, Snowflake, and Apache NiFi.
- Experience designing and building scalable, high-performance systems.
- Proficiency in API development, microservices, and cloud platforms (Azure, AWS, PCF).
- Deep understanding of security, performance, and non-functional requirements.
- Proven experience mentoring and guiding engineers.
- Strong communication skills, able to translate complex technical concepts for non-technical stakeholders.
- Experience working in Agile/Scrum/SAFe environments.
- Ability to manage expectations across product and technology teams, ensuring clarity and alignment.
- Ownership mentality, taking pride in delivering high-quality solutions.
- Detail-oriented but able to see the bigger picture, balancing technical excellence with business needs.
- Passionate about innovation, always looking for better ways to build software.
posted 1 week ago
experience: 5 to 9 Yrs
location
Pune, Maharashtra
skills
  • Java
  • Linux
  • Scala
  • Hadoop
  • Spark
  • MapReduce
  • Tableau
  • R
  • Matlab
  • SPSS
  • MongoDB
  • Cassandra
  • Big Data Analytics
  • Redshift
  • Agile/Scrum
  • MarkLogic
  • Teradata
Job Description
You will be a part of the Big Data software development group, working alongside a small but highly skilled team of engineers. Your responsibilities will include evaluating and implementing massive data stores, working on data science, data security, and architecture design projects for a highly visible web presence, and building complex web analytic tools.
Key Responsibilities:
- Develop back-end big data web service-based distributed data ingestion/processing software
- Write code in Java and Linux, with Scala experience being desired
- Work in an Agile/Scrum environment and process large amounts of structured and unstructured data
- Utilize technologies such as Hadoop, Redshift, Spark, MongoDB, Cassandra, MarkLogic, Teradata, etc.
- Collaborate with SQL databases and various programming languages and statistical packages
- Gain experience in real-time analytics and business intelligence platforms like Tableau Software
- Implement machine learning skills and demonstrate data science familiarity
- Stay updated on industry trends and advancements in Big Data analytics
Qualifications Required:
- 5+ years of experience in developing big data web service-based software
- Strong application development skills with the ability to write simple and elegant code
- Proficiency in Java and Linux, and experience with technologies like Hadoop, Redshift, and Spark
- Familiarity with Agile/Scrum methodologies and MapReduce
- Knowledge of SQL databases and of programming languages and statistical packages like R, Java, Scala, Matlab, or SPSS
- Previous exposure to big data analytics tools such as MongoDB, Cassandra, MarkLogic, and Teradata
- Banking/financial sector experience is highly preferred
- Education: B.E/B.Tech in a relevant field
Please note that the company specializes in the IT/Computers-Software industry and values expertise in Big Data, Hadoop, Redshift, Spark, Agile/Scrum, and other related technologies. For any inquiries or to apply for this role, please send your resume to jobs@augustainfotech.com.
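The MapReduce model this listing references is easy to see on Spark's RDD API: map each record to key-value pairs, then reduce by key. A minimal sketch follows; the input path is hypothetical.

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("mapreduce-demo").getOrCreate()
sc = spark.sparkContext

# Classic MapReduce word count: map each line to (word, 1) pairs,
# then reduce by key to sum the counts per word.
lines = sc.textFile("s3://example-bucket/logs/access.log")  # assumed input
counts = (
    lines.flatMap(lambda line: line.split())
    .map(lambda word: (word, 1))
    .reduceByKey(lambda a, b: a + b)
)

# Print the ten most frequent words.
for word, n in counts.takeOrdered(10, key=lambda kv: -kv[1]):
    print(word, n)
```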
posted 2 weeks ago

Associate Principal - Architecture (Big Data)

RiverForest Connections Private Limited
experience: 3 to 8 Yrs
location
Pune, Maharashtra
skills
  • Python
  • Unix scripting
  • Hive
  • SQL
  • Data migration
  • PySpark
  • SparkSQL
  • Hadoop Ecosystem
  • AWS Glue
  • AWS S3
  • Lambda function
  • Step Function
  • EC2
  • Insurance domain knowledge
Job Description
Role Overview: You will be responsible for utilizing your strong experience in PySpark, Python, and Unix scripting to work on data processing tasks. Your expertise in SparkSQL and Hive will be essential in writing SQL and creating views. Additionally, your excellent communication skills will be valuable in collaborating with team members. Knowledge of the insurance domain is a plus, along with a good understanding of the Hadoop ecosystem and architecture, including HDFS, MapReduce, Pig, Hive, Oozie, and YARN. You will also need familiarity with AWS services such as Glue, S3, Lambda functions, Step Functions, and EC2. Data migration from platforms like Hive/S3 to Databricks will be part of your responsibilities. Your ability to prioritize, plan, organize, and manage multiple tasks efficiently while maintaining high-quality work is crucial for success in this role.
Key Responsibilities:
- Utilize strong experience in PySpark, Python, and Unix scripting for data processing tasks
- Write SQL, create views, and work with SparkSQL and Hive
- Collaborate effectively with team members using excellent oral and written communication skills
- Demonstrate knowledge of the insurance domain and the Hadoop ecosystem and architecture
- Use AWS services like Glue, S3, Lambda functions, Step Functions, and EC2
- Perform data migration from platforms like Hive/S3 to Databricks
Qualifications Required:
- Technical experience of 6-8 years in PySpark and AWS (Glue, EMR, Lambda and Step Functions, S3)
- 3+ years of experience in Big Data/ETL with Python, Spark, and Hive, and 3+ years of experience in AWS
- Proficiency in PySpark and AWS (Glue, EMR, Lambda and Step Functions, S3), with big data experience using Python, Spark, and Hive
- Exposure to big data migration
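A rough sketch of the Hive-to-Databricks migration pattern this role mentions, assuming a Hive-enabled, Delta-capable Spark session (typical on Databricks); the database, table names, and path are hypothetical.

```python
from pyspark.sql import SparkSession

# Assumes Hive support and Delta Lake extensions are available on the cluster
# (standard on Databricks; locally this needs delta-spark plus a Hive metastore).
spark = (
    SparkSession.builder.appName("hive-to-delta-migration")
    .enableHiveSupport()
    .getOrCreate()
)

# Read the legacy Hive table (hypothetical name).
legacy = spark.table("legacy_db.policy_claims")

# Write it out as a Delta table partitioned by year, then register it
# so downstream jobs can query it by name.
(
    legacy.write.format("delta")
    .mode("overwrite")
    .partitionBy("claim_year")
    .option("path", "s3://example-bucket/delta/policy_claims/")
    .saveAsTable("curated_db.policy_claims")
)

# Quick row-count reconciliation between source and target.
src, dst = legacy.count(), spark.table("curated_db.policy_claims").count()
assert src == dst, f"row count mismatch: {src} vs {dst}"
```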
posted 2 weeks ago

Abinitio Developer

M/S. B. NANDI
experience: 10 to 20 Yrs
salary: 24 - 36 LPA
location
Pune, Bangalore, Noida, Chennai, Hyderabad, Kolkata, Amritsar, Uttar Dinajpur, Delhi, Uttarkashi

skills
  • developers
  • abinitio
  • development management
  • developer relations
  • technology evangelism
Job Description
Job Role Duties and Responsibilities: An Ab Initio Developer is responsible for giving team status on a variety of projects. Their focus is to escalate issues as necessary, assess and communicate risks to the development schedule and project, and represent the data integration development team's interests in cross-functional project teams, with project success as the ultimate goal.
Responsibilities:
- Monitor and support existing production data pipelines developed in Ab Initio
- Analyze highly complex business requirements, designs, and/or data that require evaluation of intangible variance factors
- Debug daily production issues and rerun the jobs after understanding the issues
- Collaborate throughout the organisation on effective identification of technical issues
- Participate and provide feedback in design reviews
- Complete component design documents on assigned projects
Requirements:
- 7+ years of actual development experience building ETL applications/processes using SAS
- Relevant hands-on experience with Ab Initio and Hadoop technologies (HDFS, Hive, Impala, etc.)
- Good understanding of ETL concepts and tools like Informatica, DataStage, and CloverETL
- Experience in relational databases like Oracle and SQL Server, and in PL/SQL
- Understanding of Agile methodologies as well as SDLC life-cycles and processes
- Experience in writing technical and functional documentation
Soft Skills:
- Ability to work as an individual with minimal guidance/support
- Strong communication/team skills
- Strong analytical skills