hadoop-administrator-jobs-in-ratnagiri, Ratnagiri

158 Hadoop Administrator Jobs nearby Ratnagiri

Toggle to save search
posted 1 month ago

Data Engineer

CAPGEMINI TECHNOLOGY SERVICES INDIA LIMITED
CAPGEMINI TECHNOLOGY SERVICES INDIA LIMITED
experience6 to 11 Yrs
location
Pune, Bangalore+1

Bangalore, Chennai

skills
  • sql
  • scala
  • cloud
  • django
  • hadoop
  • python
  • flask
  • devops
  • pyspark
Job Description
Job Description-  Data EngineerTotal yrs of exp should be 6+yrs   Must have 3+ years in Pyspark. Strong programming experience, Python, Pyspark, Scala is preferred. Experience in designing and implementing CI/CD, Build Management, and Development strategy. Experience with SQL and SQL Analytical functions, experience participating in key business, architectural and technical decisions Scope to get trained on AWS cloud technology Proficient in leveraging Spark for distributed data processing and transformation. Skilled in optimizing data pipelines for efficiency and scalability. Experience with real-time data processing and integration. Familiarity with Apache Hadoop ecosystem components. Strong problem-solving abilities in handling large-scale datasets. Ability to collaborate with cross-functional teams and communicate effectively with stakeholders. Primary Skills : Pyspark SQL Secondary Skill: Experience on AWS/Azure/GCP would be added advantage
INTERVIEW ASSURED IN 15 MINS

Top Companies are Hiring in Your City

For Multiple Roles

Jio Platforms Ltd
Jio Platforms Ltdslide-preview-Genpact
posted 3 weeks ago
experience1 to 5 Yrs
location
Pune, All India
skills
  • Java
  • Spring boot
  • J2EE
  • Spring Framework
  • REST
  • SOAP
  • Tomcat
  • JETTY
  • JBOSS
  • Relational Databases
  • Oracle
  • Informix
  • MySQL
  • Kafka
  • WMQ
  • Active MQ
  • RabbitMQ
  • JUnit
  • TestNG
  • Selenium
  • XML
  • JSON
  • Cloud
  • Azure
  • AWS
  • Data structures
  • Micro services
  • Java Application Servers
  • TOMEE
  • Messaging Systems
  • Testing frameworks
  • Web Services APIs
  • CICD
  • Algorithm
Job Description
As a Java Developer at our company, you will be responsible for the following: - Developing Web Service REST and SOAP and Micro services using Java and Spring framework. - Having a good understanding and working knowledge with any one or more of Java Application Servers such as Tomcat, TOMEE, JETTY, JBOSS, WAS. - Working with Relational Databases like Oracle, Informix, MySQL. - Using Messaging Systems like Kafka, WMQ, Active MQ, RabbitMQ. - Implementing Testing frameworks like JUnit, TestNG, Selenium. - Working with Web Services APIs including REST, SOAP, XML, JSON. Qualifications required for this role include: - Strong background in Java, Spring boot, J2EE, Spring Framework, Spring Boot development. - Minimum One year experience in Cloud platforms such as Azure or AWS. - Exposure to modern deployment and CI/CD processes. - Excellent programming skills including data structures and algorithms. If you join our team, you will have the opportunity to work with cutting-edge technologies and contribute to the development of innovative solutions. As a Java Developer at our company, you will be responsible for the following: - Developing Web Service REST and SOAP and Micro services using Java and Spring framework. - Having a good understanding and working knowledge with any one or more of Java Application Servers such as Tomcat, TOMEE, JETTY, JBOSS, WAS. - Working with Relational Databases like Oracle, Informix, MySQL. - Using Messaging Systems like Kafka, WMQ, Active MQ, RabbitMQ. - Implementing Testing frameworks like JUnit, TestNG, Selenium. - Working with Web Services APIs including REST, SOAP, XML, JSON. Qualifications required for this role include: - Strong background in Java, Spring boot, J2EE, Spring Framework, Spring Boot development. - Minimum One year experience in Cloud platforms such as Azure or AWS. - Exposure to modern deployment and CI/CD processes. - Excellent programming skills including data structures and algorithms. If you join our team, you will have the opportunity to work with cutting-edge technologies and contribute to the development of innovative solutions.
ACTIVELY HIRING
posted 5 days ago
experience6 to 10 Yrs
location
Maharashtra
skills
  • Apache Spark
  • Scala
  • ETL
  • Hadoop
  • Hive
  • HDFS
  • AWS
  • Azure
  • GCP
Job Description
Role Overview: As a Big Data Engineer specializing in Spark & Scala, you will be responsible for developing and optimizing big data processing pipelines using Apache Spark and Scala. Your role will involve designing and implementing ETL workflows for large-scale batch and real-time data processing. Additionally, you will be expected to optimize Spark performance through partitioning, caching, and memory/shuffle tuning. Collaboration with cross-functional teams and adherence to best practices in coding, testing, and deployment are essential aspects of this role. Key Responsibilities: - Develop and optimize big data processing pipelines using Apache Spark (Core, SQL, Streaming) and Scala. - Design and implement ETL workflows for large-scale batch and real-time data processing. - Optimize Spark performance through partitioning, caching, and memory/shuffle tuning. - Work with big data ecosystems like Hadoop, Hive, HDFS, and cloud platforms (AWS/Azure/GCP). - Collaborate with cross-functional teams and follow best practices in coding, testing, and deployment. Qualifications Required: - Bachelor's degree in a relevant field. - 6 to 10 years of experience in a similar role. - Strong expertise in Apache Spark and Scala. - Familiarity with big data ecosystems like Hadoop, Hive, HDFS, and cloud platforms. - Self-confidence and patience. (Note: The additional details of the company were not provided in the job description.),
ACTIVELY HIRING
question

Are these jobs relevant for you?

posted 2 weeks ago

Data Engineers

SID Global Solutions
experience6 to 10 Yrs
location
Maharashtra
skills
  • SQL
  • Python
  • Scala
  • Java
  • Airflow
  • Spark
  • AWS
  • GCP
  • Azure
  • Hadoop
  • Kafka
  • NiFi
Job Description
You will be responsible for designing, building, and maintaining data pipelines (ETL / ELT) and ingesting, transforming, and integrating data from various sources. You will also optimize data storage in data lakes and warehouses, ensuring data quality, consistency, and governance. Additionally, you will collaborate with analytics and data science teams on datasets and monitor, log, and alert data infrastructure. Key Responsibilities: - Design, build, and maintain data pipelines (ETL / ELT) - Ingest, transform, and integrate data from various sources - Optimize data storage in data lakes and warehouses - Ensure data quality, consistency, and governance - Collaborate with analytics and data science teams on datasets - Monitor, log, and alert data infrastructure Qualifications Required: - 6+ years in data engineering or related roles - Proficiency in SQL, Python, Scala, or Java - Experience with ETL/ELT tools such as Airflow, Spark, NiFi, etc. - Familiarity with cloud data platforms like AWS, GCP, Azure - Knowledge of big data technologies like Hadoop, Kafka, Spark (is a plus) - Experience in data modeling, partitioning, and performance tuning (Note: No additional company details were provided in the job description.),
ACTIVELY HIRING
posted 1 week ago
experience5 to 9 Yrs
location
Pune, Maharashtra
skills
  • Hive
  • Hadoop
  • SQL
  • Excel
  • Tableau
  • Power BI
  • Google Analytics
  • Adobe Analytics
  • Data visualization
  • Stakeholder management
  • Project management
  • Mentoring
  • Data analysis
  • Communication
  • BigData systems
  • Spark Python
Job Description
Role Overview: You will be a part of Citi Analytics Information Management, a global community that connects and analyzes information to create actionable intelligence for business leaders. As a member of this fast-growing organization, you will be responsible for developing and maintaining reporting systems, collaborating with cross-functional teams, interpreting data to provide insights, and managing end-to-end project communications. Key Responsibilities: - Develop and maintain reporting systems to track key performance metrics, collaborating with cross-functional teams for accurate and timely delivery of reports and dashboards. - Rationalize, enhance, transform, and automate reports as required, performing adhoc and root cause analysis to address specific challenges. - Interpret data to identify trends, patterns, and anomalies, providing insights to stakeholders for informed decision-making. - Translate data into customer behavioral insights for targeting and segmentation strategies, effectively communicating with business partners and senior leaders. - Collaborate and manage project communication with onsite business partners and team in India, leading projects and mentoring a team of analysts. - Ensure data accuracy and consistency by following standard control procedures and adhering to Citis Risk and Control guidelines. Qualifications Required: - 5+ years of experience in BigData systems, Hive, Hadoop, Spark (Python), and cloud-based data management technologies. - Proficiency in SQL, Excel, and data visualization tools like Tableau, Power BI, or similar software. - Knowledge of digital channels, marketing, and tools used for audience engagement. - Expertise in Google Analytics/Adobe Analytics for tracking and reporting website traffic and journey analytics. - Strong background in reporting and data analysis, excellent communication and stakeholder management skills. - Ability to create presentations, present reports, findings, and recommendations to diverse audiences. - Proven ability to manage projects, mentor teams, and contribute to organizational initiatives. - Bachelor's degree in computer science, Engineering, or related field. Additional Company Details: Citi Analytics Information Management was established in 2003 with locations across multiple cities in India including Bengaluru, Chennai, Gurgaon, Mumbai, and Pune. The function aims to balance customer needs, business strategy, and profit objectives using best-in-class analytic methodologies. (Note: Omitted the irrelevant sections such as EEO Policy Statement and Other Relevant Skills),
ACTIVELY HIRING
posted 1 week ago
experience5 to 9 Yrs
location
Pune, Maharashtra
skills
  • Hive
  • Hadoop
  • SQL
  • Excel
  • Tableau
  • Power BI
  • Google Analytics
  • Adobe Analytics
  • Data visualization
  • Stakeholder management
  • Project management
  • Mentoring
  • Data analysis
  • Communication
  • BigData systems
  • Spark Python
Job Description
As a member of Citi Analytics Information Management, you will play a crucial role in developing and maintaining reporting systems to track key performance metrics aligned with the organization's goals. Your responsibilities will include collaborating with cross-functional teams to ensure accurate and timely delivery of reports and dashboards. Additionally, you will rationalize, enhance, transform, and automate reports as required, while performing adhoc analysis and root cause analysis to address specific reporting challenges. You will be expected to interpret data to identify trends, patterns, and anomalies, providing valuable insights to stakeholders to support informed decision-making. Your role will involve translating data into customer behavioral insights to drive targeting and segmentation strategies. Effective communication skills will be essential as you will be required to clearly and effectively communicate with business partners and senior leaders. Furthermore, you will collaborate individually and manage end-to-end project communication with onsite business partners and the team in India. Your leadership skills will be put to the test as you lead projects and mentor a team of analysts to maintain a high standard of work. It will be crucial to ensure data accuracy and consistency by following standard control procedures and adhering to Citis Risk and Control guidelines. To excel in this role, you should possess 5+ years of experience in BigData systems, Hive, Hadoop, Spark (Python), and cloud-based data management technologies. Proficiency in SQL, Excel, and data visualization tools such as Tableau, Power BI, or similar software is required. Knowledge of digital channels, marketing methods, and tools used by businesses to engage with their audience is essential. Additionally, expertise in using Google Analytics or Adobe Analytics to track and report website traffic, funnel performance, and journey analytics is preferred. Your educational background should include a Bachelor's degree in computer science, engineering, or a related field. Strong communication and stakeholder management skills are a must, as well as the ability to create presentations and present reports, findings, and recommendations to diverse audiences. Your proven ability to manage projects and mentor a team will be valuable assets in this role. This job description provides an overview of the responsibilities and expertise required for the position. Other duties may be assigned as needed. Position: C11 Job Family Group: Decision Management Job Family: Data/Information Management Time Type: Full time Most Relevant Skills: - 5+ years of experience in BigData systems, Hive, Hadoop, Spark (Python), and cloud-based data management technologies - Proficiency in SQL, Excel, and data visualization tools such as Tableau, Power BI or similar software - Knowledge of digital channels, marketing methods, and tools used by businesses to engage with their audience - Expertise in using Google Analytics or Adobe Analytics - Strong communication and stakeholder management skills - Project management and team mentoring experience Preferred Qualifications: - Exposure to Digital Business and Expertise in Adobe Site Catalyst, Clickstream Data If you require a reasonable accommodation to use our search tools or apply for a career opportunity due to a disability, please review Accessibility at Citi.,
ACTIVELY HIRING
posted 1 day ago
experience8 to 14 Yrs
location
Pune, Maharashtra
skills
  • Python
  • Apache Spark
  • Hadoop
  • AWS
  • Azure
  • GCP
  • SQL
  • Oracle
  • PostgreSQL
  • Docker
  • Kubernetes
  • Data Pipeline Development
  • Big Data Infrastructure
  • Apache Iceberg
  • Apache Hudi
  • Trino
  • Apache Airflow
  • Prefect
Job Description
As an Applications Development Senior Programmer Analyst at our company, you will play a crucial role in establishing and implementing new or revised application systems and programs in coordination with the Technology team. Your main responsibilities will involve conducting feasibility studies, providing IT planning, developing new applications, and offering user support. You will need to leverage your specialty knowledge to analyze complex problems, recommend security measures, and consult with users on advanced programming solutions. Additionally, you will be responsible for ensuring operating standards are followed and acting as an advisor to junior analysts. **Key Responsibilities:** - Conduct feasibility studies, time and cost estimates, and risk analysis for applications development - Monitor and control all phases of the development process including analysis, design, testing, and implementation - Provide user support and operational assistance on applications - Analyze complex problems and provide evaluation of business and system processes - Recommend and develop security measures for successful system design - Consult with users on advanced programming solutions and assist in system installations - Define operating standards and processes and serve as an advisor to junior analysts **Qualifications:** - 8-14 years of relevant experience - Strong experience in systems analysis and programming - Proven track record in managing and implementing successful projects - Working knowledge of consulting and project management techniques - Ability to work under pressure and manage deadlines effectively - Proficiency in Python programming language - Expertise in data processing frameworks like Apache Spark, Hadoop - Experience with cloud data platforms such as AWS, Azure, or GCP - Strong knowledge of SQL and database technologies - Familiarity with data orchestration tools like Apache Airflow or Prefect - Experience with containerization technologies like Docker and Kubernetes would be a plus In addition to the above responsibilities and qualifications, please note that this job description provides a high-level overview of the work performed. Additional duties may be assigned as required. If you are a person with a disability and require a reasonable accommodation to apply for this role, please review the Accessibility at Citi information.,
ACTIVELY HIRING
posted 3 days ago
experience3 to 15 Yrs
location
Maharashtra
skills
  • Python
  • JS
  • Angular
  • Java
  • C
  • MySQL
  • Elastic Search
  • Elasticsearch
  • Kafka
  • Apache Spark
  • Logstash
  • Hadoop
  • Hive
  • Kibana
  • Athena
  • Presto
  • BigTable
  • AWS
  • GCP
  • Azure
  • unit testing
  • continuous integration
  • Agile Methodology
  • React
  • Tensorflow
Job Description
Role Overview: As a Software Engineer at ReliaQuest, you will have the opportunity to work on cutting-edge technologies and drive the automation of threat detection and response for a rapidly growing industry. You will be responsible for researching and developing creative solutions, creating REST APIs, managing deployment processes, performing code reviews, and automating various stages of the software development lifecycle. Collaboration with internal and external stakeholders will be key to ensure seamless product utilization. Key Responsibilities: - Research and develop solutions using cutting-edge technologies to evolve the GreyMatter platform - Create REST APIs and integrations to enhance and automate threat detection for customers - Manage continuous integration and deployment processes for complex technologies - Conduct code reviews to ensure consistent improvement - Automate and enhance all stages of software development lifecycle - Collaborate closely with different parts of the business to facilitate easy product utilization - Provide support to team members and foster a culture of collaboration Qualifications Required: - 3-6 years of Software Development experience for mid-level roles and 7-15 years for Senior-level positions in Python, JS, React, Angular, Java, C#, MySQL, Elastic Search or equivalent - Proficiency in written and verbal English - Hands-on experience with technologies such as Elasticsearch, Kafka, Apache Spark, Logstash, Hadoop/hive, Tensorflow, Kibana, Athena/Presto/BigTable, Angular, React - Familiarity with cloud platforms like AWS, GCP, or Azure - Strong understanding of unit testing, continuous integration, and deployment practices - Experience with Agile Methodology - Higher education or relevant certifications This job at ReliaQuest offers you the chance to be part of a dynamic team working on groundbreaking security technology. Join us to contribute to the growth and success of the company while learning from some of the best in the industry.,
ACTIVELY HIRING
posted 1 week ago
experience4 to 8 Yrs
location
Maharashtra
skills
  • Software Development
  • Big Data
  • Algorithms
  • Statistics
  • Machine Learning
  • Continuous Integration
  • Indexing
  • Clustering
  • SQL
  • Redis
  • Hadoop
  • Yarn
  • Spark
  • Kafka
  • PostGIS
  • Data Visualization
  • Cloud Environment
  • Agile Scrum Processes
  • NoSQL DBs
  • Time Series DBs
  • Geospatial DBs
  • Python Programming
  • Data Processing Analytics
  • Querying
  • Mongo
  • Casandra
  • Redshift
  • PigHive
  • Machine Learning Algorithms
Job Description
Role Overview: As a Senior Software Engineer - Analytics at LogiNext, you will be responsible for building data products that extract valuable business insights for efficiency and customer experience. Your role will involve managing, processing, and analyzing large amounts of raw information in scalable databases. Additionally, you will be developing unique data structures and writing algorithms for new products. Critical thinking and problem-solving skills will be essential, along with experience in software development and advanced algorithms. Exposure to statistics and machine learning algorithms as well as familiarity with cloud environments, continuous integration, and agile scrum processes will be beneficial. Key Responsibilities: - Develop software that generates data-driven intelligence in products dealing with Big Data backends - Conduct exploratory analysis of data to design efficient data structures and algorithms - Manage data in large-scale data stores (e.g., NoSQL DBs, time series DBs, Geospatial DBs) - Create metrics and evaluate algorithms for improved accuracy and recall - Ensure efficient data access and usage through methods like indexing and clustering - Collaborate with engineering and product development teams Qualifications Required: - Master's or Bachelor's degree in Engineering (Computer Science, Information Technology, Information Systems, or related field) from a top-tier school, or a master's degree or higher in Statistics, Mathematics, with a background in software development - 4 to 7 years of experience in product development with algorithmic work - 3+ years of experience working with large data sets or conducting large-scale quantitative analysis - Understanding of SaaS-based products and services - Strong algorithmic problem-solving skills - Ability to mentor and manage a team, taking responsibility for team deadlines - Proficiency in Python programming language - Experience with data processing analytics and visualization tools in Python (e.g., pandas, matplotlib, Scipy) - Strong understanding of SQL and querying NoSQL databases (e.g., Mongo, Cassandra, Redis) - Understanding of working with and managing large databases, such as indexing, sharding, caching, etc. - Exposure to Big Data technologies like Hadoop, Yarn, Redshift, Spark, Kafka, Pig/Hive - Exposure to machine learning algorithms - Familiarity with geospatial data stores, with exposure to PostGIS being a plus - Desirable exposure to data visualization tools,
ACTIVELY HIRING
posted 3 days ago

Cloud Data Engineer

Hitachi Careers
experience6 to 10 Yrs
location
Pune, Maharashtra
skills
  • SQL
  • Python
  • Hadoop
  • Spark
  • AWS
  • Azure
  • GCP
  • Data governance
  • Data security
  • Compliance
  • ETL processes
Job Description
As a Data Engineer at the company, you will be responsible for designing, implementing, and maintaining the data infrastructure and pipelines necessary for AI/ML model training and deployment. Working closely with data scientists and engineers, you will ensure that data is clean, accessible, and efficiently processed. Key Responsibilities: - Build and maintain scalable data pipelines for data collection, processing, and analysis. - Ensure data quality and consistency for training and testing AI models. - Collaborate with data scientists and AI engineers to provide the required data for model development. - Optimize data storage and retrieval to support AI-driven applications. - Implement data governance practices to ensure compliance and security. Qualifications Required: - 6-8 years of experience in data engineering, preferably in financial services. - Strong proficiency in SQL, Python, and big data technologies (e.g., Hadoop, Spark). - Experience with cloud platforms (e.g., AWS, Azure, GCP) and data warehousing solutions. - Familiarity with ETL processes and tools, as well as knowledge of data governance, security, and compliance best practices. At GlobalLogic, you will experience a culture of caring where people come first. You will be part of an inclusive culture of acceptance and belonging, building meaningful connections with collaborative teammates, supportive managers, and compassionate leaders. The company is committed to your continuous learning and development, offering opportunities to try new things, sharpen your skills, and advance your career. You will work on projects that matter, engage your curiosity and problem-solving skills, and contribute to cutting-edge solutions shaping the world today. GlobalLogic supports balance and flexibility in work and life, providing various career areas, roles, and work arrangements. Joining GlobalLogic means being part of a high-trust organization where integrity is key, and trust is fundamental to relationships with employees and clients.,
ACTIVELY HIRING
posted 2 weeks ago
experience8 to 12 Yrs
location
Pune, Maharashtra
skills
  • Talend
  • SSIS
  • DataStage
  • Azure
  • GCP
  • NoSQL
  • Spark
  • Dynatrace
  • JIRA
  • Process automation
  • Performance tuning
  • Communication skills
  • Stakeholder management
  • Team leadership
  • Mentoring
  • Compliance management
  • Automation
  • Scripting
  • Metadata management
  • Master data management
  • ETL development
  • Technical solutioning
  • Leading largescale data initiatives
  • ETL tools Informatica
  • Data pipeline orchestration tools Apache Airflow
  • Azure Data Factory
  • Cloud platforms AWS
  • Databases SQL
  • Big data ecosystems Hadoop
  • Monitoring tools Splunk
  • Prometheus
  • Grafana
  • ITSM platforms ServiceNow
  • CICD
  • DevOps practices
  • RFP solutioning
  • Client proposals
  • Technical strategy alignment
  • Data management standards
  • Emerging technologies evaluation
  • Legacy systems modernization
  • Security standards
  • Audit standards
  • Regulatory standards
  • Selfhealing mechanisms
  • Workflow orchestration
  • Lin
Job Description
As a candidate for the position at Nagarro, you will be responsible for leading large-scale data initiatives and RFP responses with over 7.5 years of experience in data operations, ETL development, and technical solutioning. Your role will involve hands-on experience with ETL tools such as Informatica, Talend, SSIS, and data pipeline orchestration tools like Apache Airflow and Azure Data Factory. Exposure to cloud platforms (AWS, Azure, GCP), databases (SQL, NoSQL), and big data ecosystems (Hadoop, Spark) will be essential. Additionally, familiarity with monitoring tools (e.g., Splunk, Dynatrace, Prometheus, Grafana), ITSM platforms (e.g., ServiceNow, JIRA), CI/CD, DevOps practices, and monitoring tools for data environments is required. Key Responsibilities: - Lead the technical and business solutioning of RFPs and client proposals related to data engineering, data operations, and platform modernization. - Collaborate with architecture, governance, and business teams to align on technical strategy and data management standards. - Evaluate emerging technologies and frameworks to modernize legacy systems and improve efficiency. - Mentor and guide a team of data engineers and operations analysts, fostering a culture of technical excellence and continuous improvement. - Act as a primary liaison with business, support, and governance teams for operational matters. - Drive automation, self-healing mechanisms, and process automation across data operations to reduce manual interventions and improve system resilience. - Implement and enforce best practices including metadata management, lineage tracking, data quality monitoring, and master data management. Qualifications: - Bachelors or masters degree in computer science, Information Technology, or a related field. Please note that Nagarro is a Digital Product Engineering company with a dynamic and non-hierarchical work culture, comprising over 17500 experts across 39 countries. Join us in building products, services, and experiences that inspire, excite, and delight.,
ACTIVELY HIRING
posted 2 weeks ago

Sr. Big Data Engineer

Facile Services
experience5 to 9 Yrs
location
Pune, Maharashtra
skills
  • Hadoop
  • Apache Spark
  • EMR
  • Athena
  • Glue
  • Python
  • JSON
  • NoSQL
  • Databricks
  • Delta Tables
  • PySpark
  • AWS data analytics services
  • Parquet file format
  • RDBMS databases
Job Description
As a Big Data Engineer, you will play a crucial role in developing and managing the Big Data solutions for our company. Your responsibilities will include designing and implementing Big Data tools and frameworks, implementing ELT processes, collaborating with development teams, building cloud platforms, and maintaining the production system. Key Responsibilities: - Meet with managers to determine the companys Big Data needs. - Develop Big Data solutions on AWS using tools such as Apache Spark, Databricks, Delta Tables, EMR, Athena, Glue, and Hadoop. - Load disparate data sets and conduct pre-processing services using Athena, Glue, Spark, etc. - Collaborate with software research and development teams. - Build cloud platforms for the development of company applications. - Maintain production systems. Qualifications Required: - 5+ years of experience as a Big Data Engineer. - Proficiency in Python & PySpark. - In-depth knowledge of Hadoop, Apache Spark, Databricks, Delta Tables, and AWS data analytics services. - Extensive experience with Delta Tables, JSON, Parquet file format. - Experience with AWS data analytics services like Athena, Glue, Redshift, EMR. - Familiarity with Data warehousing will be a plus. - Knowledge of NoSQL and RDBMS databases. - Good communication skills. - Ability to solve complex data processing and transformation related problems.,
ACTIVELY HIRING
posted 2 months ago
experience2 to 6 Yrs
location
Maharashtra
skills
  • Data warehousing
  • Data solutions
  • Data marts
  • SQL
  • Oracle
  • SQOOP
  • ETL development
  • Data storage
  • Data warehousing concepts
  • Logical data model
  • Physical database structure
  • Operational data stores
  • NIFI
Job Description
As an Officer / Assistant Manager based in Mumbai, you should have a minimum of 2-3 years of ETL development experience with knowledge of ETL ideas, tools, and data structures. Your responsibilities will include: - Analyzing and troubleshooting complicated data sets - Determining data storage needs - Building a data warehouse for internal departments using data warehousing concepts - Creating and enhancing data solutions for seamless data delivery - Collecting, parsing, managing, and analyzing large sets of data - Leading the design of logical data models and implementing physical database structures - Designing, developing, automating, and supporting complex applications for data extraction, transformation, and loading - Ensuring data quality during ETL processes - Developing logical and physical data flow models for ETL applications - Utilizing advanced knowledge of SQL, Oracle, SQOOP, NIFI tools commands, and queries Qualifications required for this role include a B.E., MCA, B.Tech, or M.Sc (I.T.) degree, and an age limit of 25-30 years. If you are interested in this position, please email your resume to careers@cdslindia.com with the position applied for clearly mentioned in the subject column.,
ACTIVELY HIRING
posted 2 weeks ago
experience8 to 12 Yrs
location
Pune, Maharashtra
skills
  • Cost Optimization
  • Hadoop Ecosystem Expertise
  • DevOps Cloud Preferably Azure
  • Architectural Leadership
  • EndtoEnd Big Data Delivery
  • Project Discovery Due Diligence
Job Description
Role Overview: As a Big Data Architect with over 8 years of experience, you will be responsible for demonstrating expertise in the Hadoop Ecosystem, DevOps & Cloud (Preferably Azure), Architectural Leadership, End-to-End Big Data Delivery, Cost Optimization, and Project Discovery / Due Diligence. Your role will involve leading end-to-end projects, optimizing cloud costs, and defining solution strategies. Key Responsibilities: - Strong hands-on experience with core Hadoop components and related big data technologies - Utilize DevOps tools and CI/CD processes on cloud platforms, with a preference for Microsoft Azure - Lead at least three end-to-end projects as a Big Data Architect - Deliver complete big data solutions from requirement gathering to post-production support - Optimize cloud and infrastructure costs for data platforms - Participate in project discovery, assessment, or due diligence phases to define scope and solution strategy Qualifications Required: - 8+ years of experience in Big Data Architecture - Expertise in the Hadoop Ecosystem - Experience with DevOps tools and cloud platforms, preferably Microsoft Azure - Proven track record of leading end-to-end projects - Strong ability to optimize cloud and infrastructure costs - Previous involvement in project discovery or due diligence processes If you find that your skills match the requirements of this role, we encourage you to apply directly. Feel free to refer or share this opportunity with someone who you believe would be a strong fit.,
ACTIVELY HIRING
posted 1 week ago
experience5 to 9 Yrs
location
Pune, Maharashtra
skills
  • Hive
  • Hadoop
  • SQL
  • Excel
  • Tableau
  • Power BI
  • Google Analytics
  • Adobe Analytics
  • Data visualization
  • Stakeholder management
  • Project management
  • Mentoring
  • Communication skills
  • BigData systems
  • Spark Python
Job Description
You will be working with Citi Analytics Information Management, a global community that connects and analyzes information to create actionable intelligence for business leaders. As part of your role, you will have the following responsibilities: - Develop and maintain reporting systems to track key performance metrics, collaborating with cross-functional teams for accurate and timely delivery. - Rationalize, enhance, transform, and automate reports as required, performing adhoc and root cause analysis. - Interpret data to identify trends, patterns, and anomalies, providing insights to stakeholders for informed decision-making. - Translate data into customer behavioral insights to drive targeting and segmentation strategies, communicating effectively to business partners and senior leaders. - Collaborate and manage end-to-end project communication with onsite business partners and team in India. - Lead projects and mentor a team of analysts, ensuring high-quality work. - Ensure data accuracy and consistency by following standard control procedures and adhering to Citis Risk and Control guidelines. To excel in this role, you should have: - 5+ years of experience in BigData systems, Hive, Hadoop, Spark (Python), and cloud-based data management technologies. - Proficiency in SQL, Excel, and data visualization tools like Tableau, Power BI, or similar software. - Knowledge of digital channels, marketing, and various methods/tools businesses use to engage with their audience. - Expertise in using Google Analytics/Adobe Analytics for tracking and reporting website traffic and journey analytics. - Strong background in reporting and data analysis, with excellent communication and stakeholder management skills. - Ability to manage projects, mentor a team, and contribute to organizational initiatives. Preferred qualifications include exposure to Digital Business and expertise in Adobe Site Catalyst, Clickstream Data. Educational Requirement: - Bachelor's degree in computer science, Engineering, or related field This job description provides an overview of the work performed, and other job-related duties may be assigned as required.,
ACTIVELY HIRING
posted 2 weeks ago

Big Data/Cloud Architect

Atgeir Solutions
experience10 to 15 Yrs
location
Pune, Maharashtra
skills
  • Big Data
  • ETL
  • Hadoop
  • Spark
  • Presto
  • Hive
  • Database Management
  • Networking
  • Storage
  • GCP
  • Azure
  • AWS
  • Snowflake
  • Python
  • Java
  • Go
  • Scala
  • Cloud Architect
  • Analytics Systems Development
  • Data Warehouses
  • Hardware Optimization
  • Databricks
Job Description
You are invited to join Atgeir's Advanced Data Analytics team as a Big Data / Cloud Architect. Your role will involve collaborating with customers to understand their requirements and translate them into architectural models for large-scale and high-performance operations. You will provide guidance on running these models on both traditional Data Platforms (such as Hadoop Based) and Modern Data Platforms (such as Cloud Based). Additionally, you will work with customers to develop data management platforms using Open Source Technologies and Cloud Native services. Key Responsibilities: - Collaborate closely with customers to translate requirements into architectural models - Build data management platforms using Open Source Technologies and Cloud Native services - Extract best-practice knowledge and reference architectures for sharing with the Advanced Analytics Centre of Excellence team - Utilize your technical and analytical skills with over 10 years of experience in ETL and analytics systems development - Demonstrate strong verbal and written communication skills to work effectively across internal and external organizations - Prototype systems based on complex business requirements with quick turnaround time - Implement and optimize solutions in the Big Data Ecosystem, Databases, and Data Warehouses - Have knowledge of foundation infrastructure requirements and hands-on experience with cloud platforms and data cloud platforms - Utilize at least one programming language among Python, Java, Go, or Scala - Be willing to work hands-on on projects and lead large teams - Possess an architect level certification on one of the cloud platforms (GCP / Azure / AWS) will be an added advantage Qualifications Required: - Minimum 10 years of experience in ETL and analytics systems development - Strong technical skills and experience in the Big Data Ecosystem, Databases, and Data Warehouses - Hands-on experience with cloud platforms (GCP / Azure / AWS) and data cloud platforms (Databricks / Snowflake) - Proficiency in at least one programming language among Python, Java, Go, or Scala - Strong communication skills and ability to work effectively across teams Join Atgeir's team and contribute your expertise as a Big Data / Cloud Architect to drive innovation in Advanced Data Analytics.,
ACTIVELY HIRING
posted 1 week ago
experience5 to 9 Yrs
location
Pune, Maharashtra
skills
  • Java
  • Linux
  • Scala
  • Hadoop
  • Spark
  • MapReduce
  • Tableau
  • R
  • Matlab
  • SPSS
  • MongoDB
  • Cassandra
  • Big Data Analytics
  • Redshift
  • AgileScrum
  • Mark Logic
  • Teradata
Job Description
You will be a part of the Big Data Software development group, working alongside a small but highly skilled team of engineers. Your responsibilities will include evaluating and implementing massive data stores, working on data science, data security, and architecture design projects for a highly visible web presence, and building complex web analytic tools. Key Responsibilities: - Develop back-end big data web service-based distributed data ingestion/processing software - Write code in Java and Linux, with Scala experience being desired - Work in an Agile/SCRUM environment and process large amounts of structured and unstructured data - Utilize technologies such as Hadoop, Redshift, Spark, MongoDB, Cassandra, Mark Logic, Teradata, etc. - Collaborate with SQL Databases and various programming languages and statistical packages - Gain experience in real-time analytics and business intelligent platforms like Tableau Software - Implement machine learning skills and demonstrate data science familiarity - Stay updated on industry trends and advancements in Big Data Analytics Qualifications Required: - 5+ years of experience in developing big data web service-based software - Strong application development skills with the ability to write simple and elegant code - Proficiency in Java, Linux, and experience with technologies like Hadoop, Redshift, Spark - Familiarity with Agile/SCRUM methodologies and MapReduce - Knowledge of SQL Databases, programming languages like R, Java, Scala, Matlab or SPSS - Previous exposure to big data analytics tools such as MongoDB, Cassandra, Mark Logic, Teradata - Banking/Financial sector experience is highly preferred - Education: B.E/B.Tech in relevant field Please note that the company specializes in IT/Computers-Software industry and values expertise in Big Data, Hadoop, Redshift, Spark, Agile/Scrum, and other related technologies. For any inquiries or to apply for this role, please send your resume to jobs@augustainfotech.com.,
ACTIVELY HIRING
posted 1 week ago

Hadoop Admin

PNR Software Solutions
experience2 to 7 Yrs
location
Maharashtra
skills
  • IOs
  • Load balancing
  • Data extraction
  • Data transformation
  • Reporting
  • Advanced analytics
  • Hadoop Admin
  • Hortonworks Hadoop
  • User access management
  • Data Lake monitoring
  • Cluster health checkup
  • Database size
  • Edge node utilities
  • Designing Hadoop cluster
  • Setting up Hadoop cluster
  • Data ingestion
  • Exploratory reporting
  • ML platform
Job Description
As a Hadoop Admin for our client in Mumbai, you will be responsible for the following key tasks: - Managing and administrating on-premise Hortonworks Hadoop cluster - Knowledge on User access management - Data Lake monitoring including Cluster health checkup, database size, no of connections, IOs, edge node utilities, load balancing - Designing, estimating, and setting up Hadoop cluster - Managing multiple Hadoop utilities such as data ingestion, extraction, transformation, reporting, exploratory reporting, Advanced analytics, ML platform Qualifications required: - Any qualification If you are interested in this opportunity, please send your updated resumes to recruiter1@pnrsoftsol.com for a quick response.,
ACTIVELY HIRING
posted 2 weeks ago
experience2 to 6 Yrs
location
Nagpur, All India
skills
  • Big Data
  • Hadoop
  • HDFS
  • MapReduce
  • Pig
  • Hive
  • Sqoop
  • Hbase
  • Java
  • Communication
  • Presentation
  • ZooKeeper
Job Description
You will be responsible for conducting training on Big Data / Hadoop and assigning assignments and projects based on Hadoop. Key Responsibilities: - Conduct training sessions on Big Data / Hadoop - Assign and oversee projects related to Hadoop Qualifications Required: - Minimum 2 years of hands-on experience in Hadoop/Big Data Technology within the corporate sector - Excellent knowledge of Hadoop, Big Data, HDFS, MapReduce, Pig, Hive, Sqoop, ZooKeeper, Hbase, Java - Excellent communication and presentation skills - Dynamic personality Please note that weekend positions are available for working faculties. You will be responsible for conducting training on Big Data / Hadoop and assigning assignments and projects based on Hadoop. Key Responsibilities: - Conduct training sessions on Big Data / Hadoop - Assign and oversee projects related to Hadoop Qualifications Required: - Minimum 2 years of hands-on experience in Hadoop/Big Data Technology within the corporate sector - Excellent knowledge of Hadoop, Big Data, HDFS, MapReduce, Pig, Hive, Sqoop, ZooKeeper, Hbase, Java - Excellent communication and presentation skills - Dynamic personality Please note that weekend positions are available for working faculties.
ACTIVELY HIRING
posted 2 weeks ago

Associate Principal - Architecture (Big Data)

RiverForest Connections Private Limited
experience3 to 8 Yrs
location
Pune, Maharashtra
skills
  • Python
  • Unix scripting
  • Hive
  • SQL
  • Data migration
  • PySpark
  • SparkSQL
  • Hadoop Ecosystem
  • AWS Glue
  • AWS S3
  • Lambda function
  • Step Function
  • EC2
  • Insurance domain knowledge
Job Description
Role Overview: You will be responsible for utilizing your strong experience in PySpark, Python, and Unix scripting to work on data processing tasks. Your expertise in SparkSQL and Hive will be essential in writing SQLs and creating views. Additionally, your excellent communication skills will be valuable in collaborating with team members. Knowledge of the Insurance domain is a plus, along with a good understanding of the Hadoop Ecosystem and Architecture, including HDFS, Map Reduce, Pig, Hive, Oozie, and Yarn. You will also need familiarity with AWS services such as Glue, AWS S3, Lambda function, Step Function, and EC2. Data migration exposure from platforms like Hive/S3 to Data Bricks will be part of your responsibilities. Your ability to prioritize, plan, organize, and manage multiple tasks efficiently while maintaining high-quality work is crucial for success in this role. Key Responsibilities: - Utilize strong experience in PySpark, Python, and Unix scripting for data processing tasks - Write SQLs, create views, and work with SparkSQL and Hive - Collaborate effectively with team members using excellent oral and written communication skills - Demonstrate knowledge of the Insurance domain and the Hadoop Ecosystem and Architecture - Use AWS services like Glue, AWS S3, Lambda function, Step Function, and EC2 - Perform data migration from platforms like Hive/S3 to Data Bricks Qualifications Required: - Technical experience of 6-8 years in Pyspark, AWS (Glue, EMR, Lambda & Steps functions, S3) - 3+ years of experience in Bigdata/ETL with Python+Spark+Hive and 3+ years of experience in AWS - Proficiency in Pyspark, AWS (Glue, EMR, Lambda & Steps functions, S3), and Big data with Python+Spark+Hive experience - Exposure to big data migration Additional Company Details: There are no additional details about the company mentioned in the job description.,
ACTIVELY HIRING
logo

@ 2025 Shine.com | All Right Reserved

Connect with us:
  • LinkedIn
  • Instagram
  • Facebook
  • YouTube
  • Twitter