Sr. Data Engineer (10+ Years)
Python ScriptingPySparkSparkScalaETL ToolsAzure (Data Lakes, Data Factory and Data Bricks)
Please submit resumes in the below format.
Candidate Full Name (Legal Name):
Highest Education / University / Year of Passing:
Total Years of Experience:
Relevant Years of Experience:
Total Years of US Experience (for h1b/visa candidate):
Willing To Relocate to San Antonio, TX
Notice Period/Available to start:
Work Authorization, provide validity if h1b/any visa
Hourly Rate / Salary:
Sr. Data Engineer
We are looking for a savvy Data Engineer to join our growing team of analytics experts. The hire will be responsible for expanding and optimizing our data and data pipeline architecture, as well as optimizing data flow and collection for cross functional teams. The Data Engineer will support our software developers, database architects, data analysts and data scientists on data initiatives and will ensure optimal data delivery architecture is consistent throughout ongoing projects. They must be self-directed and comfortable supporting the data needs of multiple teams, systems, and products.
Communicate progress across organizations and levels from individual contributor to senior executive. Identify and clarify the critical issues that need action and drive appropriate decisions and actions. Communicate results clearly and in actionable form.Lead development and ongoing maintenance and enhancement of applications running on Azure Cloud and business intelligence tools.Detailed technical design, conduct analysis, development of applications and proof of conceptsDevelop microservices, application code and configuration to deliver applicationProvide technical leadership for development & BI team to deliver on various initiatives.Lead problem resolution tasks, document approach for support mechanismsEnsure all solutions meet Enterprise Guidelines and industry standards/best practicesAdvise IT and business stakeholders of alternative solutionsEnsure optimal system performance across BI & Analytics platforms.Lead the effort to monitor system activity, tune performance and architect solutions to meet future demand.Offer technical guidance to team members and lead design/requirements sessionsBenchmark systems, analyze system bottlenecks and propose solutions to eliminate them;Articulate pros and cons of various technologies and platforms and document use cases, solutions and recommendations;Troubleshoot complex system issues and handle multiple tasks simultaneouslyEnsure all solutions meet Enterprise Guidelines and industry standards/best practices
Bachelor's Degree or master's degree in Computer Science, Mathematics, Statistics.4+ years of development experience in using Spark to build applications through Python and PySpark3+ years' hands-on experience developing optimized, complex SQL queries and writing PLSQL code across large volumes of data in both relational and multi-dimensional data sources such as Teradata, Hive, Impala, Oracle, TeradataExperience in deploying and developing application using AzureExperience working with disparate data-sets in multiple formats like JSON, Avro, text files, Kafka queues, and log data and storage like blob/ADLS GEN22+ years of strong ETL experience on either Informatica, Ab-Initio, Talend, DataStage, SyncsortEnthusiastic to work on disparate datasets in multiple formats like JSON, Avro, text files, Kafka queues, and Knowledge of software design and programming principles.Experience working in Scrum Agile framework and using DevOps to deploy and manage code.Good communication and team-working skills