Staff Data Engineer / Multimodal AI

Chicago, Illinois

Open to Remote

Direct Hire

$165k - $180k

We are looking for a Staff Data Engineer to join one of the 100 Best Companies to Work For, with offices in the Chicagoland area and remote flexibility, on a full-time basis. This leading supplier of maintenance, repair, and operating products for businesses and a wide range of industries across the globe has a focus on sustainability, social responsibility, and community engagement, serving more than 4.5 million customers worldwide.

As a Staff Data Engineer, you will help the team by ingesting flat files and multimodal files like PDFs, videos, voice recordings, and images to help improve their AI/ML platform that uses LLM technologies. This platform will help how the organization is servicing their various customers. In this role you’ll be contributing to ongoing architecture, development, and mentor junior engineers. The ideal candidate will have a strong background in data engineering, with a focus on Databricks, and have experience implementing real-time streaming technologies. Having experience IaC will be an added bonus.

We encourage you to apply for this exciting opportunity and be part of a company that values sustainability, social responsibility, community engagement, innovation, and growth. Required Skills & Experience

As the technical lead and architect, your primary responsibility will be to design and implement highly efficient, reusable, and scalable data processing systems and pipelines in Databricks and Snowflake.
Design and implement technical solutions and processes to ensure data reliability and accuracy.
Develop data models and mappings and build new data assets required by users. Perform exploratory data analysis on existing products and datasets.
Educate data engineering teams in adopting new data patterns and tools.
Function as SME within this area when engaging with our AI, Platform, and Business Analytics teams to build useful pipelines and data assets.

Desired Skills & Experience

Experience in batch and streaming ETL using Spark, Python, Scala, Snowflake or Databricks for Data Engineering or Machine Learning workloads.
Experience orchestrating and implementing pipelines with workflow tools like Databricks Workflows, Apache Airflow, or Luigi
Experience prepping structured and unstructured data for data science models.
Experience with containerization and orchestration technologies (Docker, Kubernetes) and experience with shell scripting in Bash, Unix or windows shell is preferable.
Implemented CI/CD with automated testing in Jenkins, Github Actions, or Gitlab CI/CD
Familiarity with AWS Services not limited to Glue, Athena, Lambda, S3, and DynamoDB
Demonstrated experience implementing data management life cycle, using data quality functions like standardization, transformation, rationalization, linking and matching.

The Offer

Bonus eligible

You will receive the following benefits:

Medical, dental, vision, and life insurance plans
Paid time off (PTO) and 6 company holidays per year
Automatic 6% 401(k) company contribution each pay period
Employee discounts, parental leave, 3:1 match on donations and tuition reimbursement
A comprehensive set of emotional, financial, physical and social wellbeing programs

Applicants must be currently authorized to work in the US on a full-time basis now and in the future.

#LI-EM1

Posted by: Esteban Medina

Specialization: Data Engineering

Staff Data Engineer / Multimodal AI

Related Jobs

Microsoft Data Engineer / Azure / Synapse / ADF

Lead Data Engineer (Python/Docker/k8s)

Sr Staff Software Engineer / Python / Databricks / AWS

Lead Data Engineer / Azure / ADF / AKS

Chief Software Engineer / Startup / AWS / Python

Senior Software Engineer / Databricks / Terraform / AWS