What youll be doing
- Designing and maintaining data pipelines optimized for ML / AI workloads including handling of large-scale unstructured and semi-structured data.
- Building feature pipelines and feature stores that ensure reusability and consistency of data used by machine learning models.
- Collaborating with Data Scientists and ML Engineers to understand data requirements for training validation and production deployment .
- Ensuring data quality lineage and governance meet standards required for AI / ML applications.
- Supporting MLOps practices by integrating data pipelines with model training monitoring and deployment workflows.
- Leveraging distributed processing frameworks (e.g. Spark Databricks Azure Synapse) for scalable ML data processing .
Qualifications : What you bring
8 years of experience as a Data Engineer working with Azure and Databricks ideally with exposure to ML / AI-related data workflows .College degree that demonstrates your analytic abilities such as Econometrics Computer Sciences Mathematics or similar;Excellent analytical and problem-solving skills;Experience with data preparation for ML / AI : managing large datasets feature engineering and real-time or batch data pipelines.Familiarity with MLOps concepts and how data engineering supports model lifecycle management.Experience with orchestration frameworks (Airflow Prefect or Azure Data Factory) for complex ML pipelines .Knowledge of unstructured data processing (text images logs) is a plus.Strong SQL and Python skills; experience with distributed data processing (PySpark Dask etc.) is a plus.Who we are :
So what does it mean to be a part of the Sana Commerce team
At Sana Commerce our values guide how we work collaborate and drive success.
Champions of Our League. We deliver lasting success balancing quick wins and long-term value.We take pride in our unique product and extensive B2B knowledge and continuously strive to improve. No matter our role we bring value every day helping our customers and partners succeed.
Supercharge Our Customers. Were revolutionizing B2B commerce together helping our customers to lead and succeed.Our customers are at the heart of everything we do. We go beyond solutions providing the tools and support they need to grow.
Determined to Grow. We embrace challenges growing and raising the bar for ourselves and our industry.We take on challenges seek feedback and keep learning. Every setback is a chance to improve and move forward.
Bold Together. We dare to be bold because we have each others back.We collaborate across teams and time zones challenge the status quo and support each other to achieve the best outcomes.
Job descriptions can be tough to interpret. Even if you may not tick all the boxes please explain your motivation for the role of Data Engineer (AI / ML) in a cover letter we strongly encourage you to apply if you still feel like you are a great match for this role. Apply now!
Additional Information :
#LI-Hybrid
#LI-SV1
Remote Work : No
Employment Type : Full-time
Key Skills
Apache Hive,S3,Hadoop,Redshift,Spark,AWS,Apache Pig,NoSQL,Big Data,Data Warehouse,Kafka,Scala
Experience : years
Vacancy : 1