Promote datasets in analytical tools to the Feature Store, ensuring data quality and trustworthiness.
Implement automatic data quality detection algorithms and support data quality using built-in/custom visual tools.
Conduct visual data quality assessments and statistical summaries for both categorical and numerical data.
Handle unique and invalid values, outliers, and perform anomaly detection using machine learning-based assessments.
Requirements:
Must have Hashicorp Terraform Certification and Azure Cloud Infra Certification.
Preferably have Data Engineering certification (e.g., Dataiku, Databricks, Snowflake).
Experience in AI/ML tooling and MLOps practice setup.
Proficiency in Terraform and Python coding.
Bachelor’s degree in Computer Science/Information Technology, Engineering (Computer/Telecommunication), or equivalent.
Certification in Model Ops or MLOps is advantageous.
Experience with cloud deployments on Google Cloud Platform, Amazon Web Services, or Microsoft Azure.
Proficiency in drafting solution architecture frameworks relying on APIs and microservices.
Expertise in distributed computing environments/big data platforms (e.g., Cloudera, Hadoop, Apache Spark, Elasticsearch) and common databases (e.g., SQL, Hive, HBase).
Hands-on experience with modeling tools like DataIKU, SAS Viya, or Azure MLOps.