The team has started work on the MLOps stack for Data Labeling. Both traditional machine learning training and fine-tuning of generative AI models require labeled data. The first goal for this KR is to conduct an ecosystem evaluation of open source labeling tools, likely including but not limited to INCEpTION and Label Studio, and to deliver recommendations to the RHODS engineering team. The team has begun looking into open source tools that support data labeling and plans to then create a POC deployment on the ET Data Science OpenShift cluster.
Open MLOps Stacks – Data Labeling
Senior Data Scientist