Hey Eric, great question! For the scenario you brought up about data capture, we can do that by
capturing a reference profile during your training cycle and then using that to monitor your model or data pipeline in production as well as trigger events based on drift, quality, or other types of anomalies. We an tie into your alerting systems and trigger automatic retraining as well.
In general WhyLabs aims to provide a platform agnostic approach to monitoring data and models across a lot of different architectures. We tend to see a few common differences with SageMaker Monitor and WhyLabs such as 1) profiling 100% of your data vs being forced to sample, 2) accommodating different types of data such as text, images, embeddings, and more compared to only supporting tabular data and 3) creating an ML optimized user experience for implementing monitoring and observability, often times the pain is felt trying to create dashboards and identify root cause as it's built on cloudwatch metrics, WhyLabs focuses on minimal setup and time before getting value out of the platform.
There are a few other things like extending whylogs with custom metrics for unique uses cases, we can go deeper on a call if you'd like. Hopefully this helps!