Evaluation
Assessing plausibility and usefulness of data we generated from real data
9 min read -
Building real-world skills through hands-on trial and error.
8 min read -
How to Stop Blaming the ‘Model’ and Start Building Successful AI Products
10 min read -
How to stop worrying and love the data
11 min read -
Applying chat templates to generative LM evaluation tests
8 min read -
Leveraging LangChain to move from ad-hoc Jupyter Notebooks to production modular service
27 min read -
A new RecList to provide more flexibility and better support for evaluation
8 min read -
Beyond Accuracy: Exploring Exotic Metrics for Holistic Evaluation of Machine Learning Models
ChatGPT
Machine learning has undoubtedly become a powerful tool in today’s data-driven world, but are we…
14 min read -
A series of mechanisms and tests one can use to evaluate any tabular synthetic dataset,…
8 min read