Towards Data Science
Your home for data science. A Medium publication sharing concepts, ideas and codes.
The Past, Present, and Future of Data Quality Management: Understanding Testing, Monitoring, and…
The data estate is evolving, and data quality management needs to evolve with it.
Barr Moses
May 25
Interpretable Outlier Detection: Frequent Patterns Outlier Factor (FPOF)
An outlier detector method that supports categorical data and provides explanations for the outliers flagged
W Brett Kennedy
May 25
Latest
How to: Handle Missing Data for Time Series
Should you drop, interpolate, or impute?
Haden Pelletier
May 24
From Assumptions to Accuracy: The Role of Conditional Probability in Real-World Predictions
Conditional probability is better than probability, if you have the relevant information
Atisha Rajpurohit
May 24
The Art of Stress Management as a Data Scientist
What you do when you’re not a data scientist could help you become a better data scientist
Zijing Zhu, PhD
May 24
Behind The Scenes: Explaining My Work As A Data Scientist
A breakdown of what my data science role truly entails
Egor Howell
May 24
6 Real-World Uses of Microsoft’s Newest Phi-3 Vision-Language Model
Exploring possible use cases of Phi-3-Vision, a small yet powerful MLLM that can be run locally (with code examples)
Youness Mansar
May 24
Streamline Your Prompts to Decrease LLM Costs and Latency
Discover 5 techniques to optimize token usage without sacrificing accuracy
Jan Majewski
May 24
Real-Time Analytics Solution for Usage-Based API Billing and Metering
Design a real-time analytics pipeline for tracking API invocation usage with Apache APISIX, Redpanda, and Apache Pinot.
Dunith Danushka
May 24
Practical Computer Simulations for Product Analysts
Part 3: Modelling Ops queues
Mariya Mansurova
May 23
Optimising Non-Linear Treatment Effects in Pricing and Promotions
Causal AI, exploring the integration of causal reasoning into machine learning
Ryan O'Sullivan
May 23