Christian Kaestner
Required Readings: 🕮 Hulten, Geoff. "Building Intelligent Systems: A Guide to Machine Learning Engineering." (2018), Chapters 2 (Knowing when to use IS), 4 (Defining the IS’s Goals) and 15 (Intelligent Telemetry)
Suggested complementary reading: 🕮 Ajay Agrawal, Joshua Gans, Avi Goldfarb. “Prediction Machines: The Simple Economics of Artificial Intelligence” 2018
Examples?
Examples?
see Hulten, Chapter 2
Examples?
see Hulten, Chapter 2
Big problem? Open ended? Time changing? Hard? Partial system viable? Data continuously available? Influence objectives? Cost effective?
🕮 Ajay Agrawal, Joshua Gans, Avi Goldfarb. “Prediction Machines: The Simple Economics of Artificial Intelligence” 2018
AI: Higher accuracy predictions at much much lower cost
May use new, cheaper predictions for traditional tasks (examples?)
May now use predictions for new kinds of problems (examples?)
May now use more predictions than before
(Analogies: Reduced cost of light, reduced cost of search with the internet)
Predictions are an input to decision making under uncertainty
Making the decision requires judgement (determining relative payoffs of decisions and outcomes)
Judgement often left to humans ("value function engineering")
ML may learn to predict human judgment if enough data
Determine value function from value of each outcome and probability of each outcome
🕮 Ajay Agrawal, Joshua Gans, Avi Goldfarb. “Prediction Machines: The Simple Economics of Artificial Intelligence” 2018
What contributes to the average cost of a single prediction?
Examples: Credit card fraud detection, product recommendations on Amazon
Discuss in groups
Each group pick a popular open-source system (e.g., Firefox, Kubernetis, VS Code, WordPress, Gimp, Audacity)
Think of possible extensions with and without machine learning
Report back 1 extension that would benefit from ML and one that would probably not
10 min
Guiding questions:
Some are easier to measure then others (telemetry), some are noisier than others, some have more lag
Innate/overall goals of the organization
accurate models themselves are not a goal
AI may only very indirectly influence such organizational objectives, influence hard to quantify, lagging measures
Break overall goals along processes
Maintain mapping from task-specific goals to system goals
Measures correlating with future success, from the business' perspective
indirect proxy measures, lagging, bias
can be misleading (more daily active users => higher profits?)
How well the system is serving its users, from the user's perspective
easier and more granular to measure, but only indirect relation to organization objectives
Quality of the model used in a system, from the model's perspective
not directly linked to business goals
Example: Amazon shopping recommendations
Closely watch model properties for degradation; optimize accuracy; ongoing
Weekly review user outcomes, e.g., sales, reviews returns
Monthly review trends of leading indicators, e.g., shopper loyalty
Quarterly ensure look at organizational objectives
Telemetry?
Discuss in groups, breakout rooms
What are the goals behind automating admissions decisions?
Organizational objectives, leading indicators, user outcomes, model properties?
Report back in 10 min
Measurement is the empirical, objective assignment of numbers, according to a rule derived from a model or theory, to attributes of objects or events with the intent of describing them. – Craner, Bond, “Software Engineering Metrics: What Do They Measure and How Do We Know?"
A quantitatively expressed reduction of uncertainty based on one or more observations. – Hubbard, “How to Measure Anything …"
But: Not every measure is precise, not every measure is cost effective
🕮 Douglas Hubbard, “How to Measure Anything: finding the value of intangibles in business" 2014
max(0,(171−5.2ln(HV)−0.23CC−16.2ln(LOC)∗100/171)) <10: low maintainability, ≤20 high maintainability
Further reading: Arie van Deursen, Think Twice Before Using the “Maintainability Index”, Blog Post, 2014
Aside: Understanding scales of features is also useful for encoding or selecting learning strategies in ML
Often higher-level measures are composed from lower level measures
clear trace from specific low-level measurements to high-level metric
For design strategy, see Goal-Question-Metric approach
Make sure data scientists and software engineers share goals and success measures