
Implement instrumentation with statsv for Phonos
Closed, Resolved · Public · 2 Estimated Story Points

Description

CONTEXT

This ticket fleshes out the set of data-driven questions that can help us understand how to:

  • Measure the impact of generating pronunciation audio for readers
  • Optimize the tool for future use

DATA QUESTIONS

Product - BEFORE LAUNCH of PHONOS
  • How often are users trying to listen to pronunciations by clicking on IPA syntax?
Product - AFTER LAUNCH of PHONOS
  • How often are users trying to listen to rendered pronunciations?
  • How often are users trying to listen to rendered pronunciations but failing to hear anything?
  • What's the average load time for audio?
Engineering

(Currently no engineering events to log)

New Instrumentation

| Proposed new event name | Action that will trigger new event | Schema where event will be logged | Properties being tracked |
|---|---|---|---|
| timing.MediaWiki.extension.Phonos.IPA.can_play_through | Triggered when an audio finishes playing successfully. | statsv | Tracks the time an audio takes to first load and finish playing, by_lang and by_wiki |
| counter.MediaWiki.extension.Phonos.IPA.click | Triggered when the user clicks on Phonos-generated content (speaker icon, IPA). | statsv | Tracks the count of clicks on Phonos audio files, by_lang and by_wiki |
| counter.MediaWiki.extension.Phonos.IPA.error | Triggered when Phonos fails to play an audio. | statsv | Counts the times Phonos fails to play an audio, by_lang and by_wiki |
| counter.MediaWiki.extension.Phonos.IPA.replay | Triggered when Phonos is clicked for a replay. | statsv | Tracks the clicks that trigger a replay, by_lang and by_wiki |

Note: session-id is suggested as a property since it may be useful
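
For illustration, a minimal sketch of how two of these metrics could be emitted from the extension's client-side JavaScript, assuming the statsv protocol handler that the WikimediaEvents extension registers with mw.track; the lang value is a hypothetical placeholder for however Phonos exposes the audio's language:

const dbName = mw.config.get( 'wgDBname' );
const lang = 'en'; // hypothetical: the language of the IPA being played

// Count a click on a Phonos control, overall and partitioned.
mw.track( 'counter.MediaWiki.extension.Phonos.IPA.click', 1 );
mw.track( `counter.MediaWiki.extension.Phonos.IPA.click_by_wiki.${dbName}`, 1 );
mw.track( `counter.MediaWiki.extension.Phonos.IPA.click_by_lang.${lang}`, 1 );

// Time from the click until playback completes, in milliseconds.
const startedAt = mw.now();
// … later, once the audio has played through:
mw.track( 'timing.MediaWiki.extension.Phonos.IPA.can_play_through', mw.now() - startedAt );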

STRETCH GOAL

If possible, it would also be great to set up a Quick Survey to gauge reader/listener reactions.
Since the data-tracking events will only give us insight into whether and how people are clicking on the audio files to listen to them, it will be hard to infer whether they found the pronunciations useful.
The goal of the Quick Survey would be to collect data on how useful readers found the audio.
The Quick Survey should trigger after the entire audio file plays. The survey should only be one question:

  • How helpful was this audio in helping you understand how to pronounce this term?
  • Scale of 1-5 (1 Not Helpful, 5 Helpful)

Event Timeline

HMonroy removed HMonroy as the assignee of this task.
HMonroy subscribed.

Hi @mpopov! We need help with adding instrumentation to our Phonos project. This is a new extension that we are in the process of releasing. I believe we would need to do event logging in a Visual Editor feature, but I'm not sure which feature name to use. Should we book a meeting with a data analytics team member?

Hello! o/ Very excited for Phonos and really glad y'all are talking about analytics for it now.

Unlike real-time previews, I don't think it would make sense to plug this into VE Feature Use, since this is for the reading experience.

My recommendation would be to check in with @EChetty & @WDoranWMF about using the Metrics Platform for this. The pitch is that you write an instrument that logs just the custom data (e.g. which engine, load time for audio), and the Metrics Platform takes care of much of the contextual info, such as whether the user was logged in or which page they were on (see https://wikitech.wikimedia.org/wiki/Metrics_Platform/Event_Schema for more properties that the Metrics Platform can fill in for you). Which properties you request is configured separately, so changing them doesn't require any changes to the instrumentation or redeploying the extension.

But yes, if you would like to consult somebody in Product Analytics you're welcome to: https://www.mediawiki.org/wiki/Product_Analytics/Consultation_Hours

@HMonroy Just want to point out that WDoran went on leave for two months, so you are not worried if you don't hear back.

@phuedx @EChetty Would we need a new schema for this extension? Or is there a current schema that we would be logging the events to?

The following is a summary of what @HMonroy, @TheresNoTime, and I spoke about yesterday:

There are two ways that this could be instrumented:

  1. Using statsv
  2. Using the Metrics Platform

Using statsv

statsv allows you to stand up instruments very quickly. However, the data that you can capture is limited to a metric name coupled with a count or a timing. So, for example, you can answer questions like "How many times has this UX element been clicked?" or "How long did that request take?".

You can partition your data broadly by designing your metric names carefully. For example, if you wanted to answer "How many times has this UX element been clicked per wiki?", then you would need to include the wiki name in the metric name, i.e.

// wgDBname identifies the wiki, e.g. 'enwiki' or 'eswiktionary'.
const dbName = mw.config.get( 'wgDBname' );
const metrics = [
  'MediaWiki.extension.Phonos.IPA.click',
  `MediaWiki.extension.Phonos.IPA.click_by_wiki.${dbName}`
];

// Send both counters via the 'counter.*' topic that the statsv module
// in WikimediaEvents subscribes to.
metrics.forEach( ( metric ) => mw.track( `counter.${metric}`, 1 ) );

The data are stored in Prometheus and can power Grafana dashboards. For instance, the Page Previews dashboard is powered by metrics collected via statsv.

Testing statsv

At the moment, production is the only environment where you can test your statsv-based instrument end-to-end. Previously, I have written QA instructions along the lines of

Observe that an HTTP request is made to /beacon/statsv?MediaWiki.extensions.Phonos.IPA.click=1c
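
A timing metric shows up in the same query-string format, with an ms suffix instead of c; for example (illustrative metric name and value, not an actual logged request):

Observe that an HTTP request is made to /beacon/statsv?MediaWiki.extension.Phonos.IPA.can_play_through=1234ms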

Edit: https://www.mediawiki.org/wiki/MediaWiki-Docker/Configuration_recipes/EventLogging is a thorough guide on how to get a production-like Event Platform/Legacy EventLogging testing environment set up locally. This environment includes the WikimediaEvents extension, which provides the statsv protocol handler.

Metrics Platform/Event Platform

The Event Platform allows you to stand up instruments that capture rich data to answer equally rich questions. For example, you can answer questions like "How frequently are pronunciations listened to completely?", "How frequently does a user listen to the pronunciation and then navigate to the corresponding File page?", and "How many unique users (devices) listen to a pronunciation?"

The Metrics Platform is an opinionated Event Platform client.

Firstly, the Metrics Platform owns and maintains the schema with which your events will be validated – we call it the monoschema – so you do not have to create a new schema for each new instrument. The monoschema has properties for the most common/instrument-agnostic data that teams might need to answer their questions, e.g. session ID, pageview ID, and the namespace and title of the current page. It also has a property that can hold instrument-specific data.

Secondly, the Metrics Platform works with event names and data rather than streams and events. That is, rather than writing an instrument that submits events to a specific stream, you write an instrument that dispatches events to zero or more interested streams.

Now, because the Metrics Platform is an opinionated Event Platform client, you still have to define a stream and configure it to be interested in the events that your instrument is dispatching. See https://wikitech.wikimedia.org/wiki/User:Phuedx/Metrics_Platform/Getting_Started/Creating_An_Instrument and https://wikitech.wikimedia.org/wiki/User:Phuedx/Metrics_Platform/Creating_a_Stream_Configuration for examples. Once you have created a stream, then you can start dispatching events:

mw.eventLog.dispatch( 'web.ui.ipa.click' );

const startedAt = mw.now();

// Elsewhere, once the audio has finished loading…
const timeToLoad = mw.now() - startedAt;

mw.eventLog.dispatch( 'web.ui.ipa.play', { time_to_load: timeToLoad } );

Testing Metrics Platform/Event Platform

We did not have time to talk about testing your Metrics Platform/Event Platform based instrumentation.

https://www.mediawiki.org/wiki/MediaWiki-Docker/Configuration_recipes/EventLogging is a thorough guide on how to get a production-like Event Platform/Legacy EventLogging testing environment set up locally. After following the guide you will be able to submit events to a containerised EventGate instance that can load schemas from a local path.
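
Once that environment is up, a quick smoke test from the browser console could look like the following sketch; the stream name and schema URI here are hypothetical and would need to exist in your local stream and schema configuration:

mw.loader.using( 'ext.eventLogging' ).then( () => {
	mw.eventLog.submit( 'phonos.test', {
		// $schema must point at a schema your local EventGate can load.
		$schema: '/analytics/phonos/test/1.0.0',
		action: 'click'
	} );
} );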

HMonroy renamed this task from "Add EventLogging to Phonos" to "Implement instrumentation with statsv for Phonos". Oct 12 2022, 10:36 PM

@NRodriguez do we want to answer the questions below per wiki? Per language?

How often are users trying to listen to rendered pronunciations?
How often are users trying to listen to rendered pronunciations but failing to hear anything?
What's the average load time for audio?

> do we want to answer the questions below per wiki? Per language?

Can it be by both?
Aka, could I look at the difference in how it's performing on Spanish Wiktionary versus Spanish Wikipedia?
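
For what it's worth, the by_wiki partition alone can support that comparison, since database names encode both language and project. Illustrative (hypothetical) metric names:

counter.MediaWiki.extension.Phonos.IPA.click_by_wiki.eswiki (Spanish Wikipedia)
counter.MediaWiki.extension.Phonos.IPA.click_by_wiki.eswiktionary (Spanish Wiktionary)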

Change 844067 had a related patch set uploaded (by HMonroy; author: HMonroy):

[mediawiki/extensions/Phonos@master] Add instrumentation with statsv

https://gerrit.wikimedia.org/r/844067

Manually triggering some events on the Beta Cluster (which, for reference, copies production DBnames, so en.wikipedia.beta is also enwiki) and checking in the labs instance of graphite:

image.png (748×983 px, 96 KB)

Looks good! When in production, we'll be able to use the much nicer https://thanos.wikimedia.org to confirm metric logging.

Change 844067 merged by jenkins-bot:

[mediawiki/extensions/Phonos@master] Add instrumentation with statsv

https://gerrit.wikimedia.org/r/844067

Change 844520 had a related patch set uploaded (by HMonroy; author: HMonroy):

[mediawiki/extensions/Phonos@master] Clean statsv tracking

https://gerrit.wikimedia.org/r/844520

We set the code to track:

Timing - the time it took the media to play through to the end (only the first time it was played after the page was loaded)
Count - clicks, errors, and replays
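
For illustration only, a minimal sketch of that behaviour, assuming Phonos wraps an HTMLAudioElement; onPlayButtonClick and the error handling shown are hypothetical, not the actual extension code:

let firstPlayTracked = false;

function onPlayButtonClick( audio ) {
	const startedAt = mw.now();

	mw.track( 'counter.MediaWiki.extension.Phonos.IPA.click', 1 );

	// Time only the first successful play-through after page load.
	audio.addEventListener( 'ended', () => {
		if ( !firstPlayTracked ) {
			firstPlayTracked = true;
			mw.track( 'timing.MediaWiki.extension.Phonos.IPA.can_play_through', mw.now() - startedAt );
		}
	}, { once: true } );

	audio.play().catch( () => {
		mw.track( 'counter.MediaWiki.extension.Phonos.IPA.error', 1 );
	} );
}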

@NRodriguez please let me know if we should also time replays. I was thinking we would time only the first play and not replays.

> @NRodriguez please let me know if we should also time replays. I was thinking we would time only the first play and not replays.

Let's track replays; I think it would help us understand if people are repeatedly using the feature to learn how to pronounce things.

@NRodriguez we are currently counting replay clicks, but we are not tracking the time a replay takes. We are tracking the time it takes for an audio to load and play for the first time. Do you think we should also track the timing of replays? Thank you!

Change 844520 merged by jenkins-bot:

[mediawiki/extensions/Phonos@master] Clean statsv tracking

https://gerrit.wikimedia.org/r/844520

HMonroy updated the task description. (Show Details)

@HMonroy I clicked on 5 IPA words, 5 times each. I also added an extra partial play before one completed; in total it recorded 5 statsv requests, as seen in the screenshots, since they are grouped. I also checked the dashboard and it registered all 26 of my clicks.

I also tested an error by using a made-up language code, which did record a statsv request, as seen in the error screenshot. Is that error supposed to also show under "Phonos errors by wiki" on the dashboard? If it was supposed to, it did not register there.

Test Site: https://en.wikipedia.beta.wmflabs.org/wiki/Phonos
Dashboard Site: https://grafana-labs.wikimedia.org/d/wiQMOQI4k/phonos-stats-beta?orgId=1&refresh=10s&from=now-3h&to=now
Browser: Chrome

Registered Clicks

T315091_Phonos_Statsv_Chrome.png (1×1 px, 388 KB)

T315091_Phonos_Statsv_Dashboard_Chrome.png (900×2 px, 139 KB)

Test: Error

T315091_Phonos_Statsv_TestError_Chrome.png (824×2 px, 254 KB)

T315091_Phonos_Statsv_TestError_Dashboard_Chrome.png (1×1 px, 110 KB)

@GMikesell-WMF We needed to modify the panels in Grafana so that they correctly pull and display the data. You should be able to see your errors now at https://grafana-labs.wikimedia.org/d/wiQMOQI4k/phonos-stats-beta?from=1668661491446&orgId=1&to=1669740325660 :)

@HMonroy Got it! I see the errors now. I will move this to Product Sign-off. Thanks!