thasan (Taimur)
User

I wont comment on the C++, but it's worth running the ML perftests on this, because the first decode token now flushes eagerly, the perf harness records first token arrival earlier, so the measured values will shift even though the metric definitions are unchanged. Expect FIRST_TOKEN_LATENCY to drop and DECODING_TOKEN_SPEED to drop toward realistic values; both are measurement corrections, not regressions, so they shouldn't be triaged as a perf alert / backed out.

Tue, Jun 9, 10:19 PM

thasan accepted D305384: Bug 2005369 - Collect inference metrics in the static embeddings pipeline. r?thasan.

Looks good I can accept, side note regarding the UTF-16 code units, I traced what happens to an emoji through this tokenizer, it's not stripped, not split as punctuation, and gets swallowed into its surrounding word, which collapses to a single [UNK] token. So the metric stays internally consistent emoji input produces both chars and tokens, no char-without-work case.

This revision requires a Testing Policy Project Tag to be set before landing. Please apply one of , , , , . Tip: this Firefox add-on makes it easy!

Tue, Jun 9, 9:26 PM · testing-approved

thasan planned changes to D305724: Bug 2030415 - Assert security properties on chat initiation data sources.

Tue, Jun 9, 8:53 PM · testing-approved

thasan updated the diff for D304597: Bug 2030328 - Add browser_security_get_user_memories.js security test r?gregtatum.

Tue, Jun 9, 3:09 AM · testing-approved

thasan updated the diff for D304596: Bug 2030319 - Add browser_security_search_browsing_history.js security test r?gregtatum.

Tue, Jun 9, 3:09 AM · testing-approved

thasan updated the diff for D302172: Bug 2030325 - Add browser_security_run_search.js end-to-end security test r?gregtatum.

Tue, Jun 9, 3:09 AM · testing-approved

thasan updated the diff for D303228: Bug 2030307 - Add browser_security_get_open_tabs.js security test r?gregtatum.

Tue, Jun 9, 3:09 AM · testing-approved

thasan updated the diff for D302172: Bug 2030325 - Add browser_security_run_search.js end-to-end security test r?gregtatum.

Tue, Jun 9, 1:52 AM · testing-approved

thasan added inline comments to D303228: Bug 2030307 - Add browser_security_get_open_tabs.js security test r?gregtatum.

Tue, Jun 9, 12:25 AM · testing-approved

thasan updated the diff for D304597: Bug 2030328 - Add browser_security_get_user_memories.js security test r?gregtatum.

Tue, Jun 9, 12:10 AM · testing-approved

thasan updated the diff for D304596: Bug 2030319 - Add browser_security_search_browsing_history.js security test r?gregtatum.

Tue, Jun 9, 12:10 AM · testing-approved

thasan updated the diff for D302172: Bug 2030325 - Add browser_security_run_search.js end-to-end security test r?gregtatum.

Tue, Jun 9, 12:10 AM · testing-approved

thasan updated the diff for D303228: Bug 2030307 - Add browser_security_get_open_tabs.js security test r?gregtatum.

Tue, Jun 9, 12:10 AM · testing-approved

thasan abandoned D305426: Bug 2030307 - Add browser_security_get_open_tabs.js security test r?gregtatum.

Tue, Jun 9, 12:10 AM

thasan updated the diff for D304597: Bug 2030328 - Add browser_security_get_user_memories.js security test r?gregtatum.

Tue, Jun 9, 12:01 AM · testing-approved

thasan updated the diff for D304596: Bug 2030319 - Add browser_security_search_browsing_history.js security test r?gregtatum.

Tue, Jun 9, 12:01 AM · testing-approved

thasan requested review of D302172: Bug 2030325 - Add browser_security_run_search.js end-to-end security test r?gregtatum.

Tue, Jun 9, 12:01 AM · testing-approved

thasan created D305426: Bug 2030307 - Add browser_security_get_open_tabs.js security test r?gregtatum.

Tue, Jun 9, 12:01 AM

Mon, Jun 8

thasan closed D303933: Bug 2044189 - Route toolkit/ml telemetry notification_emails to firefox-ai-and-ml@mozilla.com.

Mon, Jun 8, 8:39 PM · data-classification-unnecessary, testing-exception-unchanged (Doesn't change behavior for users)

thasan committed rFIREFOXAUTOLAND1e0ac6f1c01a: Bug 2044189 - Route toolkit/ml telemetry notification_emails to firefox-ai-and… (authored by thasan).

Bug 2044189 - Route toolkit/ml telemetry notification_emails to firefox-ai-and…

Mon, Jun 8, 8:39 PM

thasan closed D300848: Bug 2040008 - Add ai and ai-perf mach try presets r?#ai-platform-reviewers.

Mon, Jun 8, 8:38 PM · testing-approved

thasan committed rFIREFOXAUTOLAND567b1e385fe0: Bug 2040008 - Add ai and ai-perf mach try presets r=rrando (authored by thasan).

Bug 2040008 - Add ai and ai-perf mach try presets r=rrando

Mon, Jun 8, 8:38 PM

thasan edited projects for D303933: Bug 2044189 - Route toolkit/ml telemetry notification_emails to firefox-ai-and-ml@mozilla.com, added: data-classification-unnecessary; removed Restricted Project.

Mon, Jun 8, 8:36 PM · data-classification-unnecessary, testing-exception-unchanged (Doesn't change behavior for users)

thasan accepted D303364: Bug 2042664 - Disable IPv6 for the perf test to avoid latency from unreachable IPv6, r=#ai-platform-reviewers.

Looks good to me, we are going to have to check the glean telemetry to see what impact is made.

This revision requires a Testing Policy Project Tag to be set before landing. Please apply one of , , , , . Tip: this Firefox add-on makes it easy!

Mon, Jun 8, 8:02 PM · testing-exception-unchanged (Doesn't change behavior for users)

Thu, Jun 4

thasan closed D300423: Bug 2012177 - Add a "best-onnx" backend that chooses between onnx-native and wasm onnx r=gregtatum.

Thu, Jun 4, 11:45 PM · testing-approved

thasan committed rFIREFOXAUTOLANDff2569a4a447: Bug 2012177 - Add a "best-onnx" backend that chooses between onnx-native and… (authored by jbowser).

Bug 2012177 - Add a "best-onnx" backend that chooses between onnx-native and…

Thu, Jun 4, 11:44 PM

thasan closed D302656: Bug 2005365 - Collect inference metrics in the llama.cpp pipeline. r?thasan.

Thu, Jun 4, 11:42 PM · testing-approved

thasan committed rFIREFOXAUTOLAND7e024107d83e: Bug 2005365 - Collect inference metrics in the llama.cpp pipeline. r=thasan,ai… (authored by jbowser).

Bug 2005365 - Collect inference metrics in the llama.cpp pipeline. r=thasan,ai…

Thu, Jun 4, 11:42 PM

thasan edited projects for D300423: Bug 2012177 - Add a "best-onnx" backend that chooses between onnx-native and wasm onnx r=gregtatum, added: testing-approved; removed needs-testing-tag.

Thu, Jun 4, 11:42 PM · testing-approved

thasan edited projects for D302656: Bug 2005365 - Collect inference metrics in the llama.cpp pipeline. r?thasan, added: testing-approved; removed needs-testing-tag.

Thu, Jun 4, 11:38 PM · testing-approved

Wed, Jun 3

thasan updated the diff for D300848: Bug 2040008 - Add ai and ai-perf mach try presets r?#ai-platform-reviewers.

Wed, Jun 3, 10:59 PM · testing-approved

thasan updated the diff for D303933: Bug 2044189 - Route toolkit/ml telemetry notification_emails to firefox-ai-and-ml@mozilla.com.

Wed, Jun 3, 10:58 PM · data-classification-unnecessary, testing-exception-unchanged (Doesn't change behavior for users)

thasan created D304597: Bug 2030328 - Add browser_security_get_user_memories.js security test r?gregtatum.

Wed, Jun 3, 10:40 PM · testing-approved

thasan created D304596: Bug 2030319 - Add browser_security_search_browsing_history.js security test r?gregtatum.

Wed, Jun 3, 10:40 PM · testing-approved

thasan updated the diff for D302172: Bug 2030325 - Add browser_security_run_search.js end-to-end security test r?gregtatum.

Wed, Jun 3, 10:39 PM · testing-approved

thasan updated the diff for D303228: Bug 2030307 - Add browser_security_get_open_tabs.js security test r?gregtatum.

Wed, Jun 3, 10:39 PM · testing-approved

thasan accepted D300423: Bug 2012177 - Add a "best-onnx" backend that chooses between onnx-native and wasm onnx r=gregtatum.

Accepting, The best-onnx design is good. Im going to note that it might be important to run a ./mach try run here to make sure we didnt break anything surrounding best-llama, and smart tab.

This revision requires a Testing Policy Project Tag to be set before landing. Please apply one of , , , , . Tip: this Firefox add-on makes it easy!

Wed, Jun 3, 7:26 PM · testing-approved

thasan accepted D302656: Bug 2005365 - Collect inference metrics in the llama.cpp pipeline. r?thasan.

Thanks for handling the feedback, this looks a lot better. Noting here that this path intentionally diverges from ONNX on throughput: tokensPerSecond/timePerOutputToken are computed over decodingTime (decode-only) rather than ONNX's inferenceTime (prefill+decode). This is a different generation engine, and I think the decode-window pattern here is more correct than what ONNX currently does.

This revision requires a Testing Policy Project Tag to be set before landing. Please apply one of , , , , . Tip: this Firefox add-on makes it easy!

Wed, Jun 3, 6:30 PM · testing-approved

Tue, Jun 2

thasan updated the diff for D303228: Bug 2030307 - Add browser_security_get_open_tabs.js security test r?gregtatum.

Tue, Jun 2, 10:32 PM · testing-approved

Mon, Jun 1

thasan created D303933: Bug 2044189 - Route toolkit/ml telemetry notification_emails to firefox-ai-and-ml@mozilla.com.

Mon, Jun 1, 10:55 PM · data-classification-unnecessary, testing-exception-unchanged (Doesn't change behavior for users)

thasan accepted D303646: Bug 2043884 - Fix broken semantic search test. Test needed to be brought up to date with system changes. Issue is with test only..

LGTM thanks for adding the the remote perf run.

This revision requires a Testing Policy Project Tag to be set before landing. Please apply one of , , , , . Tip: this Firefox add-on makes it easy!

Mon, Jun 1, 7:58 PM · needs-testing-tag

thasan requested review of D302172: Bug 2030325 - Add browser_security_run_search.js end-to-end security test r?gregtatum.

Mon, Jun 1, 7:19 PM · testing-approved

thasan updated the diff for D303228: Bug 2030307 - Add browser_security_get_open_tabs.js security test r?gregtatum.

Mon, Jun 1, 7:19 PM · testing-approved

Thu, May 28

thasan updated the diff for D303228: Bug 2030307 - Add browser_security_get_open_tabs.js security test r?gregtatum.

Thu, May 28, 11:44 PM · testing-approved

thasan abandoned D302171: WIP: Bug 2018054 - Add browser_security_get_open_tabs.js security test.

Thu, May 28, 11:40 PM

thasan abandoned D303227: Bug 2018054 - Add browser_security_get_open_tabs.js security test.

Thu, May 28, 11:28 PM

thasan created D303228: Bug 2030307 - Add browser_security_get_open_tabs.js security test r?gregtatum.

Thu, May 28, 11:26 PM · testing-approved

thasan requested review of D303227: Bug 2018054 - Add browser_security_get_open_tabs.js security test.

Thu, May 28, 11:24 PM

thasan planned changes to D303227: Bug 2018054 - Add browser_security_get_open_tabs.js security test.

Thu, May 28, 11:23 PM

thasan accepted D300160: Bug 2039101 - Turn on native coarse bit indexing, sqlite-vec to 0.1.10 r=mak.

Thu, May 28, 11:21 PM · Restricted Project, testing-approved

thasan requested changes to D302656: Bug 2005365 - Collect inference metrics in the llama.cpp pipeline. r?thasan.

Thanks Joe, getting llama.cpp onto the structured metrics object is good progress, and the test is a good add.

Thu, May 28, 9:26 PM · testing-approved

thasan accepted D299398: Bug 2033537 - added telemetry engine for running llm-based telemetry r?#ai-models-reviewers,tliu.

Thanks for addressing all the feedback, the implementation looks good. Feedback for next time,this patch bundles several unrelated changes (drivebys) into one bug. Going forward, splitting unrelated work into its own bugs would keep each patch scoped and let things land faster.

Thu, May 28, 6:41 PM · testing-approved, Restricted Project