dancy (Ahmon Dancy)
Staff Software Engineer, Release EngineeringAdministrator

for n in $(seq 12 14); do host=deployment-cirrussearch$n.deployment-prep.eqiad1.wikimedia.cloud; echo $host; sudo puppetserver ca clean --certname $host; done

Thu, Jun 25, 3:04 PM · Beta-Cluster-Infrastructure

dancy merged T428819: No Puppet resources found on instance deployment-cirrussearch14 on project deployment-prep into T424100: No Puppet resources found on instance deployment-cirrussearch14 on project deployment-prep.

Thu, Jun 25, 3:04 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Beta-Cluster-Infrastructure

dancy merged task T428819: No Puppet resources found on instance deployment-cirrussearch14 on project deployment-prep into T424100: No Puppet resources found on instance deployment-cirrussearch14 on project deployment-prep.

Thu, Jun 25, 3:03 PM · Beta-Cluster-Infrastructure

dancy added a comment to T430165: Found non-revoked Puppet certificates for 4 deleted instances on deployment-puppetserver-1.

I ran

sudo puppetserver ca clean --certname deployment-dancy2.deployment-prep.eqiad1.wikimedia.cloud
sudo puppetserver ca clean --certname deployment-dancy3.deployment-prep.eqiad1.wikimedia.cloud

to clean up after some test instances.

Thu, Jun 25, 2:58 PM · Beta-Cluster-Infrastructure

Wed, Jun 24

dancy closed T430075: Properly handle mediawiki code/config updates on deployment-prep jobrunners as Resolved.

Wed, Jun 24, 6:54 PM · Beta-Cluster-Infrastructure

dancy closed T429662: No Puppet resources found on instance deployment-changeprop-1 on project deployment-prep as Resolved.

I chose to resolve this by deleting the two running Docker containers and making puppet recreate them:

Wed, Jun 24, 6:50 PM · Beta-Cluster-Infrastructure

dancy added a comment to T429662: No Puppet resources found on instance deployment-changeprop-1 on project deployment-prep.

The biggest consumer is

-rw-r----- 1 root root 9.0G Jun 24 18:42 /var/lib/docker/containers/c5a95725142c1168d1dca1c5a6bd3bf4ec5df287619997124dceebf1084baa56/c5a95725142c1168d1dca1c5a6bd3bf4ec5df287619997124dceebf1084baa56-json.log

Wed, Jun 24, 6:43 PM · Beta-Cluster-Infrastructure

dancy added a comment to T429662: No Puppet resources found on instance deployment-changeprop-1 on project deployment-prep.

dancy@deployment-changeprop-1:~$ df -t ext4 -h
Filesystem Size Used Avail Use% Mounted on
/dev/sda1 20G 20G 0 100% /

Wed, Jun 24, 6:39 PM · Beta-Cluster-Infrastructure

dancy added a comment to T429662: No Puppet resources found on instance deployment-changeprop-1 on project deployment-prep.

The tail end of output:

Error: Failed to apply catalog: No space left on device @ dir_s_mkdir - /var/lib/puppet/state/state.yaml20260624-2180970-o513qq.lock
Error: Could not save last run local report: No space left on device @ dir_s_mkdir - /var/cache/puppet/public/last_run_summary.yaml20260624-2180970-1uvgjro.lock
Error: Could not send report: No space left on device @ dir_s_mkdir - /var/lib/puppet/state/last_run_report.yaml20260624-2180970-1899ley.lock

Wed, Jun 24, 6:39 PM · Beta-Cluster-Infrastructure

dancy merged T429712: Last Puppet run was over 24 hours ago on instance deployment-changeprop-1 in project deployment-prep into T429662: No Puppet resources found on instance deployment-changeprop-1 on project deployment-prep.

Wed, Jun 24, 6:38 PM · Beta-Cluster-Infrastructure

dancy merged task T429712: Last Puppet run was over 24 hours ago on instance deployment-changeprop-1 in project deployment-prep into T429662: No Puppet resources found on instance deployment-changeprop-1 on project deployment-prep.

Wed, Jun 24, 6:38 PM · Beta-Cluster-Infrastructure

dancy renamed T429988: Buildkit v0.31.1 released from Buildkit v0.31.0 released to Buildkit v0.31.1 released.

Wed, Jun 24, 6:34 PM · Essential-Work, Release-Engineering-Team, GitLab (CI & Job Runners)

dancy renamed T430075: Properly handle mediawiki code/config updates on deployment-prep jobrunners from Enable opcache revalidation on deployment-prep jobrunners to Properly handle mediawiki code/config updates on deployment-prep jobrunners.

Wed, Jun 24, 5:19 PM · Beta-Cluster-Infrastructure

dancy updated the task description for T430075: Properly handle mediawiki code/config updates on deployment-prep jobrunners.

Wed, Jun 24, 5:17 PM · Beta-Cluster-Infrastructure

dancy created T430075: Properly handle mediawiki code/config updates on deployment-prep jobrunners.

Wed, Jun 24, 5:06 PM · Beta-Cluster-Infrastructure

dancy created T430062: Error during startup of php8.3-fpm on deployment-jobrunner05.deployment-prep.

Wed, Jun 24, 3:19 PM · Beta-Cluster-Infrastructure

dancy closed T429978: Project members cannot ssh into newly created deployment-prep instances as Resolved.

In T429978#12048826, @BLiviero-WMF wrote:

Hi! with T429542 being resolved, can you confirm whether this problem is also fixed? thank you!

Wed, Jun 24, 2:59 PM · cloud-services-team, Cloud-VPS

Tue, Jun 23

dancy created T429988: Buildkit v0.31.1 released.

Tue, Jun 23, 6:37 PM · Essential-Work, Release-Engineering-Team, GitLab (CI & Job Runners)

dancy created T429978: Project members cannot ssh into newly created deployment-prep instances.

Tue, Jun 23, 5:30 PM · cloud-services-team, Cloud-VPS

dancy added a comment to T428069: Puppet broken on phabricator-bookworm-3.devtools.eqiad1.wikimedia.cloud.

In T428069#12042082, @Arnoldokoth wrote:

I terminated this host so should not be an issue now. I'll mark this ticket as resolved.

Tue, Jun 23, 2:42 PM · VPS-project-Phabricator, VPS-project-devtools

Thu, Jun 18

dancy added a comment to T428971: Allow configuration of canary and production checks based on deployment target.

Thanks @Scott_French. Your suggested layout and sample config make sense to me and look like a good place to start experimenting with implementation, which I can do next week.

Thu, Jun 18, 7:41 PM · Release-Engineering-Team (Priority Backlog 📥)

Wed, Jun 17

dancy renamed T429542: debian-12.0-bookworm and debian-13.0-trixie image still reference mirrors.wikimedia.org from debian-13.0-trixie image still references mirrors.wikimedia.org to debian-12.0-bookworm and debian-13.0-trixie image still reference mirrors.wikimedia.org.

Wed, Jun 17, 8:56 PM · cloud-services-team, Cloud-VPS

dancy added a subtask for T416707: Sunsetting mirrors.wikimedia.org: T429542: debian-12.0-bookworm and debian-13.0-trixie image still reference mirrors.wikimedia.org.

Wed, Jun 17, 8:29 PM · Patch-For-Review, User-notice, Release-Engineering-Team (Radar), Infrastructure-Foundations, SRE

dancy added a parent task for T429542: debian-12.0-bookworm and debian-13.0-trixie image still reference mirrors.wikimedia.org: T416707: Sunsetting mirrors.wikimedia.org.

Wed, Jun 17, 8:29 PM · cloud-services-team, Cloud-VPS

dancy created T429542: debian-12.0-bookworm and debian-13.0-trixie image still reference mirrors.wikimedia.org.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Wed, Jun 17, 8:29 PM · cloud-services-team, Cloud-VPS

Tue, Jun 16

dancy updated subscribers of T429413: Eliminate sudo rm -rf /var/lib/puppet/ssl step in new deployment-prep WMCS project (and others).

@Andrew Do you anticipate any issues if we set to (or something) in the project? was mentioned in T421244#11847986.

Tue, Jun 16, 10:07 PM · Patch-For-Review, Beta-Cluster-Infrastructure

dancy created T429413: Eliminate sudo rm -rf /var/lib/puppet/ssl step in new deployment-prep WMCS project (and others).

Tue, Jun 16, 10:01 PM · Patch-For-Review, Beta-Cluster-Infrastructure

dancy closed T429364: Increase size of deployment-deploy04.deployment-prep as Resolved.

Done.

Tue, Jun 16, 4:01 PM · Release-Engineering-Team, Beta-Cluster-Infrastructure

dancy created T429364: Increase size of deployment-deploy04.deployment-prep.

Tue, Jun 16, 3:44 PM · Release-Engineering-Team, Beta-Cluster-Infrastructure

Mon, Jun 15

dancy closed T428910: Beta Cluster MariaDB is still 10.6.17, MW now requires 10.11, a subtask of T401839: Migrate deployment-prep away from Debian Bullseye to Bookworm/Trixie, as Resolved.

Mon, Jun 15, 11:46 PM · Epic, Release-Engineering-Team (Priority Backlog 📥), Cloud-VPS (Debian Bullseye Deprecation), Beta-Cluster-Infrastructure

dancy closed T428910: Beta Cluster MariaDB is still 10.6.17, MW now requires 10.11, a subtask of T366644: Raise MediaWiki's MariaDB requirement to 10.6, as Resolved.

Mon, Jun 15, 11:46 PM · MW-1.47-notes (1.47.0-wmf.8; 2026-06-23), Patch-For-Review, MW-1.47-release, Technical-Debt, MediaWiki-libs-Rdbms

dancy closed T428910: Beta Cluster MariaDB is still 10.6.17, MW now requires 10.11 as Resolved.

All servers referenced by operations/mediawiki-config are running MariaDB 10.11 now.
New nodes:

deployment-db15.deployment-prep.eqiad1.wikimedia.cloud
deployment-db16.deployment-prep.eqiad1.wikimedia.cloud

Mon, Jun 15, 11:46 PM · MediaWiki-libs-Rdbms, Beta-Cluster-Infrastructure

dancy closed T429245: Set up deployment-db16 with Trixie and wmf-mariadb1011 as Resolved.

Mon, Jun 15, 8:27 PM · Beta-Cluster-Infrastructure

dancy added a comment to T426827: gitlab workers ulimit nofiles 1073741816 slows down fakeroot.

@fgiunchedi Let us know how things work now if you remove your workaround.

Mon, Jun 15, 7:24 PM · Release-Engineering-Team (Doing 😎), Patch-For-Review, GitLab (CI & Job Runners), collaboration-services

dancy created T429245: Set up deployment-db16 with Trixie and wmf-mariadb1011.

Mon, Jun 15, 6:25 PM · Beta-Cluster-Infrastructure

dancy closed T428930: Set up deployment-db15 with Trixie and wmf-mariadb1011 as Resolved.

Mon, Jun 15, 6:24 PM · Beta-Cluster-Infrastructure

dancy closed T428930: Set up deployment-db15 with Trixie and wmf-mariadb1011, a subtask of T428910: Beta Cluster MariaDB is still 10.6.17, MW now requires 10.11, as Resolved.

Mon, Jun 15, 6:24 PM · MediaWiki-libs-Rdbms, Beta-Cluster-Infrastructure

dancy added a comment to T428930: Set up deployment-db15 with Trixie and wmf-mariadb1011.

In T428930#12012467, @Zabe wrote:

In T428930#12011975, @dancy wrote:

@Zabe I saw that you handled T329577 a few years ago and I'm wondering if you can help me bring deployment-db15 online to take over for deployment-db14 (xref
T428910#12010836).

Sure. Please tell me if I can do something. :)

Mon, Jun 15, 5:04 PM · Beta-Cluster-Infrastructure

dancy closed T429099: Editing on Beta Cluster does not work as Resolved.

Editing is working again.

Mon, Jun 15, 5:02 PM · Beta-Cluster-Infrastructure

dancy added a comment to T429099: Editing on Beta Cluster does not work.

I found the script stalled on . Doing some debugging using I found it blocked on a query to . I logged into deployment-db14 and ran there and I see:

Query caused different errors on master and slave. Error on master: message (format)='Cannot load from %s.%s. The table is probably corrupted' error code=1728 ; Error on slave: actual message='no error', error code=0. Default database: 'repltest'. Query: 'drop database repltest'

Mon, Jun 15, 3:56 PM · Beta-Cluster-Infrastructure

dancy added a comment to T429099: Editing on Beta Cluster does not work.

I'm investigating.

Mon, Jun 15, 3:37 PM · Beta-Cluster-Infrastructure

dancy added a comment to T418778: Flaky Cypress test: wbui2025 add qualifiers: mobile view (wbui2025) - tabular-data qualifier: can add a tabular-data qualifier with lookup:.

In T418778#12016062, @Krinkle wrote:

And again. Can we disable this test until a solution is found? Two months seems long enough as a grace period to "just" fix it directly.

Mon, Jun 15, 2:49 PM · User-zeljkofilipin, MW-1.47-notes (1.47.0-wmf.9; 2026-06-30), Wikidata-Omega, Patch-For-Review, ci-test-error (WMF-deployed Build Failure), Browser-Tests, Wikidata

Thu, Jun 11

dancy added a comment to T428930: Set up deployment-db15 with Trixie and wmf-mariadb1011.

I copied from to and I'm running

zcat /srv/db11-seed.sql.gz | sudo mysql

Thu, Jun 11, 9:43 PM · Beta-Cluster-Infrastructure

dancy added a comment to T428930: Set up deployment-db15 with Trixie and wmf-mariadb1011.

root@deployment-db11:~# time /opt/wmf-mariadb106/bin/mariadb-dump --all-databases --single-transaction --gtid --triggers | gzip > /srv/db11-seed.sql.gz

Thu, Jun 11, 8:34 PM · Beta-Cluster-Infrastructure

dancy added a comment to T428930: Set up deployment-db15 with Trixie and wmf-mariadb1011.

Initialize db stuff:

sudo -u mysql /opt/wmf-mariadb1011/scripts/mariadb-install-db \
 --basedir=/opt/wmf-mariadb1011 \
 --datadir=/srv/sqldata

Thu, Jun 11, 8:31 PM · Beta-Cluster-Infrastructure

dancy added a comment to T428930: Set up deployment-db15 with Trixie and wmf-mariadb1011.

I have a 120Gib volume mounted on .

Thu, Jun 11, 8:00 PM · Beta-Cluster-Infrastructure

dancy updated subscribers of T428930: Set up deployment-db15 with Trixie and wmf-mariadb1011.

@Zabe I saw that you handled T329577 a few years ago and I'm wondering if you can help me bring deployment-db15 online to take over for deployment-db14 (xref
T428910#12010836).

Thu, Jun 11, 7:46 PM · Beta-Cluster-Infrastructure

dancy added a comment to T428930: Set up deployment-db15 with Trixie and wmf-mariadb1011.

Current output:

output14 KBDownload

Thu, Jun 11, 4:48 PM · Beta-Cluster-Infrastructure

dancy updated the task description for T428930: Set up deployment-db15 with Trixie and wmf-mariadb1011.

Thu, Jun 11, 4:11 PM · Beta-Cluster-Infrastructure

dancy updated the task description for T428930: Set up deployment-db15 with Trixie and wmf-mariadb1011.

Thu, Jun 11, 4:09 PM · Beta-Cluster-Infrastructure

dancy triaged T428930: Set up deployment-db15 with Trixie and wmf-mariadb1011 as High priority.

Thu, Jun 11, 4:08 PM · Beta-Cluster-Infrastructure

dancy added a subtask for T428910: Beta Cluster MariaDB is still 10.6.17, MW now requires 10.11: T428930: Set up deployment-db15 with Trixie and wmf-mariadb1011.

Thu, Jun 11, 4:06 PM · MediaWiki-libs-Rdbms, Beta-Cluster-Infrastructure

dancy added a parent task for T428930: Set up deployment-db15 with Trixie and wmf-mariadb1011: T428910: Beta Cluster MariaDB is still 10.6.17, MW now requires 10.11.

Thu, Jun 11, 4:06 PM · Beta-Cluster-Infrastructure

dancy created T428930: Set up deployment-db15 with Trixie and wmf-mariadb1011.

Thu, Jun 11, 4:06 PM · Beta-Cluster-Infrastructure

dancy added a comment to T428910: Beta Cluster MariaDB is still 10.6.17, MW now requires 10.11.

Notes:
From operations/mediawiki-config/wmf-config/db-labs.php:

'hostsByName' => [
 // deployment-db11.deployment-prep.eqiad1.wikimedia.cloud, master
 'deployment-db11' => '172.16.5.150:3306',
 // deployment-db14.deployment-prep.eqiad1.wikimedia.cloud
 'deployment-db14' => '172.16.5.170:3306',
],

Thu, Jun 11, 3:47 PM · MediaWiki-libs-Rdbms, Beta-Cluster-Infrastructure

dancy added a comment to T366644: Raise MediaWiki's MariaDB requirement to 10.6.

@Jdforrester-WMF How do you feel about reverting https://gerrit.wikimedia.org/r/c/mediawiki/core/+/1300267 until beta cluster is prepared to handle it?

Thu, Jun 11, 3:21 PM · MW-1.47-notes (1.47.0-wmf.8; 2026-06-23), Patch-For-Review, MW-1.47-release, Technical-Debt, MediaWiki-libs-Rdbms

Fri, Jun 5

dancy closed T423914: 1.47.0-wmf.5 deployment blockers as Resolved.

Fri, Jun 5, 2:42 PM · Release-Engineering-Team (Priority Backlog 📥), Essential-Work, Release, Train Deployments

Thu, Jun 4

dancy created T428198: "Error: Call to a member function getTalkPage() on null" when trying to view notifications.

Thu, Jun 4, 6:07 PM · MW-1.47-notes (1.47.0-wmf.7; 2026-06-16), Wikipedia-Android-App-Backlog, Wikipedia-iOS-App-Backlog, MediaWiki-Platform-Team (Kanban Board), Notifications (Echo), Wikimedia-production-error

dancy added a comment to T256168: Move beta cluster automatic deployment to a dedicated infrastructure.

In T256168#11960221, @bd808 wrote:

In T256168#11959969, @hashar wrote:

Can we claim victory on this one did you have following steps in mind? The ones I think of are removing the agent in Jenkins and deleting the jobs (I can take care of that).

On my side we are still missing any notification if the sync jobs break. I found a good blog post on using stanzas with systemd units over the weekend that actually seems like a promising direction. I would like email and irc yelling so we don't miss things getting messed up.

Thu, Jun 4, 3:12 PM · Patch-For-Review, User-bd808, Release-Engineering-Team (Doing 😎), Continuous-Integration-Infrastructure, Quality-and-Test-Engineering-Team (Test Infrastructure), Jenkins, Continuous-Integration-Config, Beta-Cluster-Infrastructure

dancy added a comment to T413394: Wikibase has a flaky cypress test in addQualifier.cy.ts.

A recent instance: https://integration.wikimedia.org/ci/job/quibble-with-Wikibase-extensions-browser-tests-only-vendor-php83/10164/console

Thu, Jun 4, 1:24 AM · MW-1.47-notes (1.47.0-wmf.4; 2026-05-26), Wikidata-Omega (The Board), Wikidata, Browser-Tests, ci-test-error (WMF-deployed Build Failure)

Wed, Jun 3

dancy added a comment to T387813: Error: __clone method called on non-object.

Since it's been a while since this was originally reported, here's a fresh hit from today:

Wed, Jun 3, 2:53 PM · MediaWiki-extensions-LiquidThreads, Wikimedia-production-error

dancy updated subscribers of T428069: Puppet broken on phabricator-bookworm-3.devtools.eqiad1.wikimedia.cloud.

@brennen Do you know anything about this node?

Wed, Jun 3, 2:45 PM · VPS-project-Phabricator, VPS-project-devtools

dancy created T428069: Puppet broken on phabricator-bookworm-3.devtools.eqiad1.wikimedia.cloud.

Wed, Jun 3, 2:45 PM · VPS-project-Phabricator, VPS-project-devtools

Tue, Jun 2

dancy added a comment to T423914: 1.47.0-wmf.5 deployment blockers.

In T423914#11978634, @dancy wrote:

Train is blocked at testwikis due to T427935.

Tue, Jun 2, 6:52 PM · Release-Engineering-Team (Priority Backlog 📥), Essential-Work, Release, Train Deployments

dancy added a comment to T427935: wbsearchentities rejects limit=max for items and properties.

Given the described scope of the problem (wikidata.org, which is in group1), I will roll the train to group0 now.

Tue, Jun 2, 6:51 PM · Wikibase Reuse Team (Sprint 70)

dancy added a comment to T423914: 1.47.0-wmf.5 deployment blockers.

Train is blocked at testwikis due to T427935.

Tue, Jun 2, 6:25 PM · Release-Engineering-Team (Priority Backlog 📥), Essential-Work, Release, Train Deployments

dancy triaged T427935: wbsearchentities rejects limit=max for items and properties as Unbreak Now! priority.

Changing priority to UBN! this since task was added as a train blocker in T423914. I'm currently holding the train.

Tue, Jun 2, 6:24 PM · Wikibase Reuse Team (Sprint 70)

dancy closed T426212: Buildkit v0.30.0 released as Resolved.

Buildkit v0.30.0 deployed to all places.

Tue, Jun 2, 4:38 PM · Essential-Work, Release-Engineering-Team, GitLab (CI & Job Runners)

dancy closed T427449: gitlab-webhooks: toolforge webservice logs -f choking on a log message as Resolved.

Looks good now.

Tue, Jun 2, 4:27 PM · tools-platform-team, Patch-For-Review, Toolforge

Restricted Application changed the subtype of T366857: InvalidArgumentException from line 80 of ServerInfo.php: No server with index '0' (in a maintenance script) from "Task" to "Production Error".

Here's a fresh report from a batch of these errors that I saw today:

Tue, Jun 2, 3:00 PM · DBA, Wikimedia-production-error, MediaWiki-libs-Rdbms

May 29 2026

dancy added a comment to T426827: gitlab workers ulimit nofiles 1073741816 slows down fakeroot.

I prepared https://gitlab.wikimedia.org/repos/releng/gitlab-cloud-runner/-/merge_requests/610 which should help with this. It is live in the staging cluster now.

May 29 2026, 3:17 PM · Release-Engineering-Team (Doing 😎), Patch-For-Review, GitLab (CI & Job Runners), collaboration-services

May 28 2026

dancy added a comment to T427449: gitlab-webhooks: toolforge webservice logs -f choking on a log message.

It looks like stream of container log records being returned from Kubernetes is getting mangled, possibly due to weird Unicode characters?

May 28 2026, 7:56 PM · tools-platform-team, Patch-For-Review, Toolforge

dancy added a comment to T400083: Use a more recent Helm version to deploy to prod.

now exists with helm v4.2.0 installed.

May 28 2026, 4:55 PM · Release-Engineering-Team (Doing 😎), Catalyst (Luka Ijo Pimeja Jan), User-jnuche, Essential-Work

May 27 2026

dancy added a comment to T427250: registry.cloud.releng.team returns 500 responses when pushing layers.

I deployed a new version of Reggie which handles errors in the upload and manifest cleaners.

May 27 2026, 9:14 PM · GitLab (CI & Job Runners), Release-Engineering-Team

dancy added a comment to T427449: gitlab-webhooks: toolforge webservice logs -f choking on a log message.

Noting that it doesn't always stop at the same message. For example, I ran again recently and it stopped at an earlier timestamp of

May 27 2026, 8:12 PM · tools-platform-team, Patch-For-Review, Toolforge

dancy created T427449: gitlab-webhooks: toolforge webservice logs -f choking on a log message.

May 27 2026, 8:09 PM · tools-platform-team, Patch-For-Review, Toolforge

dancy added a comment to T427324: @CodeReviewBot is sometimes only commenting when a GitLab MR is merged, and not also when it's opened.

Today I created https://gitlab.wikimedia.org/repos/releng/reggie/-/merge_requests/110 with a footer but no comment was added to that ticket.

May 27 2026, 5:35 PM · User-brennen, Release-Engineering-Team, GitLab (Integrations)

dancy renamed T427315: Increase CI job timeout for helm-chart job (deployment-charts CI) from Increase CI job timeout for deployment-charts CI to Increase CI job timeout for helm-chart job (deployment-charts CI).

May 27 2026, 4:00 PM · Release-Engineering-Team (Doing 😎), ServiceOps-SharedInfra, ServiceOps new, collaboration-services

dancy added a comment to T427250: registry.cloud.releng.team returns 500 responses when pushing layers.

I deleted the pod. It restarted and the cleaners are running properly again. Space usage is down from 123GB to 32GB at the moment.

May 27 2026, 3:40 PM · GitLab (CI & Job Runners), Release-Engineering-Team

dancy added a comment to T427250: registry.cloud.releng.team returns 500 responses when pushing layers.

Reggie's filesystem usage seems to be only increasing. I'm not seeing regular hits for the word "clean" in the log like I expect. I'll look into that.

May 27 2026, 3:26 PM · GitLab (CI & Job Runners), Release-Engineering-Team

May 20 2026

dancy added a comment to T400083: Use a more recent Helm version to deploy to prod.

How do we feel about a GitLab repo for this purpose? Alternatively we can put something in https://gerrit.wikimedia.org/r/plugins/gitiles/integration/config/+/refs/heads/master/dockerfiles/, in which case we would receive the benefit of the image being updated when the base image is updated.

May 20 2026, 9:09 PM · Release-Engineering-Team (Doing 😎), Catalyst (Luka Ijo Pimeja Jan), User-jnuche, Essential-Work

dancy added a comment to T426761: PHP Warning: Undefined array key "wikimedia-donor".

Dropping another variant here for searchability:

Error

May 20 2026, 8:47 PM · MW-1.47-notes (1.47.0-wmf.5; 2026-06-02), Community-Tech, MediaWiki-extensions-GlobalPreferences, Wikimedia-production-error

dancy closed T397089: scap backport should warn if it knows it will take a long time as Resolved.

Deployed in scap 4.266.0

May 20 2026, 2:26 PM · Scap

May 18 2026

dancy added a comment to T387886: Jobs on Digital Ocean Cloud Runners are being OOM killed.

@Don-vip, I've made a configuration change which might help with your job. Please retry and let me know how it goes.

May 18 2026, 6:18 PM · Release-Engineering-Team (Priority Backlog 📥), User-brennen, GitLab (CI & Job Runners)

dancy added a comment to T387886: Jobs on Digital Ocean Cloud Runners are being OOM killed.

In T387886#11931806, @Don-vip wrote:

It didn't help, sadly.

May 18 2026, 3:22 PM · Release-Engineering-Team (Priority Backlog 📥), User-brennen, GitLab (CI & Job Runners)

dancy added a comment to T387886: Jobs on Digital Ocean Cloud Runners are being OOM killed.

@Don-vip, please try adding the following to your file:

May 18 2026, 2:57 PM · Release-Engineering-Team (Priority Backlog 📥), User-brennen, GitLab (CI & Job Runners)

May 15 2026

dancy closed T426436: Upgrade gitlab-cloud-runners Kubernetes to 1.35.1-do.6 as Resolved.

May 15 2026, 6:36 PM · Release-Engineering-Team, GitLab (CI & Job Runners)

dancy created T426436: Upgrade gitlab-cloud-runners Kubernetes to 1.35.1-do.6.

May 15 2026, 4:17 PM · Release-Engineering-Team, GitLab (CI & Job Runners)

May 14 2026

dancy changed the status of T397089: scap backport should warn if it knows it will take a long time from Open to In Progress.

May 14 2026, 7:43 PM · Scap

dancy added a comment to T425687: No Puppet resources found on instance deployment-mx04 on project deployment-prep.

Thanks @bd808!

May 14 2026, 4:27 PM · User-bd808, Beta-Cluster-Infrastructure

May 13 2026

dancy created T426212: Buildkit v0.30.0 released.

May 13 2026, 2:53 PM · Essential-Work, Release-Engineering-Team, GitLab (CI & Job Runners)

dancy lowered the priority of T425988: Deprecated: Accessing the language without explicitly setting it via MediaHandler:setLanguage, MediaHandler::getHandler, or MediaHandlerFactory::getHandler from Unbreak Now! to Needs Triage.

Thanks @MGChecker and @cscott!

May 13 2026, 2:40 PM · MW-1.46-release, MW-1.47-notes (1.47.0-wmf.2; 2026-05-12), MediaWiki-File-management

May 12 2026

dancy added a comment to T425988: Deprecated: Accessing the language without explicitly setting it via MediaHandler:setLanguage, MediaHandler::getHandler, or MediaHandlerFactory::getHandler.

I deployed https://gerrit.wikimedia.org/r/c/mediawiki/core/+/1286464 but it did not have an effect on the logging rate.

May 12 2026, 9:08 PM · MW-1.46-release, MW-1.47-notes (1.47.0-wmf.2; 2026-05-12), MediaWiki-File-management

dancy added a comment to T425988: Deprecated: Accessing the language without explicitly setting it via MediaHandler:setLanguage, MediaHandler::getHandler, or MediaHandlerFactory::getHandler.

From a recent deployment

17:56:20 Waiting 20 seconds for production traffic...
17:56:40 Logstash checker counted 107 error(s) in the last 20 seconds. OK.

The threshold is 150.

May 12 2026, 6:19 PM · MW-1.46-release, MW-1.47-notes (1.47.0-wmf.2; 2026-05-12), MediaWiki-File-management

dancy added a comment to T425988: Deprecated: Accessing the language without explicitly setting it via MediaHandler:setLanguage, MediaHandler::getHandler, or MediaHandlerFactory::getHandler.

The volume of warnings has moved us dangerously close to the point where scap deployments will start complaining about it. This is not a place we want to be so I increased this priority of this ticket to Unbreak Now.

May 12 2026, 6:18 PM · MW-1.46-release, MW-1.47-notes (1.47.0-wmf.2; 2026-05-12), MediaWiki-File-management

dancy triaged T425988: Deprecated: Accessing the language without explicitly setting it via MediaHandler:setLanguage, MediaHandler::getHandler, or MediaHandlerFactory::getHandler as Unbreak Now! priority.

May 12 2026, 6:14 PM · MW-1.46-release, MW-1.47-notes (1.47.0-wmf.2; 2026-05-12), MediaWiki-File-management

May 11 2026

dancy added a comment to T425687: No Puppet resources found on instance deployment-mx04 on project deployment-prep.

Probably caused by https://gerrit.wikimedia.org/r/c/operations/puppet/+/1283025 (T325394)

May 11 2026, 3:15 PM · User-bd808, Beta-Cluster-Infrastructure

dancy added a comment to T425687: No Puppet resources found on instance deployment-mx04 on project deployment-prep.

$ sudo run-puppet-agent
Info: Using environment 'production'
Info: Retrieving pluginfacts
Info: Retrieving plugin
Info: Loading facts
Error: Could not retrieve catalog from remote server: Error 500 on SERVER: Server Error: Could not find class role::mail::mx for deployment-mx04.deployment-prep.eqiad1.wikimedia.cloud on node deployment-mx04.deployment-prep.eqiad1.wikimedia.cloud
Warning: Not using cache on failed catalog
Error: Could not retrieve catalog; skipping run

May 11 2026, 3:13 PM · User-bd808, Beta-Cluster-Infrastructure

Content licensed under Creative Commons Attribution-ShareAlike (CC BY-SA) 4.0 unless otherwise noted; code licensed under GNU General Public License (GPL) 2.0 or later and other open source licenses. By using this site, you agree to the Terms of Use, Privacy Policy, and Code of Conduct. · Wikimedia Foundation · Privacy Policy · Code of Conduct · Terms of Use · Disclaimer · CC-BY-SA · GPL · Credits

URL: https://phabricator.wikimedia.org/p/dancy/

⇱ ♟ dancy

dancy (Ahmon Dancy)
Staff Software Engineer, Release EngineeringAdministrator

Projects (19)
View All

Calendar

Today

Tomorrow

Wednesday

User Details

Recent Activity
View All

Fri, Jun 26

Thu, Jun 25

Wed, Jun 24

Tue, Jun 23

Thu, Jun 18

Wed, Jun 17

Tue, Jun 16

Mon, Jun 15

Thu, Jun 11

Fri, Jun 5

Thu, Jun 4

Wed, Jun 3

Tue, Jun 2

May 29 2026

May 28 2026

May 27 2026

May 20 2026

Error

May 18 2026

May 15 2026

May 14 2026

May 13 2026

May 12 2026

May 11 2026

URL: https://phabricator.wikimedia.org/p/dancy/

⇱ ♟ dancy

dancy (Ahmon Dancy)Staff Software Engineer, Release EngineeringAdministrator

Projects (19)View All

Calendar

Today

Tomorrow

Wednesday

User Details

Recent ActivityView All

Fri, Jun 26

Thu, Jun 25

Wed, Jun 24

Tue, Jun 23

Thu, Jun 18

Wed, Jun 17

Tue, Jun 16

Mon, Jun 15

Thu, Jun 11

Fri, Jun 5

Thu, Jun 4

Wed, Jun 3

Tue, Jun 2

May 29 2026

May 28 2026

May 27 2026

May 20 2026

Error

May 18 2026

May 15 2026

May 14 2026

May 13 2026

May 12 2026

May 11 2026

dancy (Ahmon Dancy)
Staff Software Engineer, Release EngineeringAdministrator

Projects (19)
View All

Recent Activity
View All