Tasks handled by the Wikimedia Foundation's datacenter operations team, which is a sub-team of the SRE department.
This project includes the sub-projects procurement and decommission-hardware, plus every datacenter site-specific project: ops-codfw, ops-drmrs, ops-eqdfw, ops-eqiad, ops-eqord, ops-esams, ops-eqsin, ops-ulsfo, and ops-magru.
This can be linked to via: https://phabricator.wikimedia.org/tag/dc-ops/
Please note that any Wikitech documentation maintained by DC-Ops is linked from https://wikitech.wikimedia.org/wiki/Dc-operations
SLAs
DC-Ops makes every attempt to resolve all tasks and requests in a timely manner. We've implemented the following SLA targets.
Please note that no SLA clock starts until both conditions are met: the SLA start listed in the table has occurred, and the task carries the proper project tags. See the details for each type of task request in its section below. Please use the templates listed below.
Project | Days to Resolve | SLA start | Template |
procurement | 90 | Date of Task filing | Procurement Template |
Racking/Installation | 30 | Arrival of Hardware to DC site | |
Hardware Failure / Repair | 10 | Date of Task filing | Hardware Failure Template |
Decommission | 45 | When all sub-team steps are complete and task is assigned to on-site | Server Decommission Template |
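As a rough illustration of how the targets in the table above translate into dates, the following Python sketch computes a target resolution date from an SLA start date. The dictionary keys and function names are hypothetical labels chosen for this example, and the sketch assumes the targets are counted in calendar days, which the table does not specify.

```python
from datetime import date, timedelta

# Days-to-resolve targets copied from the SLA table.
# The keys are illustrative labels, not official Phabricator tags.
SLA_DAYS = {
    "procurement": 90,
    "racking": 30,
    "hardware-failure": 10,
    "decommission": 45,
}


def sla_due_date(task_type: str, sla_start: date) -> date:
    """Return the target resolution date.

    Assumption: the SLA counts calendar days from the SLA start.
    """
    return sla_start + timedelta(days=SLA_DAYS[task_type])


def is_within_sla(task_type: str, sla_start: date, today: date) -> bool:
    """Return True if `today` is on or before the target resolution date."""
    return today <= sla_due_date(task_type, sla_start)
```

For example, under these assumptions a hardware failure task filed on a given day would have a target date ten calendar days later, while a racking task would be measured from the day the hardware arrives on site rather than from the filing date.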
Hardware Repair
If you need to file a task requesting hardware troubleshooting, please use the File Hardware Failure Task link here or in the navbar on the left.
Troubleshooting includes hardware failures, RAID reconfiguration, and similar issues.
A full runbook on how to troubleshoot hardware failures can be viewed here: https://wikitech.wikimedia.org/wiki/Dc-operations/Hardware_Troubleshooting_Runbook
Requesting Hardware
If you have a budget line item and want to file a request for pricing, please file your procurement request via this link. If you do not yet have a budget line for the request in this fiscal year, you can still file via that link; simply note in the budget section of the task that there is no budget allocation.
Once hardware has been ordered, a racking task must be entered using the form. This form may also be used if a system has to be moved and re-imaged.
Decommissioning Hardware
This covers all hardware being returned to DC-Ops, whether for processing into spares or for decommissioning and removal from the rack.
Any hardware no longer required for use should have a task filed for decommission via the pre-defined server decommission request form.
Netbox Reporting
The template for Netbox report errors is here: https://phabricator.wikimedia.org/maniphest/task/edit/form/133/