Skip to content

Conversation

gavinkflam
Copy link
Contributor

@gavinkflam gavinkflam commented Jun 15, 2025

What type of PR is this?

/kind feature

What this PR does / why we need it:

CRI-O is not reporting HugeTLB metrics.

Which issue(s) this PR fixes:

Part of #9125

Special notes for your reviewer:

Does this PR introduce a user-facing change?

Added HugeTLB usage (`container_hugetlb_usage_bytes`) and maxUsage (`container_hugetlb_max_usage_bytes`) metrics.

@openshift-ci openshift-ci bot added release-note Denotes a PR that will be considered when it comes time to generate release notes. kind/feature Categorizes issue or PR as related to a new feature. dco-signoff: yes Indicates the PR's author has DCO signed all their commits. labels Jun 15, 2025
@openshift-ci openshift-ci bot requested review from hasan4791 and littlejawa June 15, 2025 03:07
@openshift-ci openshift-ci bot added the needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. label Jun 15, 2025
Copy link
Contributor

openshift-ci bot commented Jun 15, 2025

Hi @gavinkflam. Thanks for your PR.

I'm waiting for a cri-o member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

Copy link

codecov bot commented Jun 15, 2025

Codecov Report

Attention: Patch coverage is 72.85714% with 19 lines in your changes missing coverage. Please review.

Project coverage is 66.93%. Comparing base (92381ff) to head (7c8e7ef).
Report is 12 commits behind head on main.

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #9257      +/-   ##
==========================================
+ Coverage   66.81%   66.93%   +0.11%     
==========================================
  Files         199      200       +1     
  Lines       27469    27613     +144     
==========================================
+ Hits        18353    18482     +129     
- Misses       7585     7608      +23     
+ Partials     1531     1523       -8     
🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@gavinkflam gavinkflam force-pushed the 9125-hugetlb-metrics branch 2 times, most recently from 658816a to 51ea0cf Compare June 17, 2025 02:19
@gavinkflam
Copy link
Contributor Author

@sohankunkerkar I have fixed the freebsd build. Could you please approve the workflows again?

@sohankunkerkar
Copy link
Member

@gavinkflam Thanks for the PR. Could you write an integration test for this? You can find more details under https://github.com/cri-o/cri-o/tree/main/test

@sohankunkerkar
Copy link
Member

/ok-to-test

@openshift-ci openshift-ci bot added ok-to-test Indicates a non-member PR verified by an org member that is safe to test. and removed needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. labels Jun 17, 2025
@gavinkflam gavinkflam force-pushed the 9125-hugetlb-metrics branch from 51ea0cf to 3ec8159 Compare June 22, 2025 19:48
@gavinkflam
Copy link
Contributor Author

Thanks @sohankunkerkar! I wrote an integration test and fixed some issues.

A bug from upstream is preventing us from getting cgroup v2 HugeTLB stats. Once my pull request is accepted by upstream, I will update the go.mod version.

opencontainers/cgroups#24

@gavinkflam gavinkflam force-pushed the 9125-hugetlb-metrics branch from 3ec8159 to 0033f5e Compare June 28, 2025 02:45
@bitoku
Copy link
Contributor

bitoku commented Jun 30, 2025

@gavinkflam Can you fix the vendor by running make vendor ?
Other than that it lgtm.

@gavinkflam
Copy link
Contributor Author

gavinkflam commented Jun 30, 2025

Thanks @bitoku.

I'm still waiting for upstream to merge my PR opencontainers/cgroups#24. There is a bug that prevents us from getting cgroup v2 HugeTLB stats. Once that is merged, I will update go.mod and run make vendor.

@gavinkflam
Copy link
Contributor Author

@bitoku This is updated. Thank you for the feedbacks.

@bitoku
Copy link
Contributor

bitoku commented Jul 16, 2025

/lgtm

@openshift-ci openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Jul 16, 2025
@bitoku
Copy link
Contributor

bitoku commented Jul 16, 2025

/release-note-edit

Added HugeTLB usage (`container_hugetlb_usage_bytes`) and maxUsage (`container_hugetlb_max_usage_bytes`) metrics.

@bitoku
Copy link
Contributor

bitoku commented Jul 16, 2025

/skip
/retest-required

@bitoku
Copy link
Contributor

bitoku commented Jul 16, 2025

@cri-o/cri-o-maintainers PTAL

Signed-off-by: Gavin Lam <gavin.oss@tutamail.com>
@gavinkflam gavinkflam force-pushed the 9125-hugetlb-metrics branch from 42a7530 to 7c8e7ef Compare July 17, 2025 03:43
@openshift-ci openshift-ci bot removed the lgtm Indicates that a PR is ready to be merged. label Jul 17, 2025
@gavinkflam gavinkflam requested a review from saschagrunert July 17, 2025 03:47
@gavinkflam
Copy link
Contributor Author

I've improved the tests to avoid potential flakiness around the sleeps. PTAL.

@saschagrunert @bitoku

Copy link
Member

@sohankunkerkar sohankunkerkar left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@cri-o/cri-o-maintainers PTAL

@openshift-ci openshift-ci bot added lgtm Indicates that a PR is ready to be merged. approved Indicates a PR has been approved by an approver from all required OWNERS files. labels Jul 18, 2025
@sohankunkerkar sohankunkerkar removed the lgtm Indicates that a PR is ready to be merged. label Jul 18, 2025
@sohankunkerkar
Copy link
Member

/retest

@haircommander
Copy link
Member

/override ci/prow/ci-e2e-evented-pleg

Copy link
Contributor

openshift-ci bot commented Jul 21, 2025

@haircommander: Overrode contexts on behalf of haircommander: ci/prow/ci-e2e-evented-pleg

In response to this:

/override ci/prow/ci-e2e-evented-pleg

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@gavinkflam
Copy link
Contributor Author

gavinkflam commented Jul 22, 2025

@bitoku Can I get another lgtm please? The previous one is gone because of the last commit.

@openshift-ci openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Jul 23, 2025
Copy link
Contributor

openshift-ci bot commented Jul 23, 2025

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: gavinkflam, saschagrunert, sohankunkerkar

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:
  • OWNERS [saschagrunert,sohankunkerkar]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-merge-bot openshift-merge-bot bot merged commit ec7450f into cri-o:main Jul 23, 2025
84 of 89 checks passed
@gavinkflam gavinkflam deleted the 9125-hugetlb-metrics branch July 23, 2025 12:35
R3hankhan123 pushed a commit to R3hankhan123/cri-o that referenced this pull request Jul 24, 2025
Implement HugeTLB metrics

Signed-off-by: Rehan Khan <Rehan.Khan7@ibm.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. dco-signoff: yes Indicates the PR's author has DCO signed all their commits. kind/feature Categorizes issue or PR as related to a new feature. lgtm Indicates that a PR is ready to be merged. ok-to-test Indicates a non-member PR verified by an org member that is safe to test. release-note Denotes a PR that will be considered when it comes time to generate release notes.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants