Add Intel GPU info collection to the collect env script #137846

Open
wants to merge 1 commit into
base: main

Conversation

jingxu10
Collaborator

@jingxu10 jingxu10 commented Oct 12, 2024

  1. Add Intel GPU info collection to the collect env script
  2. Since wmic is deprecated on Windows, this PR replaces each use of wmic with its PowerShell counterpart. See: WMI command line (WMIC) utility deprecation: Next steps
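
A minimal sketch of the PowerShell-based replacement described in item 2, assuming `Get-CimInstance` stands in for the deprecated `wmic` query. The helper names `format_windows_os` and `query_windows_os` are hypothetical and do not come from this PR; the PR's own snippet calls the `gwmi` alias through `run_and_read_all`.

```python
import json
import subprocess


def format_windows_os(json_text):
    """Format ConvertTo-Json output for Win32_OperatingSystem as one line.

    Returns None when the text is not the expected JSON object.
    """
    try:
        obj = json.loads(json_text)
        return f'{obj["Caption"]} ({obj["Version"]} {obj["OSArchitecture"]})'
    except (ValueError, KeyError, TypeError):
        # ValueError: invalid JSON; KeyError/TypeError: unexpected shape.
        return None


def query_windows_os():
    """Ask PowerShell for OS details (hypothetical helper, Windows only)."""
    command = [
        "powershell.exe",
        "-NoProfile",
        "-Command",
        "Get-CimInstance -ClassName Win32_OperatingSystem"
        " | Select-Object -Property Caption,OSArchitecture,Version"
        " | ConvertTo-Json",
    ]
    try:
        proc = subprocess.run(command, capture_output=True, text=True, check=False)
    except OSError:
        # powershell.exe is not available (for example, on Linux or macOS).
        return None
    if proc.returncode != 0:
        return None
    return format_windows_os(proc.stdout)
```

Keeping the parsing separate from the subprocess call makes the formatting logic testable on any platform.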

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang


pytorch-bot bot commented Oct 12, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/137846

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures, 2 Cancelled Jobs, 1 Unrelated Failure

As of commit dc8231a with merge base a777dea:

NEW FAILURES - The following jobs have failed:

CANCELLED JOBS - The following jobs were cancelled. Please retry:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@jingxu10
Collaborator Author

@pytorchbot label "topic: not user facing"

pytorch-bot bot added the "topic: not user facing" (topic category) label Oct 12, 2024
EikanWang marked this pull request as draft October 15, 2024 02:40
jingxu10 marked this pull request as ready for review October 15, 2024 20:53
jingxu10 marked this pull request as draft October 15, 2024 20:53
jingxu10 marked this pull request as ready for review October 16, 2024 21:29
Comment on lines 495 to 503
ret = run_and_read_all(
    run_lambda,
    'powershell.exe "gwmi -Class Win32_OperatingSystem | Select-Object -Property Caption,\
OSArchitecture,Version | ConvertTo-Json"',
)
try:
    obj = json.loads(ret)
    ret = f'{obj["Caption"]} ({obj["Version"]} {obj["OSArchitecture"]})'
except ValueError as e:
Contributor

Can you please land this one as a separate PR?

@@ -146,7 +151,7 @@ def get_nvidia_driver_version(run_lambda):
     return run_and_parse_first_match(run_lambda, smi, r'Driver Version: (.*?) ')


-def get_gpu_info(run_lambda):
+def get_nvidia_gpu_info(run_lambda):
Contributor

Please don't delete any existing functions, as this might be considered a public API change.
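
One way to honor this request, sketched under assumptions: keep the old name as a thin wrapper around the renamed function. The body of `get_nvidia_gpu_info` below is a placeholder for illustration only, and the `(return code, stdout, stderr)` shape of `run_lambda` is assumed from the `rc, _, _ = run(...)` calls elsewhere in this PR.

```python
def get_nvidia_gpu_info(run_lambda):
    """New, vendor-specific name. The body is a placeholder, not the PR's code."""
    # Assumption: run_lambda(command) returns (return_code, stdout, stderr).
    rc, out, _ = run_lambda("nvidia-smi -L")
    return out.strip() if rc == 0 else None


def get_gpu_info(run_lambda):
    """Old public name, kept so existing callers keep working."""
    return get_nvidia_gpu_info(run_lambda)
```

With this shape, code that imports `get_gpu_info` keeps working while new code can call the vendor-specific name.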

Comment on lines 238 to 257
if mgr_name == "":
    rc, _, _ = run("which dpkg")
    if rc == 0:
        mgr_name = "dpkg"
if mgr_name == "":
    rc, _, _ = run("which dnf")
    if rc == 0:
        mgr_name = "dnf"
if mgr_name == "":
    rc, _, _ = run("which yum")
    if rc == 0:
        mgr_name = "yum"
if mgr_name == "":
    rc, _, _ = run("which zypper")
    if rc == 0:
        mgr_name = "zypper"
Contributor

Please avoid code duplication; use a loop.

Suggested change
-if mgr_name == "":
-    rc, _, _ = run("which dpkg")
-    if rc == 0:
-        mgr_name = "dpkg"
-if mgr_name == "":
-    rc, _, _ = run("which dnf")
-    if rc == 0:
-        mgr_name = "dnf"
-if mgr_name == "":
-    rc, _, _ = run("which yum")
-    if rc == 0:
-        mgr_name = "yum"
-if mgr_name == "":
-    rc, _, _ = run("which zypper")
-    if rc == 0:
-        mgr_name = "zypper"
+for mgr_name in ["dpkg", "dnf", "yum", "zypper", ""]:
+    if mgr_name == "":
+        continue
+    rc, _, _ = run(f"which {mgr_name}")
+    if rc == 0:
+        break
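
A possible alternative to the loop above, offered only as a sketch: return from a helper on the first hit, so no empty-string sentinel is needed in the candidate list. It swaps the `run("which ...")` shell call for the standard library's `shutil.which`, which is a substitution and not what the PR or the suggestion uses. The function name `detect_package_manager` and its parameters are hypothetical.

```python
import shutil


def detect_package_manager(candidates=("dpkg", "dnf", "yum", "zypper"),
                           which=shutil.which):
    """Return the first candidate found on PATH, or "" if none is found.

    `which` is injectable so the lookup can be replaced in tests.
    """
    for name in candidates:
        if which(name) is not None:
            return name
    return ""
```

The empty-string return value matches the `mgr_name == ""` convention used in the original code.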

drisspg added the "triaged" label (this issue has been looked at by a team member, and triaged and prioritized into an appropriate module) Oct 18, 2024
jingxu10 force-pushed the jingxu10/collect_env_main branch 6 times, most recently from 77fb09d to dc8231a, October 22, 2024 21:36
@jingxu10
Collaborator Author

The CI doesn't seem to use 2.5. Offline testing with torch.xpu on a 2.5 wheel works.
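
For readers who want to repeat a similar offline check, here is a rough sketch of probing `torch.xpu` for device names. It is not the code added by this PR; the function name `list_xpu_devices` is hypothetical, and the sketch assumes a PyTorch build that exposes `torch.xpu.is_available`, `torch.xpu.device_count`, and `torch.xpu.get_device_name`.

```python
def list_xpu_devices():
    """Return the names of visible Intel GPUs, or [] if none can be queried."""
    try:
        import torch
    except ImportError:
        # PyTorch is not installed in this environment.
        return []
    xpu = getattr(torch, "xpu", None)
    if xpu is None or not xpu.is_available():
        # Older build without torch.xpu, or no usable XPU device.
        return []
    return [xpu.get_device_name(i) for i in range(xpu.device_count())]


if __name__ == "__main__":
    print(list_xpu_devices())
```

On a machine without PyTorch or without an Intel GPU, the function returns an empty list instead of raising.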

Labels
ciflow/inductor; module: inductor; open source; topic: not user facing (topic category); triaged (this issue has been looked at by a team member, and triaged and prioritized into an appropriate module)
Projects
Status: Review Required
4 participants