AssertionError: please set tensor_parallel_size to less than max local gpu count

Solution Verified

Issue

  • AssertionError: please set tensor_parallel_size to less than max local gpu count.
  • Unable to serve any model.
  • Unable to start an ilab chat session.
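
The assertion message suggests that the configured tensor_parallel_size is larger than the number of GPUs visible on the host. The following is a minimal diagnostic sketch, not part of RHEL AI or ilab. It assumes NVIDIA GPUs with nvidia-smi available on the PATH, and that the check amounts to requiring tensor_parallel_size to be no greater than the local GPU count. The helper names count_visible_gpus and fits_local_gpus are hypothetical.

```python
import shutil
import subprocess


def count_visible_gpus():
    """Hypothetical helper: count GPUs listed by `nvidia-smi -L`.

    Returns 0 when nvidia-smi is missing or exits with an error.
    Assumes NVIDIA hardware; other accelerators need a different tool.
    """
    if shutil.which("nvidia-smi") is None:
        return 0
    result = subprocess.run(
        ["nvidia-smi", "-L"], capture_output=True, text=True, check=False
    )
    if result.returncode != 0:
        return 0
    # Each device is reported on its own line, starting with "GPU <index>:".
    return sum(1 for line in result.stdout.splitlines() if line.startswith("GPU "))


def fits_local_gpus(tensor_parallel_size, gpu_count):
    """Hypothetical helper: True if the requested size fits the GPU count.

    Assumption: the serving backend requires
    tensor_parallel_size <= number of local GPUs.
    """
    if tensor_parallel_size < 1:
        raise ValueError("tensor_parallel_size must be at least 1")
    return tensor_parallel_size <= gpu_count


if __name__ == "__main__":
    gpus = count_visible_gpus()
    requested = 8  # example value only; substitute the configured size
    print(f"visible GPUs: {gpus}, requested tensor_parallel_size: {requested}")
    print("fits" if fits_local_gpus(requested, gpus) else "too large for this host")
```

When the reported GPU count is lower than the configured value, fits_local_gpus returns False, which corresponds to the condition described by the assertion message.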

Environment

  • RHEL AI
