Troubleshooting

The following is a list of problems that usually occur during the work process.

Pod Not Running

This syntax is used to detect errors.

kubectl logs <pod-name> -n vllm

Make sure the service is of type LoadBalancer and that your cluster supports the LoadBalancer service.

kubectl describe service llama-31-70b-instruct -n vllm

Ensure the nodes have sufficient resources (CPU, memory, GPU) to meet the resource requests and limits.

Last updated 6 months ago