Deka GPU Documentations
  • Starter Guide
    • Introduction
    • Sign Up
    • Choose a Package
    • Top Up
    • Create a Virtual Machine
    • Download kubeconfig
    • Create a Deka LLM
    • Create a Deka Notebook
    • Conclusion
  • Service Portal
    • Introduction
    • Sign Up
    • Sign In
    • Sign Out
    • Forgot Password
    • Account Setting
      • Using MFA Google Authenticator
      • Using MFA Microsoft Authenticator
    • Project
      • Add Project
      • Delete Project
    • List Roles
    • Broadcast
    • Audit Log
    • Voucher
    • Security
      • AI Security AI Infrastructure Layer
      • AI Security AI Application Layer
    • Ticket
      • Create Ticket
      • Detail Ticket
    • Billing
      • Daily Cost Estimated
      • Monthly Cost
      • Invoice
      • Summary Monthly
    • Balance
      • Project Type: SME
        • GPU Merdeka
        • Choose Package
        • Top-Up
      • Project Type: Enterprise
      • History Balance
        • Balance
        • Transaction
      • Custom Resource Definition
  • Deka GPU
    • Deka GPU: Kubernetes
      • Introduction
      • GPU Type
      • Dashboard
        • Check Status Kubernetes
        • Download Kube Config
        • Access Console
      • Workloads
        • Pods
          • Create New Pod
          • Access Console
          • Configuration Pod
          • Delete Pod
          • How to Create a New Pod use CLI
        • Deployments
          • Create New Deployment
          • Configuring Deployment
          • Delete of a Deployment
          • How to Create a New Deployment use CLI
        • DaemonSets
          • Create a New DaemonSet
          • Configuring a DaemonSet
          • Delete DaemonSet
      • Services
      • Storages
        • Storage Class
        • Persistent Volume Claims
          • Create a New Persistent Volume Claim
          • How to Create a New Persistent Volume Claim use CLI
    • Deka GPU: VMs
      • Operating System
      • GPU Type
      • Machine Type
      • Namespace Type
      • Storage Class
      • How to Create a Virtual Machine on Service Portal
      • How to Manually Create a Virtual Machine
        • Download Kube Config
        • Running Kube Config
        • Configuration file dv.yaml
        • Configuration file vm.yaml
        • Configuration file svc.yaml
      • Feature Overview of Virtual Machine
        • Detail a Virtual Machine
        • Open Console
        • Turn Off a VM Instance
        • Turn On a VM Instance
        • Restart a Virtual Machine
        • How to Access Console
        • Show YAML File
      • Delete a Virtual Machine
    • Deka GPU: Registry
      • Create Registry
      • Quota
      • Detail Registry
        • Summary
        • Repository
        • Logs
        • Labels
        • Tag Immutability
        • Member
        • Resize Storage Registry
      • Delete Registry
    • Deka GPU: Security
      • Deka Guard
        • Introduction
        • Create Guard to Deny All Ingress
        • Create Guard to Allow Ingress
        • Create Guard to Allow Ingress with port
        • Create Guard to Allow Ingress with IP/CIDR
        • Create Guard to Deny All Egress
        • Create Guard to Allow Egress
        • Create guard to Allow Egress with Port
        • Create Guard to Allow Egress with IP/CIDR
    • Deka GPU: Service
      • Ingress
        • Install Ingress nginx
        • Install Cert Manager
        • Create Cluster Issuer
        • Create Ingress with TLS
    • Deka GPU: Autoscaling
      • Basic Autoscaling
    • Deka GPU: Network
      • Deka VPC
    • Deka GPU: MLOps
      • Introduction
      • Notebook
      • Tensorboards
      • Volumes
      • Endpoints
        • Create Endpoint
        • Delete Endpoint
      • Experiments (AutoML)
        • Create Experiments (AutoML)
        • Create Experiments (AutoML) using Python SDK
        • Get Experiments Results
      • Experiments (KFP)
        • Create Experiment
      • Pipelines
      • Runs
        • Create Run
        • Delete Active Run
      • Recurring Runs
        • Create Recurring Run
        • Delete Recurring Runs
        • Home
      • Artifacts
      • Executions
      • Manage Contributors
  • Deka LLM
    • Introduction
    • Check Project Type
    • Create a New LLM
    • Detail Deka LLM
      • Overview Tab
      • Keys Tab
        • Create a New Key
        • Detail a Key
        • Edit a Key
        • Get a Secret Key
        • Delete a Key
      • Usage Tab
      • Top Up Coin
    • API Deka LLM
      • Model Management
      • Completions
      • Embedding
    • Delete Deka LLM
    • How to Create Simple Prompt with Deka LLM
      • Create Deka LLM
      • Get URL API Deka LLM
      • Get Secret Key
      • Access API Deka LLM using Postman
      • Get Model
      • Post Chat Completions
  • Deka Notebook
    • Introduction
    • Namespace Type
    • Create a New Notebook
    • Detail Deka Notebook
      • Configuration Deka Notebook
      • Start Deka Notebook Service
      • Stop Deka Notebook Service
      • Get Token
      • Login Deka Notebook
      • Logout Deka Notebook
    • Delete Deka Notebook
  • Reference
    • How to use kubeconfig on Linux
    • How to use kubeconfig on Windows
    • Kubernetes Commands for Enhancing Security
    • How to add GPU in Kubernetes
    • How to Add GPU in VM
      • Download kubeconfig
      • Install kubectl
      • Add GPU
      • Install Driver NVIDIA
    • RAPIDS
      • How to Setup RAPIDS
      • How to make Custom Image
    • How to push image with Docker
    • Deployment LLaMA 3.1 70B with VLLM on Kubernetes
      • Getting the Hugging Face API Key
      • Requesting Access to the LLaMA Model
      • Connect Kubernetes on Computer
      • Create Namespace
      • Create PersistentVolumeClaim (PVC)
      • Create Secret for Hugging Face Token
      • Create Deployment
      • Create Service
      • Verify Deployment
      • Accessing the LLaMA Service
      • Troubleshooting
    • How to Get an API Key on NGC
    • Deployment LLM with Deka GPU + NIM
    • Deployment Deepseek R1 70B with VLLM on Deka GPU's Kubernetes
      • Prerequisites
      • Create Namespace
      • Create PersistentVolumeClaim (PVC)
      • Create Deployment
      • Create Service
      • Verify Deployment
      • Accessing the Deepsek Service
      • Troubleshooting
    • How to Upload and Download on FTP Web
  • Troubleshooting
    • Reinstall Driver NVIDIA on Linux
    • NVIDIA Driver Not Detected After Upgrade Kernel
Powered by GitBook
On this page
  • Uninstall NVIDIA Driver Already Installed
  • Install New Driver NVIDIA
  • Check Installation
  • Enable and Start Service
  • Configure CUDA Toolkit
  • Install Python3
  • Testing
  1. Troubleshooting

Reinstall Driver NVIDIA on Linux

Reinstalling NVIDIA drivers on a Linux operating system is an important process to ensure optimal performance of your hardware, especially when experiencing degraded performance issues. This guide will cover the detailed steps required to reinstall NVIDIA drivers on Linux to help you maximize the performance and stability of your system.

If after following the guide below and the error still occurs, your system needs to reboot.

Uninstall NVIDIA Driver Already Installed

In the first stage, removing the installed NVIDIA driver is logging in as a super user.

sudo su

Displays a list of NVIDIA Driver packages that are already installed on the system by executing the following syntax.

apt list nvidia-driver* | grep installed

In the list there is a driver "nvidia-driver-550", delete the driver by running the following syntax.

apt remove nvidia-driver-550

Run this command to remove packages that are no longer needed by the system.

apt autoremove

After successfully deleting the NVIDIA driver and packages that are no longer needed by the system, then restart the system by running the following syntax.

reboot

Install New Driver NVIDIA

Download the CUDA keyring package by executing the following syntax.

wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64/cuda-keyring_1.1-1_all.deb

Install the CUDA keyring package that has been successfully downloaded by running the following syntax.

sudo dpkg -i cuda-keyring_1.1-1_all.deb
sudo apt-get update

After running the syntax to install the CUDA keyring package, the next step is to install the CUDA driver and fabric manager to the system. Run the following syntax.

apt install cuda-drivers-550 cuda-drivers-fabricmanager-550

Check Installation

Check the installation results using nvidia-smi by running the following command.

nvidia-smi

Enable and Start Service

To manage the nvidia-fabricmanager service that is installed on a Linux system, you need to run some syntax to activate the service by running the following syntax.

If you use baremetal and H100 GPU type. You can run the syntax below.

systemctl enable nvidia-fabricmanager
systemctl start nvidia-fabricmanager
systemctl status nvidia-fabricmanager

Configure CUDA Toolkit

Install CUDA Toolkit on your system by executing the following syntax.

apt-get -y install cuda-toolkit-12-4

The next stage is to configure the installed CUDA Toolkit path by creating a cuda.sh profile file.

nano /etc/profile.d/cuda.sh

Adding the following path.

export PATH=$PATH:/usr/local/cuda/bin

Update the bashrc file.

nano ~/.bashrc

add the path to the last line.

export PATH=$PATH:/usr/local/cuda/bin

Check the version of nvcc used, log out and log back in to load the variables used.

nvcc –version

Install Python3

Install Python3 and pip install torch, torchvision, torchaudio and transformers by executing the following syntax.

apt install python3 python3-pip
pip3 install torch torchvision torchaudio
pip install transformers

Testing

Log in to Python3 and run the following code to test if CUDA is available:

import torch 
torch.cuda.is_available() 
torch.cuda.current_device() 
torch.cuda.device(0)
PreviousHow to Upload and Download on FTP WebNextNVIDIA Driver Not Detected After Upgrade Kernel

Last updated 5 months ago

Page cover image