# How to Setup RAPIDS

This section of the guide will explain how to set up RAPIDS in the Deka GPU Portal Service. By using RAPIDS, operations that would normally run on the CPU can be significantly accelerated by running them on the GPU, providing improved performance in large-scale scenarios. The following are the steps for setting up RAPIDS in the Deka GPU Portal Service:

* On the MLOps menu page on the left, select the Notebooks menu.

<figure><img src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXfb_pJ6NRMS1c9lHB0_kAa5zTk5vcSwdRDolDh-BGBLCkKwjVGd975kyp801bDpzqye1JuWIKKSSFI4k2rNIwmOUqUcNyK8IcTMIzSIyctEyMMJAfpUaAJ_Xbb1qI3PgK5JeIo0ZZKK1VkwvalTarlD-C4P?key=jjlJ5kzuqYjsDgQSdh7ynA" alt=""><figcaption><p>RAPIDS Setup</p></figcaption></figure>

* On the Notebooks page press the +New Notebooks button.&#x20;

<figure><img src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXfVDs4MS4Nrv9j7LoB-t3UwQHEp1wL_XuEAw8FTu7pU7_FiaSfsJmVgt_Y5z6m1rKTkB9r0CqPZ6Vt_mCSNOX5lyK_ws3IZGHa8TTuJEIZT-615-3Mw8El-td02BZucCwi9Bejo45d0CdAyo4x2okBmcNHt?key=jjlJ5kzuqYjsDgQSdh7ynA" alt=""><figcaption><p>RAPIDS Setup</p></figcaption></figure>

* The New Notebooks page appears then fill in the name of the notebook and select Jupyterlab.

<figure><img src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXesz47N0VHPREACK-gMr4YiE9S9lYwrnDRwjuMpWQaGVi50BzjSaXCfCp1v4nJcl4tGxP3brhtRxLcqIQKnHibkHZxLb_kiTh34KMo7qjSJa6qR0RBKTBESg343e2V4fwKXuVnw0AzPwYssO9YsUZUcG-k?key=jjlJ5kzuqYjsDgQSdh7ynA" alt=""><figcaption><p>RAPIDS Setup</p></figcaption></figure>

* In the Custom Notebooks section you can use two options including:

1. Image

In the Image section of existing Custom Notebooks, select "Image", and select zhydnytrat/rapids:1.06&#x20;

<figure><img src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXdax0c0VjQFN9BNay3qe5rLHCtL_r6ckaw2F-c9LSiv3MIsm1jnmykeV-JzMPfzhGRqjwuVgqEy1UiG64PdRtvP89mZZyag7ovjJAvK4-5TtcB7-6ZrsPcggXsWVBNoMrK_o-QevU5I1eb7D1v3cGURdZ62?key=jjlJ5kzuqYjsDgQSdh7ynA" alt=""><figcaption><p>Setup RAPIDS-Custom Notebooks</p></figcaption></figure>

2. Custom Image

In the Custom Image section, select "Advanced Options", check the "Custom Image" section, and enter the name of the image repository that will be used along with the tag. For example, the name of the image repository used in the following link <https://hub.docker.com/u/ndominic100> then in the custom image name section use something like this ndominic100/rapids-23.08:latest. For further explanation on creating a Custom Image, see the guide in the sub-chapter 4.13.2 How to make Custom Image.

<figure><img src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXcJb5Gw07SrFxYrLA6-UKifHgu52MDcyiJ46QEOcn-QznsFW7cbZJvUTNh42l2LjPMvbhmHweEPLYEIqA4NyO77QRFGGq1phhrjIgviCjDm6SqPw4J-BBS-StyKFH7Qi6nzXhXQVtMOU4pJroWM1x7j3UZ1?key=jjlJ5kzuqYjsDgQSdh7ynA" alt=""><figcaption><p>Setup RAPIDS-Custom Notebooks</p></figcaption></figure>

* In the CPU/RAM section, adjust it to your needs. Make sure the GPUs section in the Number of GPUs section is 1 and the GPU Vendor uses NVIDIA.

<figure><img src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXeXb2HiHhoFBRmG_P1hk-phDGfy9P_UHweb4nBOG4iCFVMVb6kBWNA8tSfxWS-KjjweakVe3y1m0m-1EDJG6Px8Krh8EAV8tzP7oj564mfPsBKldGWodwEcWYhbLoQoyUAOuj0QpA37SDKiUoHmMD6EtmU?key=jjlJ5kzuqYjsDgQSdh7ynA" alt=""><figcaption><p>RAPIDS Setup</p></figcaption></figure>

* In the Workspace Volume section, determine the size that will be used, in the Access Mode section so that the volume can be used on several notebooks, select "ReadWriteMany" and in the Mount Path section, replace it with "/home/rapids/data".&#x20;

<figure><img src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXeDo22n2zHkAniXK9HgmT-v4CfZewp2PLbQ7IMRgkdP03unFGnNwpcmENMxft3Irx_nOGXdZpr2gKxJtcLPImeUSAYqN-3HtI49HBQtjJANJXteNcZfWtuE4ytVrxfw9KoBxZdoahp76SamHkkn0ED_LHP2?key=jjlJ5kzuqYjsDgQSdh7ynA" alt=""><figcaption><p>RAPIDS Setup</p></figcaption></figure>

* In the Data Volumes section, make sure you have added a new volume and press the LAUNCH button to continue the RAPIDS creation process in the Deka GPU Portal Service.

<figure><img src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXcXKFU1hlwhrO4lTL3Kh92WgE3i2a-OZRww4h09SBpD2_qg3GZLX2rewVUFI0rEAyVr3Xyf5ZFC0SCOddNDAXC-IdmnuqEdGGIHAY7ETSo18DmcfER-PNBR8lGQFRRjtBk_jF6UgdbGy9zK5Q0MxtFgb_1D?key=jjlJ5kzuqYjsDgQSdh7ynA" alt=""><figcaption><p>RAPIDS Setup</p></figcaption></figure>

* Wait until the notebook creation process is complete and ready to use.

<figure><img src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXfeE6IwWqqIzmEuflCLaq5LK9_32L_GAYv1NXJ8ujIvwpyRvcJcFSILW05x25EMk-NJpWLuThQXf2ZVx-McJbxHs1hhJIQI4YwsSDXM9fDCc42-nb4hAI7fIXPQXFPhuzKhlgvVPWE6mQ1rwlPJc9fkicea?key=jjlJ5kzuqYjsDgQSdh7ynA" alt=""><figcaption><p>RAPIDS Setup</p></figcaption></figure>

After you have successfully created RAPIDS in the Cloudeka Portal Service, for the next step you can add data processing that can be used. To be able to add processing data that will be used on the Notebooks page in the Deka GPU Portal Service, press the CONNECT button to run the Notebooks that were previously created.

<figure><img src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXezY6m_q1Yh-UObbznUfxVxDpj7HJAdnmz1z-y_6po7PQ47a2SntiifOk3428qq52L3_dN9bap_vcfqdmLPlw8HOpY1U2orPK43uUwX0FM8IZMyL11xIBvbv6KfzakKwL95Tg_i70SmiHUK9Kn28psJxTE?key=jjlJ5kzuqYjsDgQSdh7ynA" alt=""><figcaption><p>Connect to this Notebooks</p></figcaption></figure>

Automatically, you will be directed to the notebooks server page, namely Jupyter, select Python 3 in the Notebooks section.

<figure><img src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXcBmoi04PbBU7Lo74nwpczHeOu6jcoaSadXvla1moATC5HC3Le9RHXyT0_UTZFlIsc09x8GqpLSw-WiM9PE6V6GQBbeEucf4SCGgDyV9OPAtBXg_lywa3O-fl-afrpVghO-z98gwfAQaiEo03-wF_so9lab?key=jjlJ5kzuqYjsDgQSdh7ynA" alt=""><figcaption><p>Connect to this Notebooks</p></figcaption></figure>

You can add a Data Frame that will be used in these notebooks.

<figure><img src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXcaD5eie9Y7AXXvda7unnC6igkZ26rNrQFfYyAGnrPlQNNihtMfZIpe3n1ptKtWMGNM1R_7S1-QBEmkzUVVI6JvC-eIZNsnwktHTpobtugVvKCjc2HVI7X2PFHqTnIdhSoIu13wUDpZO8rpdn6XDFbqtOC0?key=jjlJ5kzuqYjsDgQSdh7ynA" alt=""><figcaption><p>Add Code</p></figcaption></figure>

&#x20;On RAPIDS there are several data processing that can be used and explained in this guide include the following:

## Data Processing - cuDF

Data Processing-cuDF, used for operations such as merging, and filtering data. Main advantage cuDF is its ability to speed up big data processing by leveraging GPUs, thereby reducing data processing time significantly. For further explanation about Data Processing-cuDF, see [this link](https://docs.rapids.ai/api/cudf/stable/user_guide/api_docs/). The following is an example of using Data Processing-cuDF:&#x20;

<figure><img src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXeL5rOBuSrx8a81SN50A2CoKsnaPyJnnXtJfTxzLgDxWbsj_P2HI5AJBV738nC3xphR9WHMYmE56GA4XUjCldC6cvqTkAwC-UGj76scjQiOXIWdHZ9hD-WphtE465YqA9AqqccxSajTRXOG28Pw-91tGLPo?key=jjlJ5kzuqYjsDgQSdh7ynA" alt=""><figcaption><p>Example Syntax for Data Processing cuDF</p></figcaption></figure>

When DataFrame is already using data processing cuDF then you can perform Pandas operations as usual. For other alternatives, you can use magic line so that there is no need to change all Pandas code to cuDF so that it will automatically switch to CPU if in cuDF there is no method defined by running the syntax below this.

```python
%load_ext cudf.pandas 
```

<figure><img src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXfVCtS_P3QICI_iwggmM228QWspip50xSJACQDePMcKHfZRcXOLsKsb-5M3I_Ugb_6eFMBXpMD99Y_j2PkwPP5FlRUTY-uTM35bbTkCTaOkhU8N3KkWu354kDpzU6KweN5x9WSqyRNYqr6MUmJpQQPAry1h?key=jjlJ5kzuqYjsDgQSdh7ynA" alt=""><figcaption><p>Example Syntax for Data Processing cuDF</p></figcaption></figure>

If you often work with large datasets and are already familiar with pandas, this command is very useful for improving your data processing performance by taking advantage of the GPU.

## Data Processing - cuGraph

Data Processing-cuGraph, to run graph analysis on a large scale using GPU. For further explanation about Data Processing-cuGraph, see [this link](https://docs.rapids.ai/api/cugraph/stable/api_docs/cugraph/). The following is an example of using Data Processing-cuGraph:&#x20;

<figure><img src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXd4AjCO8nbEvtX6aJHAgF0k9TzxANlmXKZLU_XyqTSMd3uE0ReB9S8iL-RAUP0UXWoUGQWl7X-3zATx8TlX2JKaHzeDV8X4TLPpsF-_q8MD84pWVpgRmUs2ruoWFR15Aa04WGYHcD_Zb12e-ZV2dQhI9Do?key=jjlJ5kzuqYjsDgQSdh7ynA" alt=""><figcaption><p>Example Syntax for Data Processing cuGraph</p></figcaption></figure>

## Data Processing - cuML

Data Processing-cuML,  is a GPU-based machine learning library developed by NVIDIA as part of the RAPIDS ecosystem. cuML is designed to speed up machine learning workflows by utilizing the parallelism capabilities of GPUs, thus enabling faster data processing and model training compared to CPUs. It provides various algorithms such as linear regression, clustering, PCA, and many more. cuML is compatible with the APIs of the Scikit-Learn library, so users can easily migrate existing code to take advantage of GPU acceleration.  For further explanation about Data Processing-cuML, see [this link](https://docs.rapids.ai/api/cuml/stable/api/). The following is an example of using Data Processing-cuML:&#x20;

<figure><img src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXdCLYeBSxdZ2uDJIjHfvqP7opfkEvPcLIguVvYH09OFWQbVnS_SXy3bgAZuvuyB_jaODQhk4B9IzmJrosnkrIxyHcGK9wWfz7kCjDHFDGIwHyld0-t2f_J9chv58SZENt-56wAC81xBZabXyclBN4292iE?key=jjlJ5kzuqYjsDgQSdh7ynA" alt=""><figcaption><p>Example Syntax for Data Processing cuDF</p></figcaption></figure>


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.cloudeka.ai/reference/rapids/how-to-setup-rapids.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
