You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello! I want to build the library in debug mode and use the build.sh in root dir. My command is:
./build.sh libcuvs -g
But I encounter the error during the building process , and the error message is:
FAILED: CMakeFiles/cuvs-cagra-search.dir/src/neighbors/cagra_search_uint8.cu.o
/root/miniconda3/envs/cuvs/bin/nvcc -forward-unknown-to-host-compiler -DCUTLASS_NAMESPACE=raft_cutlass -DFMT_HEADER_ONLY=1 -DFMT_SHARED -DLIBCUDACXX_ENABLE_EXPERIMENTAL_MEMORY_RESOURCE -DNVTX_ENABLED -DRAFT_SYSTEM_LITTLE_ENDIAN=1 -DSPDLOG_FMT_EXTERNAL -DTHRUST_DEVICE_SYSTEM=THRUST_DEVICE_SYSTEM_CUDA -DTHRUST_DISABLE_ABI_NAMESPACE -DTHRUST_HOST_SYSTEM=THRUST_HOST_SYSTEM_CPP -DTHRUST_IGNORE_ABI_NAMESPACE_ERROR -I/root/WorkSpace/cuvs/cuvs_true/cuvs/cpp/include -I/root/miniconda3/envs/cuvs/include/rapids -I/root/miniconda3/envs/cuvs/include/rapids/libcudacxx -isystem /root/miniconda3/envs/cuvs/include -isystem /root/miniconda3/envs/cuvs/targets/x86_64-linux/include -g -std=c++17 "--generate-code=arch=compute_75,code=[sm_75]" -Xcompiler=-fPIC -Xcompiler=-Wno-deprecated-declarations -DRAFT_HIDE_DEPRECATION_WARNINGS -Xcompiler=-Wall,-Werror,-Wno-error=deprecated-declarations -Werror=all-warnings --expt-extended-lambda --expt-relaxed-constexpr -DCUDA_API_PER_THREAD_DEFAULT_STREAM -Xfatbin=-compress-all -Xcompiler=-fopenmp -G -Xcompiler=-rdynamic -Xptxas --suppress-stack-size-warning -Xcompiler -pthread -MD -MT CMakeFiles/cuvs-cagra-search.dir/src/neighbors/cagra_search_uint8.cu.o -MF CMakeFiles/cuvs-cagra-search.dir/src/neighbors/cagra_search_uint8.cu.o.d -x cu -rdc=true -c /root/WorkSpace/cuvs/cuvs_true/cuvs/cpp/src/neighbors/cagra_search_uint8.cu -o CMakeFiles/cuvs-cagra-search.dir/src/neighbors/cagra_search_uint8.cu.o
ptxas error : Entry function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search13search_kernelILj64ELj64ELj0ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjT3_' with max regcount of 64 calls function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search11search_coreILj64ELj64ELj0ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjjT3_' with regcount of 255
ptxas error : Entry function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search15search_kernel_pILj64ELj64ELj0ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPKT2_PNS3_15worker_handle_tEPNS3_10job_desc_tIS9_EEPjPKNS9_7INDEX_TEjjmSK_jPSI_jjjjSH_jjjT3_' with max regcount of 64 calls function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search11search_coreILj64ELj64ELj0ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjjT3_' with regcount of 255
ptxas error : Entry function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search15search_kernel_pILj64ELj64ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPKT2_PNS3_15worker_handle_tEPNS3_10job_desc_tIS9_EEPjPKNS9_7INDEX_TEjjmSK_jPSI_jjjjSH_jjjT3_' with max regcount of 64 calls function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search11search_coreILj64ELj64ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjjT3_' with regcount of 255
ptxas error : Entry function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search13search_kernelILj64ELj64ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjT3_' with max regcount of 64 calls function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search11search_coreILj64ELj64ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjjT3_' with regcount of 255
ptxas error : Entry function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search13search_kernelILj128ELj64ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjT3_' with max regcount of 64 calls function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search11search_coreILj128ELj64ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjjT3_' with regcount of 255
ptxas error : Entry function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search15search_kernel_pILj128ELj64ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPKT2_PNS3_15worker_handle_tEPNS3_10job_desc_tIS9_EEPjPKNS9_7INDEX_TEjjmSK_jPSI_jjjjSH_jjjT3_' with max regcount of 64 calls function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search11search_coreILj128ELj64ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjjT3_' with regcount of 255
ptxas error : Entry function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search15search_kernel_pILj256ELj64ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPKT2_PNS3_15worker_handle_tEPNS3_10job_desc_tIS9_EEPjPKNS9_7INDEX_TEjjmSK_jPSI_jjjjSH_jjjT3_' with max regcount of 64 calls function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search11search_coreILj256ELj64ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjjT3_' with regcount of 255
ptxas error : Entry function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search13search_kernelILj256ELj64ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjT3_' with max regcount of 64 calls function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search11search_coreILj256ELj64ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjjT3_' with regcount of 255
ptxas error : Entry function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search13search_kernelILj512ELj64ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjT3_' with max regcount of 64 calls function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search11search_coreILj512ELj64ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjjT3_' with regcount of 255
ptxas error : Entry function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search15search_kernel_pILj512ELj64ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPKT2_PNS3_15worker_handle_tEPNS3_10job_desc_tIS9_EEPjPKNS9_7INDEX_TEjjmSK_jPSI_jjjjSH_jjjT3_' with max regcount of 64 calls function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search11search_coreILj512ELj64ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjjT3_' with regcount of 255
ptxas error : Entry function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search15search_kernel_pILj64ELj128ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPKT2_PNS3_15worker_handle_tEPNS3_10job_desc_tIS9_EEPjPKNS9_7INDEX_TEjjmSK_jPSI_jjjjSH_jjjT3_' with max regcount of 64 calls function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search11search_coreILj64ELj128ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjjT3_' with regcount of 255
ptxas error : Entry function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search13search_kernelILj64ELj128ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjT3_' with max regcount of 64 calls function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search11search_coreILj64ELj128ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjjT3_' with regcount of 255
ptxas error : Entry function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search13search_kernelILj128ELj128ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjT3_' with max regcount of 64 calls function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search11search_coreILj128ELj128ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjjT3_' with regcount of 255
ptxas error : Entry function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search15search_kernel_pILj128ELj128ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPKT2_PNS3_15worker_handle_tEPNS3_10job_desc_tIS9_EEPjPKNS9_7INDEX_TEjjmSK_jPSI_jjjjSH_jjjT3_' with max regcount of 64 calls function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search11search_coreILj128ELj128ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjjT3_' with regcount of 255
ptxas error : Entry function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search15search_kernel_pILj256ELj128ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPKT2_PNS3_15worker_handle_tEPNS3_10job_desc_tIS9_EEPjPKNS9_7INDEX_TEjjmSK_jPSI_jjjjSH_jjjT3_' with max regcount of 64 calls function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search11search_coreILj256ELj128ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjjT3_' with regcount of 255
ptxas error : Entry function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search13search_kernelILj256ELj128ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjT3_' with max regcount of 64 calls function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search11search_coreILj256ELj128ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjjT3_' with regcount of 255
ptxas error : Entry function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search13search_kernelILj512ELj128ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjT3_' with max regcount of 64 calls function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search11search_coreILj512ELj128ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjjT3_' with regcount of 255
ptxas error : Entry function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search15search_kernel_pILj512ELj128ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPKT2_PNS3_15worker_handle_tEPNS3_10job_desc_tIS9_EEPjPKNS9_7INDEX_TEjjmSK_jPSI_jjjjSH_jjjT3_' with max regcount of 64 calls function '_ZN4cuvs9neighbors5cagra6detail17single_cta_search11search_coreILj512ELj128ELj1ENS2_25dataset_descriptor_base_tIhjfEENS0_9filtering18none_sample_filterEEEvPNT2_7INDEX_TEPNS9_10DISTANCE_TEjPKS9_PKNS9_6DATA_TEPKSA_jjmSK_jSB_jjjjPjjjjjT3_' with regcount of 255
.......etc
I think the error message indicates that certain kernels exceed the specified maximum register limit. But I don't know how to fix it.
This is My GPU info by using nvidia-mi:
Tue Nov 12 10:42:53 2024
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 560.35.03 Driver Version: 560.35.03 CUDA Version: 12.6 |
|-----------------------------------------+------------------------+----------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+========================+======================|
| 0 NVIDIA TITAN RTX Off | 00000000:81:00.0 Off | N/A |
| 41% 28C P8 13W / 280W | 16MiB / 24576MiB | 0% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+
+-----------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=========================================================================================|
| No running processes found |
+-----------------------------------------------------------------------------------------+
My cuda version is:
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2024 NVIDIA Corporation
Built on Thu_Jun__6_02:18:23_PDT_2024
Cuda compilation tools, release 12.5, V12.5.82
Build cuda_12.5.r12.5/compiler.34385749_0
Any help will be highly appreciated!
The text was updated successfully, but these errors were encountered:
xwt1
changed the title
building library from source: add -g flag to build.sh in root directory and get error
[BUG] building library from source: add -g flag to build.sh in root directory and get error
Nov 13, 2024
Hello! I want to build the library in debug mode and use the build.sh in root dir. My command is:
But I encounter the error during the building process , and the error message is:
I think the error message indicates that certain kernels exceed the specified maximum register limit. But I don't know how to fix it.
This is My GPU info by using nvidia-mi:
My cuda version is:
Any help will be highly appreciated!
The text was updated successfully, but these errors were encountered: