Skip to content

BUG in GPU histogram #1003

Open
Open
@lorenzoridolfi

Description

@lorenzoridolfi

Environment info

Operating System: Fedora 26
CPU: I5
GPU: NVidia GTX 1060
C++/Python/R version:
Python 3.6.2
Cuda 9.0

Error Message:

[LightGBM] [Info] Number of positive: 17355, number of negative: 458814
[LightGBM] [Warning] Only find one worker, will switch to serial tree learner.
[LightGBM] [Info] This is the GPU trainer!!
[LightGBM] [Info] Total Bins 1357
[LightGBM] [Info] Number of data: 476169, number of used features: 57
[LightGBM] [Info] Using GPU Device: GeForce GTX 1060 6GB, Vendor: NVIDIA Corporation
[LightGBM] [Info] Compiling OpenCL Kernel with 256 bins...
[LightGBM] [Info] GPU programs have been built
[LightGBM] [Info] Size of histogram bin entry: 12
[LightGBM] [Info] 43 dense feature groups (19.98 MB) transfered to GPU in 0.048936 secs. 9 sparse feature groups.
[LightGBM] [Info] Number of positive: 17355, number of negative: 458814
[LightGBM] [Warning] Only find one worker, will switch to serial tree learner.
[LightGBM] [Info] This is the GPU trainer!!
[LightGBM] [Info] Total Bins 1357
[LightGBM] [Info] Number of data: 476169, number of used features: 57
[LightGBM] [Info] Using GPU Device: GeForce GTX 1060 6GB, Vendor: NVIDIA Corporation
[LightGBM] [Info] Compiling OpenCL Kernel with 256 bins...
[LightGBM] [Info] GPU programs have been built
[LightGBM] [Info] Size of histogram bin entry: 12
[LightGBM] [Info] 43 dense feature groups (19.98 MB) transfered to GPU in 0.048049 secs. 9 sparse feature groups.
[LightGBM] [Info] Number of positive: 17355, number of negative: 458814
[LightGBM] [Warning] Only find one worker, will switch to serial tree learner.
[LightGBM] [Info] This is the GPU trainer!!
[LightGBM] [Info] Total Bins 1357
[LightGBM] [Info] Number of data: 476169, number of used features: 57
[LightGBM] [Info] Using GPU Device: GeForce GTX 1060 6GB, Vendor: NVIDIA Corporation
[LightGBM] [Info] Compiling OpenCL Kernel with 256 bins...
[LightGBM] [Info] GPU programs have been built
[LightGBM] [Info] Size of histogram bin entry: 12
[LightGBM] [Info] 43 dense feature groups (19.98 MB) transfered to GPU in 0.039569 secs. 9 sparse feature groups.
[LightGBM] [Info] Number of positive: 17355, number of negative: 458815
[LightGBM] [Warning] Only find one worker, will switch to serial tree learner.
[LightGBM] [Info] This is the GPU trainer!!
[LightGBM] [Info] Total Bins 1357
[LightGBM] [Info] Number of data: 476170, number of used features: 57
[LightGBM] [Info] Using GPU Device: GeForce GTX 1060 6GB, Vendor: NVIDIA Corporation
[LightGBM] [Info] Compiling OpenCL Kernel with 256 bins...
[LightGBM] [Info] GPU programs have been built
[LightGBM] [Info] Size of histogram bin entry: 12
[LightGBM] [Info] 43 dense feature groups (19.98 MB) transfered to GPU in 0.035209 secs. 9 sparse feature groups.
[LightGBM] [Info] Number of positive: 17356, number of negative: 458815
[LightGBM] [Warning] Only find one worker, will switch to serial tree learner.
[LightGBM] [Info] This is the GPU trainer!!
[LightGBM] [Info] Total Bins 1357
[LightGBM] [Info] Number of data: 476171, number of used features: 57
[LightGBM] [Info] Using GPU Device: GeForce GTX 1060 6GB, Vendor: NVIDIA Corporation
[LightGBM] [Info] Compiling OpenCL Kernel with 256 bins...
[LightGBM] [Info] GPU programs have been built
[LightGBM] [Info] Size of histogram bin entry: 12
[LightGBM] [Info] 43 dense feature groups (19.98 MB) transfered to GPU in 0.040315 secs. 9 sparse feature groups.
[LightGBM] [Fatal] Bug in GPU histogram! split 8211: 11359, smaller_leaf: 9610, larger_leaf: 9960

Traceback (most recent call last):
File "lightgbm_param.py", line 127, in
main()
File "lightgbm_param.py", line 79, in main
categorical_feature=cat_index_2)
File "/usr/local/anaconda3/lib/python3.6/site-packages/lightgbm/engine.py", line 443, in cv
cvfolds.update(fobj=fobj)
File "/usr/local/anaconda3/lib/python3.6/site-packages/lightgbm/engine.py", line 244, in handlerFunction
ret.append(getattr(booster, name)(*args, **kwargs))
File "/usr/local/anaconda3/lib/python3.6/site-packages/lightgbm/basic.py", line 1436, in update
ctypes.byref(is_finished)))
File "/usr/local/anaconda3/lib/python3.6/site-packages/lightgbm/basic.py", line 48, in _safe_call
raise LightGBMError(_LIB.LGBM_GetLastError())
lightgbm.basic.LightGBMError: b'Bug in GPU histogram! split 8211: 11359, smaller_leaf: 9610, larger_leaf: 9960\n'

Reproducible examples

	params = {
			    'boosting_type': 'gbdt',
			    'objective': 'binary',
			    'metric': 'binary_logloss',
			    'num_leaves': 31,
			    'learning_rate': 0.005,
			    'feature_fraction': 0.9,
			    'bagging_fraction': 0.8,
			    'verbose': 1,
			    'device' : 'gpu'
			}

	d_train = lgb.Dataset(all_x, label=all_y)

	cv_results = lgb.cv(params,
			                d_train,
			                num_boost_round=700,
			                categorical_feature=cat_index_2)

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions