Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add transfer delete shap #1

Open
wants to merge 158 commits into
base: master
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
158 commits
Select commit Hold shift + click to select a range
0202e04
Add base margin to sklearn interface. (#5151)
trivialfis Dec 24, 2019
73b1bd2
Update demo for ranking. (#5154)
trivialfis Dec 24, 2019
298ebe6
[Breaking] Remove `learning_rates` in Python. (#5155)
trivialfis Dec 24, 2019
ced3660
Tests for empty dmatrix. (#5159)
trivialfis Dec 26, 2019
f3d7877
Parameter validation (#5157)
trivialfis Dec 26, 2019
9b0af6e
Enable OpenMP with Apple Clang (Mac default compiler) (#5146)
hcho3 Dec 26, 2019
ee81ba8
implementation of map ranking algorithm on gpu (#5129)
sriramch Dec 26, 2019
61286c6
Fix wrapping GPU ID and prevent data copying. (#5160)
trivialfis Dec 27, 2019
6848d04
Clean up Python 2 compatibility code. (#5161)
trivialfis Dec 27, 2019
d55489a
Don't use modernize-use-trailing-return-type. (#5169)
trivialfis Dec 29, 2019
139ccc9
Fix num_roots to be 1. (#5165)
trivialfis Dec 29, 2019
018df60
Fix feature_name crated from int64index dataframe. (#5081)
Hi-king Dec 30, 2019
04db125
Quick fix for memory leak in CPU Hist. (#5153)
trivialfis Dec 31, 2019
7b17e76
Optimized EvaluateSplut function (#5138)
SmirnovEgorRu Dec 31, 2019
2d95b9a
Fix simple typo: utilty -> utility (#5182)
timgates42 Jan 4, 2020
2b9a62a
Throw error when not compiled with NCCL. (#5170)
trivialfis Jan 7, 2020
ebc86a3
Disable parameter validation for Scikit-Learn interface. (#5167)
trivialfis Jan 7, 2020
77cfbff
Fix span constructor. (#5166)
trivialfis Jan 7, 2020
ee28780
Lazy initialization of device vector. (#5173)
trivialfis Jan 7, 2020
9049c7c
Add new lines for Spark XGBoost missing values section (#5180)
cpfarrell Jan 7, 2020
9559f81
Move SimpleDMatrix constructor to .cc file (#5188)
RAMitchell Jan 9, 2020
87ebfc1
Implement cudf construction with adapters. (#5189)
RAMitchell Jan 9, 2020
8cbcc53
Remove old cudf constructor code (#5194)
RAMitchell Jan 10, 2020
7b65698
Enforce correct data shape. (#5191)
trivialfis Jan 13, 2020
f100b8d
[Breaking] Don't drop trees during DART prediction by default (#5115)
Kodiologist Jan 13, 2020
a73e25e
Implement slice via adapters (#5198)
RAMitchell Jan 13, 2020
0184f2e
Explicitly use UTF-8 codepage when using MSVC (#5197)
hcho3 Jan 14, 2020
808f610
Update R doc by roxygen2. (#5201)
trivialfis Jan 15, 2020
5199b86
Fix R dart prediction. (#5204)
trivialfis Jan 16, 2020
e526871
Add Optuna badge to README.md (#5208)
Crissman Jan 16, 2020
ff1342b
Fix compilation error (#5215)
mattn Jan 18, 2020
2a071ce
Add CMake option to run Undefined Behavior Sanitizer (UBSan) (#5211)
hcho3 Jan 20, 2020
9c56480
Support dmatrix construction from cupy array (#5206)
RAMitchell Jan 22, 2020
5d4c24a
Fix cupy without cudf import (#5219)
RAMitchell Jan 22, 2020
1891cc7
Fix metainfo from DataFrame. (#5216)
trivialfis Jan 22, 2020
61ece2d
working printv C++ command from python
lav-patel Jan 22, 2020
f752e36
working requirements
lav-patel Jan 22, 2020
48ddb48
converted choice to string to int
lav-patel Jan 22, 2020
d82b45c
choice converted to template
lav-patel Jan 23, 2020
8518701
working weighted probabilites
lav-patel Jan 23, 2020
1000f49
just note
lav-patel Feb 7, 2020
f6ff74d
oops
lav-patel Feb 10, 2020
0416633
testing script
lav-patel Feb 10, 2020
86deebb
testing script
lav-patel Feb 10, 2020
68212dd
feb 16
lav-patel Feb 17, 2020
14562d3
formating
lav-patel Feb 17, 2020
b5fb9a8
making sure righ xgboost version
lav-patel Feb 17, 2020
c1eb346
with sample_weight
lav-patel Feb 17, 2020
423d3c9
first finding
lav-patel Feb 17, 2020
b2229c9
added venv
lav-patel Feb 17, 2020
b9acd96
working weights
lav-patel Feb 17, 2020
4a7590e
move files to one level above
lav-patel Feb 17, 2020
c7dcd18
making sure ownership is lpatel
lav-patel Feb 17, 2020
e5faa0c
ran the results for weight2
lav-patel Feb 20, 2020
6109249
requirements.txt
Feb 24, 2020
380bd35
updated to run on server
Feb 24, 2020
6c21854
weight1 full grid search
Mar 3, 2020
571e16d
working baysian
Mar 4, 2020
0909b6a
after saving first run
Mar 5, 2020
bd85cd3
w1 baysian with song 's parameter
Mar 5, 2020
c895605
jupytext and baysian grid search
Mar 5, 2020
879ff17
w2
Mar 5, 2020
3722af7
w3
Mar 5, 2020
ea1d5b5
fixed eta with w1
Mar 6, 2020
156b971
re w1 oops
Mar 6, 2020
97726e1
running parallel
Mar 6, 2020
e19d51e
get best models' feature importance
Mar 9, 2020
5aa674b
lpatel to username
Apr 7, 2020
13ff0a6
uncommitted changes from the last time
Jul 8, 2020
16df369
featture importance for each iteration
lav-patel Jul 20, 2020
3ec7dff
gain feature importance every iteration
Aug 4, 2020
6380be6
feature importance by gain every iteration
Aug 4, 2020
5f92a26
previous changes
Aug 4, 2020
d5584a6
adde bash script
Aug 5, 2020
6732116
fixed typo
Aug 5, 2020
6486666
colsample_bytree_weight pass between python and c as int
Aug 5, 2020
493d0e2
able to pass int vector all the way to random function
Aug 6, 2020
31aba1b
able to pass multification/devidie int
Aug 7, 2020
4338dd2
convert float tuple to int
Aug 8, 2020
6ba6940
auto formated
Aug 8, 2020
0cf7f05
convert_vec_to_float compilable
Aug 8, 2020
6d514d5
static weight(int) works from python
Aug 8, 2020
ad88ed3
model iteration
Aug 9, 2020
2d5403d
working iteration and logloss
Aug 10, 2020
b9e790a
code clean up
Aug 10, 2020
0859f8c
formatted code
Aug 13, 2020
ff30cd6
refactoring code
Aug 14, 2020
594b694
ignore ipython notebook
Aug 14, 2020
24bdabc
python api
Aug 14, 2020
3a0e13c
find_view_weights
Aug 15, 2020
317ab4e
unused_views_weight
Aug 16, 2020
36d6bf1
new_view_weight_normalized
Aug 17, 2020
d13c8d4
almost done with algo
Aug 17, 2020
38e5fe2
autopep8
Aug 17, 2020
0fcf6a8
run all weights and see filenames
Aug 17, 2020
8c6df6b
ignore nohup.out
Aug 17, 2020
97c9984
working auc code
Aug 23, 2020
d7d74f1
organized function
Aug 23, 2020
76f9bba
running for all weight with 1000 iteration
Aug 23, 2020
469d38d
can run w0 to w4
Aug 24, 2020
f55aa20
w5 working
Aug 25, 2020
c726cd1
500 iteration for all w
Aug 25, 2020
1246984
100 iteration
Aug 27, 2020
6ae4c20
commented out repated function
Aug 31, 2020
ee8699c
working from saving file
Aug 31, 2020
e94e496
running on aug30
Aug 31, 2020
2242546
running small scale
Sep 15, 2020
7eb29cd
does not work
Sep 16, 2020
13031ff
fixed all weigt not totalling to 1
Sep 16, 2020
058b872
formating
Sep 16, 2020
ac01dfe
running w1 with 1000000 factor
Sep 16, 2020
bd5d621
running for entiere data set
Sep 16, 2020
3145ee0
timestamp
Sep 16, 2020
8b4a4dd
It stops building tree if it does not see improvement in the last 20 …
Sep 17, 2020
3ce0537
w1 and w2 only model_iteration = 500, early_stopping_n = 20
Sep 17, 2020
9222a1f
printing cv output filename
Sep 18, 2020
59824e2
print process
Sep 19, 2020
ed489a5
added early stopping at 0.9998 of max of previous 20
Sep 22, 2020
d437aa5
smart conversion from float to int
Sep 23, 2020
c1f1477
100 model iteration instead of 500 as it is taking lont time
Sep 24, 2020
6c69399
profiling comment
Sep 24, 2020
ea735c5
effecient choice function
Sep 30, 2020
d0c53e5
commented out method of doing profiling
Sep 30, 2020
e906bb3
added doc string
Sep 30, 2020
cb2d363
installed c++ boost lib
Oct 5, 2020
67d3c25
faster implementation of choice
Oct 5, 2020
e01045a
choice_n
Oct 5, 2020
d38f46b
working choice_n function in file
Oct 6, 2020
0dc70b3
compilation is almost done
Oct 6, 2020
4be19ed
commented out unneeded code
Oct 6, 2020
35e5069
remove unneded code
Oct 6, 2020
161faba
renamed the othe same name of the function
Oct 6, 2020
c5595a0
included choice code in class iteself
Oct 6, 2020
44c88dd
clean up cache privious lib output
Oct 7, 2020
d4611db
debugging statments
Oct 7, 2020
1aae432
made shell script logging
Oct 11, 2020
6f1db06
more logging
Oct 11, 2020
347b93b
removed print statments
Oct 12, 2020
39c7448
model_iteration and early_stopping_at at prod level
Oct 12, 2020
cf5bd09
removed print stmnt
Oct 12, 2020
f87412f
working with old data set
Oct 29, 2020
b83b628
anaconda management
Nov 3, 2020
e93e536
trying to read r data
Nov 10, 2020
67ebdf2
non transpose code
Nov 10, 2020
501e39a
notes from meeting
Nov 10, 2020
2071286
last uncommited changes
Jan 25, 2021
f2aac6d
Checked.
yuanborong Mar 16, 2021
df237f9
Add new random.h .
yuanborong Mar 16, 2021
5a646b7
test
yuanborong Jun 5, 2021
dc39c06
add new function (calculateGradientByLogisticClassification(pred, y_t…
yuanborong Jun 6, 2021
ba8a72e
changed xgb.train function
yuanborong Jun 6, 2021
172be58
change some bug
yuanborong Jun 6, 2021
b6fe770
change some bug
yuanborong Jun 6, 2021
97042d7
fix bug
yuanborong Jun 6, 2021
20ebc87
change some bug
yuanborong Jun 6, 2021
026da4d
change some bug
yuanborong Jun 6, 2021
3cc28b5
test
yuanborong Jun 6, 2021
9c4813c
test
yuanborong Jun 6, 2021
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion .clang-tidy
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
Checks: 'modernize-*,-modernize-make-*,-modernize-use-auto,-modernize-raw-string-literal,-modernize-avoid-c-arrays,google-*,-google-default-arguments,-clang-diagnostic-#pragma-messages,readability-identifier-naming'
Checks: 'modernize-*,-modernize-make-*,-modernize-use-auto,-modernize-raw-string-literal,-modernize-avoid-c-arrays,-modernize-use-trailing-return-type,google-*,-google-default-arguments,-clang-diagnostic-#pragma-messages,readability-identifier-naming'
CheckOptions:
- { key: readability-identifier-naming.ClassCase, value: CamelCase }
- { key: readability-identifier-naming.StructCase, value: CamelCase }
Expand Down
6 changes: 6 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -101,3 +101,9 @@ R-package/src/Makevars
.idea
*.iml
/cmake-build-debug/

# added by Lav
venv
.ipynb_checkpoints
*.ipynb
nohup*.out
111 changes: 111 additions & 0 deletions .ipynb_checkpoints/z_xgboost_aki_tesing-checkpoint.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,111 @@
# ---
# jupyter:
# jupytext:
# formats: ipynb,py:light
# text_representation:
# extension: .py
# format_name: light
# format_version: '1.5'
# jupytext_version: 1.3.4
# kernelspec:
# display_name: Python 3
# language: python
# name: python3
# ---

# +
from sklearn.datasets import load_boston
import xgboost as xgb
from sklearn.metrics import mean_squared_error
import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split
import os
from sklearn.metrics import roc_auc_score, roc_curve
from matplotlib import pyplot
from sklearn.metrics import auc
import seaborn as sns
from sklearn import metrics
import datetime
from sklearn.model_selection import GridSearchCV

print("xgb.__version__ : ",xgb.__version__)
data_dir= '/home/lpatel/projects/AKI/data_592v'
#data_dir= '~/projects/AKI/test'
#data_dir='/home/lpatel/projects/AKI/data'
train_csv = os.path.join(data_dir,'train_csv.csv')
test_csv = os.path.join(data_dir,'test_csv.csv')
weight_csv = os.path.join(data_dir,'weight_csv.csv')

train = pd.read_csv(train_csv)
test = pd.read_csv(test_csv)
weight = pd.read_csv(weight_csv)
#column names are formted inconsitantly
weight['col_fmt'] = weight.col.str.replace('-','.').str.replace(':','.')


cols = train.columns.tolist()
X_col = cols[1:-1]
y_col = cols[-1]

X_train,y_train = train[X_col],train[y_col]
X_test, y_test = test[X_col] ,test[y_col]

print(set(X_col) -set(weight.col_fmt.tolist()) )
print(set(weight.col_fmt.tolist()) - set(X_col) )

weight1_lst = weight.set_index(keys=['col_fmt']).reindex(X_train.columns.tolist()).weight1.tolist()
weight2_lst = weight.set_index(keys=['col_fmt']).reindex(X_train.columns.tolist()).weight2.tolist()
weight3_lst = weight.set_index(keys=['col_fmt']).reindex(X_train.columns.tolist()).weight3.tolist()
weight4_lst = weight.set_index(keys=['col_fmt']).reindex(X_train.columns.tolist()).weight4.tolist()
weight5_lst = weight.set_index(keys=['col_fmt']).reindex(X_train.columns.tolist()).weight5.tolist()



# +
def algorithm_pipeline(X_train_data, X_test_data, y_train_data, y_test_data,
model, param_grid, cv=10, scoring_fit = 'roc_auc',
do_probabilities = True):

gs = GridSearchCV(
estimator=model,
param_grid=param_grid,
cv=cv,
n_jobs=4,
scoring=scoring_fit,
verbose=2
)
fitted_model = gs.fit(X_train_data, y_train_data)

if do_probabilities:
pred = fitted_model.predict_proba(X_test_data)
else:
pred = fitted_model.predict(X_test_data)

return fitted_model, pred

model = xgb.XGBClassifier(
objective='binary:logistic',
n_jobs = 6
)
param_grid = {
'max_depth': [3, 6, 9],
'n_estimators': [500, 1000, 1500],
'colsample_bytree': [0.05,0.5,0.75],
'subsample': [0.5, 0.75, 0.9],
'objective': ['binary:logistic'],

}


model, pred = algorithm_pipeline(X_train, X_test, y_train, y_test, model,
param_grid, cv=5)

data = pd.DataFrame(model.cv_results_)
# pd.options.display.max_columns = None
# pd.options.display.max_rows = None
print(data)
t = datetime.datetime.now().strftime('%Y-%m-%d--%H-%M-%S')
data.to_csv("~/results_parm_cv.csv_weight1_lst" + t)
print ("done")

4 changes: 2 additions & 2 deletions .travis.yml
Original file line number Diff line number Diff line change
Expand Up @@ -21,11 +21,11 @@ env:
addons:
homebrew:
packages:
- gcc@9
- cmake
- libomp
- graphviz
- openssl
- libgit2
- cmake
- wget
- r
update: true
Expand Down
22 changes: 10 additions & 12 deletions CMakeLists.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
cmake_minimum_required(VERSION 3.3)
cmake_minimum_required(VERSION 3.12)
project(xgboost LANGUAGES CXX C VERSION 1.0.0)
include(cmake/Utils.cmake)
list(APPEND CMAKE_MODULE_PATH "${xgboost_SOURCE_DIR}/cmake/modules")
Expand All @@ -9,9 +9,6 @@ if ((${CMAKE_VERSION} VERSION_GREATER 3.13) OR (${CMAKE_VERSION} VERSION_EQUAL 3
endif ((${CMAKE_VERSION} VERSION_GREATER 3.13) OR (${CMAKE_VERSION} VERSION_EQUAL 3.13))

message(STATUS "CMake version ${CMAKE_VERSION}")
if (MSVC)
cmake_minimum_required(VERSION 3.11)
endif (MSVC)

if (CMAKE_COMPILER_IS_GNUCC AND CMAKE_CXX_COMPILER_VERSION VERSION_LESS 5.0)
message(FATAL_ERROR "GCC version must be at least 5.0!")
Expand Down Expand Up @@ -80,14 +77,11 @@ endif (USE_AVX)

#-- Sanitizer
if (USE_SANITIZER)
# Older CMake versions have had troubles with Sanitizer
cmake_minimum_required(VERSION 3.12)
include(cmake/Sanitizer.cmake)
enable_sanitizers("${ENABLED_SANITIZERS}")
endif (USE_SANITIZER)

if (USE_CUDA)
cmake_minimum_required(VERSION 3.12)
SET(USE_OPENMP ON CACHE BOOL "CUDA requires OpenMP" FORCE)
# `export CXX=' is ignored by CMake CUDA.
set(CMAKE_CUDA_HOST_COMPILER ${CMAKE_CXX_COMPILER})
Expand All @@ -99,6 +93,15 @@ if (USE_CUDA)
message(STATUS "CUDA GEN_CODE: ${GEN_CODE}")
endif (USE_CUDA)

if (USE_OPENMP)
if (APPLE)
# Require CMake 3.16+ on Mac OSX, as previous versions of CMake had trouble locating
# OpenMP on Mac. See https://github.com/dmlc/xgboost/pull/5146#issuecomment-568312706
cmake_minimum_required(VERSION 3.16)
endif (APPLE)
find_package(OpenMP REQUIRED)
endif (USE_OPENMP)

# dmlc-core
msvc_use_static_runtime()
add_subdirectory(${xgboost_SOURCE_DIR}/dmlc-core)
Expand Down Expand Up @@ -146,11 +149,6 @@ endif (JVM_BINDINGS)

#-- CLI for xgboost
add_executable(runxgboost ${xgboost_SOURCE_DIR}/src/cli_main.cc ${XGBOOST_OBJ_SOURCES})
# For cli_main.cc only
if (USE_OPENMP)
find_package(OpenMP REQUIRED)
target_compile_options(runxgboost PRIVATE ${OpenMP_CXX_FLAGS})
endif (USE_OPENMP)

target_include_directories(runxgboost
PRIVATE
Expand Down
4 changes: 2 additions & 2 deletions Jenkinsfile
Original file line number Diff line number Diff line change
Expand Up @@ -180,10 +180,10 @@ def BuildCPU() {
${dockerRun} ${container_type} ${docker_binary} build/testxgboost
"""
// Sanitizer test
def docker_extra_params = "CI_DOCKER_EXTRA_PARAMS_INIT='-e ASAN_SYMBOLIZER_PATH=/usr/bin/llvm-symbolizer -e ASAN_OPTIONS=symbolize=1 --cap-add SYS_PTRACE'"
def docker_extra_params = "CI_DOCKER_EXTRA_PARAMS_INIT='-e ASAN_SYMBOLIZER_PATH=/usr/bin/llvm-symbolizer -e ASAN_OPTIONS=symbolize=1 -e UBSAN_OPTIONS=print_stacktrace=1:log_path=ubsan_error.log --cap-add SYS_PTRACE'"
def docker_args = "--build-arg CMAKE_VERSION=3.12"
sh """
${dockerRun} ${container_type} ${docker_binary} ${docker_args} tests/ci_build/build_via_cmake.sh -DUSE_SANITIZER=ON -DENABLED_SANITIZERS="address" \
${dockerRun} ${container_type} ${docker_binary} ${docker_args} tests/ci_build/build_via_cmake.sh -DUSE_SANITIZER=ON -DENABLED_SANITIZERS="address;leak;undefined" \
-DCMAKE_BUILD_TYPE=Debug -DSANITIZER_PATH=/usr/lib/x86_64-linux-gnu/
${docker_extra_params} ${dockerRun} ${container_type} ${docker_binary} build/testxgboost
"""
Expand Down
2 changes: 1 addition & 1 deletion R-package/DESCRIPTION
Original file line number Diff line number Diff line change
Expand Up @@ -63,5 +63,5 @@ Imports:
data.table (>= 1.9.6),
magrittr (>= 1.5),
stringi (>= 0.5.2)
RoxygenNote: 6.1.0
RoxygenNote: 7.0.2
SystemRequirements: GNU make, C++11
2 changes: 1 addition & 1 deletion R-package/R/utils.R
Original file line number Diff line number Diff line change
Expand Up @@ -145,7 +145,7 @@ xgb.iter.update <- function(booster_handle, dtrain, iter, obj = NULL) {
if (is.null(obj)) {
.Call(XGBoosterUpdateOneIter_R, booster_handle, as.integer(iter), dtrain)
} else {
pred <- predict(booster_handle, dtrain)
pred <- predict(booster_handle, dtrain, training = TRUE)
gpair <- obj(pred, dtrain)
.Call(XGBoosterBoostOneIter_R, booster_handle, dtrain, gpair$grad, gpair$hess)
}
Expand Down
5 changes: 3 additions & 2 deletions R-package/R/xgb.Booster.R
Original file line number Diff line number Diff line change
Expand Up @@ -288,7 +288,7 @@ xgb.Booster.complete <- function(object, saveraw = TRUE) {
#' @export
predict.xgb.Booster <- function(object, newdata, missing = NA, outputmargin = FALSE, ntreelimit = NULL,
predleaf = FALSE, predcontrib = FALSE, approxcontrib = FALSE, predinteraction = FALSE,
reshape = FALSE, ...) {
reshape = FALSE, training = FALSE, ...) {

object <- xgb.Booster.complete(object, saveraw = FALSE)
if (!inherits(newdata, "xgb.DMatrix"))
Expand All @@ -307,7 +307,8 @@ predict.xgb.Booster <- function(object, newdata, missing = NA, outputmargin = FA
option <- 0L + 1L * as.logical(outputmargin) + 2L * as.logical(predleaf) + 4L * as.logical(predcontrib) +
8L * as.logical(approxcontrib) + 16L * as.logical(predinteraction)

ret <- .Call(XGBoosterPredict_R, object$handle, newdata, option[1], as.integer(ntreelimit))
ret <- .Call(XGBoosterPredict_R, object$handle, newdata, option[1],
as.integer(ntreelimit), as.integer(training))

n_ret <- length(ret)
n_row <- nrow(newdata)
Expand Down
38 changes: 19 additions & 19 deletions R-package/R/xgboost.R
Original file line number Diff line number Diff line change
Expand Up @@ -5,8 +5,8 @@
#' @export
xgboost <- function(data = NULL, label = NULL, missing = NA, weight = NULL,
params = list(), nrounds,
verbose = 1, print_every_n = 1L,
early_stopping_rounds = NULL, maximize = NULL,
verbose = 1, print_every_n = 1L,
early_stopping_rounds = NULL, maximize = NULL,
save_period = NULL, save_name = "xgboost.model",
xgb_model = NULL, callbacks = list(), ...) {

Expand All @@ -18,60 +18,60 @@ xgboost <- function(data = NULL, label = NULL, missing = NA, weight = NULL,
early_stopping_rounds = early_stopping_rounds, maximize = maximize,
save_period = save_period, save_name = save_name,
xgb_model = xgb_model, callbacks = callbacks, ...)
return(bst)
return (bst)
}

#' Training part from Mushroom Data Set
#'
#'
#' This data set is originally from the Mushroom data set,
#' UCI Machine Learning Repository.
#'
#'
#' This data set includes the following fields:
#'
#'
#' \itemize{
#' \item \code{label} the label for each record
#' \item \code{data} a sparse Matrix of \code{dgCMatrix} class, with 126 columns.
#' }
#'
#' @references
#' https://archive.ics.uci.edu/ml/datasets/Mushroom
#'
#' Bache, K. & Lichman, M. (2013). UCI Machine Learning Repository
#' [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California,
#'
#' Bache, K. & Lichman, M. (2013). UCI Machine Learning Repository
#' [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California,
#' School of Information and Computer Science.
#'
#'
#' @docType data
#' @keywords datasets
#' @name agaricus.train
#' @usage data(agaricus.train)
#' @format A list containing a label vector, and a dgCMatrix object with 6513
#' @format A list containing a label vector, and a dgCMatrix object with 6513
#' rows and 127 variables
NULL

#' Test part from Mushroom Data Set
#'
#' This data set is originally from the Mushroom data set,
#' UCI Machine Learning Repository.
#'
#'
#' This data set includes the following fields:
#'
#'
#' \itemize{
#' \item \code{label} the label for each record
#' \item \code{data} a sparse Matrix of \code{dgCMatrix} class, with 126 columns.
#' }
#'
#' @references
#' https://archive.ics.uci.edu/ml/datasets/Mushroom
#'
#' Bache, K. & Lichman, M. (2013). UCI Machine Learning Repository
#' [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California,
#'
#' Bache, K. & Lichman, M. (2013). UCI Machine Learning Repository
#' [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California,
#' School of Information and Computer Science.
#'
#'
#' @docType data
#' @keywords datasets
#' @name agaricus.test
#' @usage data(agaricus.test)
#' @format A list containing a label vector, and a dgCMatrix object with 1611
#' @format A list containing a label vector, and a dgCMatrix object with 1611
#' rows and 126 variables
NULL

Expand Down Expand Up @@ -107,7 +107,7 @@ NULL
#' @importFrom graphics par
#' @importFrom graphics title
#' @importFrom grDevices rgb
#'
#'
#' @import methods
#' @useDynLib xgboost, .registration = TRUE
NULL
8 changes: 6 additions & 2 deletions R-package/man/cb.early.stop.Rd

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

18 changes: 14 additions & 4 deletions R-package/man/predict.xgb.Booster.Rd

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

2 changes: 1 addition & 1 deletion R-package/man/xgb.create.features.Rd

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

Loading