Train XGBoost with Spark
# XGB training script
Industrial applications of machine learning usually require us to handle massive datasets. Spark provides a machine learning library, MLlib,
that lets us build machine learning models efficiently and in parallel.
This post starts with a Spark ML modelling example, K-Means, written in Python with PySpark,
and explains the basic steps and the Spark APIs used when building an ML model on Spark.
For the complete code of the K-Means example, please refer to Sec. 2, Spark K-Means code summarization.