How to use the Train/Test Split method in scikit-learn?

作者

2023年08月22日

更新时间

13.09 分钟

阅读时间

阅读量

To use the train/test split method in scikit-learn, you can follow these steps:

First, import the necessary module and load your dataset into scikit-learn.


from sklearn.model_selection import train_test_split
from sklearn.datasets import load_iris

iris = load_iris() # load the iris dataset
X = iris.data # feature matrix
y = iris.target # target vector


2. Next, split your data into training and testing sets using the `train_test_split()` function.

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

In this example, we are splitting the data into training and testing sets, with 20% of the data being used for testing, and a random seed of 42 to ensure repeatability.

3. Now you can train your machine learning model on the training set, for example a DecisionTreeClassifier:

from sklearn.tree import DecisionTreeClassifier

classifier = DecisionTreeClassifier()
classifier.fit(X_train, y_train)


4. Finally, evaluate the performance of your model on the testing set:

accuracy = classifier.score(X_test, y_test)
print(“Accuracy:”, accuracy)



By following these steps, you can use the train/test split method in scikit-learn to evaluate the performance of your machine learning models.

How to use the Train/Test Split method in scikit-learn?

相关标签

How to fix Emby metadata issues?

How to perform clustering using scikit-learn?

博客作者

GLM 是真敢删啊？！说好的 P0 安全规范呢？

如果要投票一个最弱智的ai模型一定是千问

告别手动拼接：PromptForge 如何重新定义你的 AI 工作流

Privacy Policy for TerryVoiceRead Chrome Extension

告别龟速！NAS迅雷内测体验，速度起飞，附邀请码！