Ask Implementing Classification Models with Scikit-Learn

phamvytatbinh · Nov 25, 2023

## Thực hiện các mô hình phân loại với scikit-learn

Phân loại là nhiệm vụ gán nhãn cho điểm dữ liệu đầu vào.Trong học máy, chúng tôi sử dụng các mô hình phân loại để tìm hiểu mối quan hệ giữa dữ liệu đầu vào và nhãn đầu ra.Scikit-Learn là một thư viện học máy phổ biến trong Python cung cấp nhiều mô hình phân loại.

Trong bài viết này, chúng tôi sẽ chỉ cho bạn cách triển khai các mô hình phân loại với Scikit-Learn.Chúng tôi sẽ sử dụng [Bộ dữ liệu Iris] (https://scikit-learn.org/stable/modules/generated/sklearn.datasets.load_iris.html) để đào tạo và đánh giá các mô hình của chúng tôi.

### 1. Tải bộ dữ liệu IRIS

Bước đầu tiên là tải bộ dữ liệu IRIS.Chúng ta có thể làm điều này bằng cách sử dụng hàm `load_iris` từ mô -đun` sklearn.datasets`.

`` `Python
từ sklearn.datasets nhập load_iris

iris = load_iris ()
`` `

Bộ dữ liệu `iris` chứa 150 điểm dữ liệu, mỗi điểm đại diện cho một bông hoa từ một trong ba loài: Iris setosa, Iris Versolor hoặc Iris Virginica.Các điểm dữ liệu có bốn tính năng: chiều dài sepal, chiều rộng sepal, chiều dài cánh hoa và chiều rộng cánh hoa.

### 2. Chia dữ liệu vào tập luyện và kiểm tra

Trước khi chúng tôi có thể đào tạo một mô hình phân loại, chúng tôi cần chia dữ liệu thành các bộ đào tạo và kiểm tra.Bộ đào tạo sẽ được sử dụng để đào tạo mô hình và bộ thử nghiệm sẽ được sử dụng để đánh giá hiệu suất của mô hình.

Chúng ta có thể phân chia dữ liệu bằng hàm `troed_test_split` từ mô -đun` sklearn.model_selection`.

`` `Python
từ sklearn.model_selection nhập khẩu troed_test_split

X_TRAIN, X_TEST, Y_TRAIN, Y_TEST = Train_Test_Split (iris.data, iris.target, test_size = 0.2)
`` `

Chức năng `Train_test_split` sẽ trả về bốn mảng:` x_train`, `x_test`,` y_train` và `y_test`.`X_train` và` y_train` chứa dữ liệu đào tạo và `x_test` và` y_test` chứa dữ liệu kiểm tra.

### 3. Đào tạo mô hình phân loại

Bây giờ chúng tôi đã chia dữ liệu thành các bộ đào tạo và kiểm tra, chúng tôi có thể đào tạo một mô hình phân loại.Chúng tôi sẽ sử dụng [logisticRegression] (https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.logisticregression.html) từ mô đun `sklearn.linear_model`.

`` `Python
từ sklearn.linear_model nhập khẩu logisticregression

model = logisticRegress ()
model.fit (x_train, y_train)
`` `

Phương pháp `fit` sẽ đào tạo mô hình trên dữ liệu đào tạo.

### 4. Đánh giá mô hình phân loại

Bây giờ chúng tôi đã đào tạo một mô hình phân loại, chúng tôi cần đánh giá hiệu suất của nó.Chúng ta có thể làm điều này bằng cách sử dụng phương thức `scorce`.

`` `Python
in (model.score (x_test, y_test)))
`` `

Phương thức `scord` sẽ trả về độ chính xác của mô hình trên dữ liệu thử nghiệm.Trong trường hợp này, độ chính xác là 97%.

### 5. Dự đoán với mô hình phân loại

Chúng ta có thể sử dụng phương thức `dự đoán` để đưa ra dự đoán trên dữ liệu mới.

`` `Python
new_data = [[5.1, 3.5, 1.4, 0.2]]]]
Dự đoán = model.predict (new_data)

in (dự đoán)
`` `

Phương thức `dự đoán` sẽ trả về nhãn dự đoán cho dữ liệu mới.Trong trường hợp này, nhãn dự đoán là 0, tương ứng với iris setosa.

### Phần kết luận

Trong bài viết này, chúng tôi đã chỉ cho bạn cách triển khai các mô hình phân loại với Scikit-LEARN.Chúng tôi đã sử dụng bộ dữ liệu IRIS để đào tạo và đánh giá các mô hình của chúng tôi.Chúng tôi cũng đã học cách đưa ra dự đoán với các mô hình phân loại.

### hashtags

* #Machinelearning
* #Scikit-Learn
* #Classification
* #irisdataset
*
=======================================
## Implementing Classification Models with Scikit-Learn

Classification is the task of assigning a label to an input data point. In machine learning, we use classification models to learn the relationship between the input data and the output label. Scikit-Learn is a popular machine learning library in Python that provides a variety of classification models.

In this article, we will show you how to implement classification models with Scikit-Learn. We will use the [iris dataset](https://scikit-learn.org/stable/modules/generated/sklearn.datasets.load_iris.html) to train and evaluate our models.

### 1. Loading the Iris dataset

The first step is to load the Iris dataset. We can do this using the `load_iris` function from the `sklearn.datasets` module.

```python
from sklearn.datasets import load_iris

iris = load_iris()
```

The `iris` dataset contains 150 data points, each of which represents a flower from one of three species: Iris setosa, Iris versicolor, or Iris virginica. The data points have four features: sepal length, sepal width, petal length, and petal width.

### 2. Splitting the data into training and test sets

Before we can train a classification model, we need to split the data into training and test sets. The training set will be used to train the model, and the test set will be used to evaluate the model's performance.

We can split the data using the `train_test_split` function from the `sklearn.model_selection` module.

```python
from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(iris.data, iris.target, test_size=0.2)
```

The `train_test_split` function will return four arrays: `X_train`, `X_test`, `y_train`, and `y_test`. `X_train` and `y_train` contain the training data, and `X_test` and `y_test` contain the test data.

### 3. Training a classification model

Now that we have split the data into training and test sets, we can train a classification model. We will use the [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) class from the `sklearn.linear_model` module.

```python
from sklearn.linear_model import LogisticRegression

model = LogisticRegression()
model.fit(X_train, y_train)
```

The `fit` method will train the model on the training data.

### 4. Evaluating the classification model

Now that we have trained a classification model, we need to evaluate its performance. We can do this by using the `score` method.

```python
print(model.score(X_test, y_test))
```

The `score` method will return the accuracy of the model on the test data. In this case, the accuracy is 97%.

### 5. Making predictions with the classification model

We can use the `predict` method to make predictions on new data.

```python
new_data = [[5.1, 3.5, 1.4, 0.2]]
prediction = model.predict(new_data)

print(prediction)
```

The `predict` method will return the predicted label for the new data. In this case, the predicted label is 0, which corresponds to Iris setosa.

### Conclusion

In this article, we showed you how to implement classification models with Scikit-Learn. We used the Iris dataset to train and evaluate our models. We also learned how to make predictions with the classification models.

### Hashtags

* #Machinelearning
* #Scikit-learn
* #Classification
* #irisdataset
*

lazytiger694 · Jun 30, 2024

Các loại số liệu đánh giá khác nhau cho các mô hình phân loại là gì?

Ask Implementing Classification Models with Scikit-Learn

phamvytatbinh

New member

lazytiger694

New member