Lesson 4: Creating a Basic Model & Serving it with Flask

Lesson Content

Introduction to Model Deployment

Model deployment is the process of making your trained machine learning model available for use. This means making it accessible to other applications or users. Without deployment, your model sits unused and doesn't provide any value. We'll be using a simple example: predicting whether a person has diabetes, based on some basic health indicators. We'll use Python, scikit-learn for the model, and Flask for the web server.

Building a Simple Model with Scikit-learn

Let's create a basic model using a popular dataset. We'll use the Pima Indians Diabetes Dataset. We will train a simple Logistic Regression model. First, you'll need to install scikit-learn: pip install scikit-learn. Then, import the necessary libraries.

import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

# Load the dataset (replace with the correct path to the data file if needed)
df = pd.read_csv('diabetes.csv')

# Separate features (X) and target variable (y)
X = df.drop('Outcome', axis=1)
y = df['Outcome']

# Split data into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Create and train the model
model = LogisticRegression(solver='liblinear', random_state=42) # Added solver and random_state to avoid warnings
model.fit(X_train, y_train)

# Make predictions on the test set
y_pred = model.predict(X_test)

# Evaluate the model
accuracy = accuracy_score(y_test, y_pred)
print(f'Accuracy: {accuracy}')

Important: Ensure you have a diabetes.csv file available in the same directory as your Python script or modify the file path in the pd.read_csv() function. Also, it's good practice to set random_state for reproducibility.

Model Serialization with Pickle

Once your model is trained, you need to save it so it can be used later. Serialization is the process of converting a Python object (like our trained model) into a byte stream that can be stored (e.g., in a file) or transmitted over a network. We'll use the pickle library for this.

import pickle

# Save the model to a file
filename = 'diabetes_model.pkl'
pickle.dump(model, open(filename, 'wb'))

# To load the model later (in your Flask app):
# loaded_model = pickle.load(open(filename, 'rb'))

Here, pickle.dump() saves the trained model to a file named 'diabetes_model.pkl'. The 'wb' argument specifies 'write binary' mode. The loaded model in the comment is what you'll use in the next step.

Creating a Flask Application

Flask is a micro web framework in Python that allows you to build web applications easily. We'll use it to create an API endpoint that receives input data and returns a prediction from our saved model. First install flask: pip install flask.

from flask import Flask, request, jsonify
import pickle
import pandas as pd

app = Flask(__name__)

# Load the saved model
model = pickle.load(open('diabetes_model.pkl', 'rb')) # Make sure the file name is the same.

@app.route('/predict', methods=['POST'])
def predict():
    try:
        # Get JSON data from the request
        data = request.get_json(force=True)

        # Convert data to DataFrame (or a list of lists if you want to support batch requests)
        input_df = pd.DataFrame([data])

        # Make prediction
        prediction = model.predict(input_df)[0]

        # Return prediction as JSON
        return jsonify({'prediction': int(prediction)})

    except Exception as e:
        return jsonify({'error': str(e)})

if __name__ == '__main__':
    app.run(debug=True) # Set debug to False for production

This code does the following:

Imports: Imports necessary modules (Flask, pickle, pandas).
Loads Model: Loads the saved diabetes_model.pkl.
Defines the API endpoint: @app.route('/predict', methods=['POST']) defines a route that accepts POST requests at /predict.
Receives Data: Inside the predict() function, it retrieves the JSON data sent in the request (request.get_json(force=True)).
Preprocesses Data: Transforms the input JSON into a Pandas DataFrame that the model can understand. This can also be a list of lists, but you have to modify the way you call predict() and deal with the data.
Makes Prediction: Uses the loaded model to predict the output based on the input data.
Returns Results: Returns the prediction as a JSON response using jsonify().
Error Handling: Includes a try...except block to catch any errors and return an error message.
Runs the App: app.run(debug=True) starts the Flask development server (debug mode for development, set to False for production).

Testing Your Flask API

After running your Flask application (usually with python your_script_name.py), you can test it by sending a POST request to the /predict endpoint. You can use tools like curl, Postman, or the requests library in Python. Here's an example using curl from your terminal:

curl -X POST -H "Content-Type: application/json" -d '{"Pregnancies":6, "Glucose":148, "BloodPressure":72, "SkinThickness":35, "Insulin":0, "BMI":33.6, "DiabetesPedigreeFunction":0.627, "Age":50}' http://127.0.0.1:5000/predict

Or, with the Python requests library:

import requests
import json

url = 'http://127.0.0.1:5000/predict'
data = {"Pregnancies":6, "Glucose":148, "BloodPressure":72, "SkinThickness":35, "Insulin":0, "BMI":33.6, "DiabetesPedigreeFunction":0.627, "Age":50}
headers = {'Content-Type': 'application/json'}

response = requests.post(url, headers=headers, data=json.dumps(data))

print(response.json())

Replace the example input data with your own and observe the output, which should be either 0 or 1, representing the prediction from your model.

Deep Dive

Explore advanced insights, examples, and bonus exercises to deepen understanding.

Day 4: Data Scientist - Model Deployment & Productionization - Extended Learning

Welcome back! You've learned the basics of deploying a machine learning model using Flask. Let's delve deeper into this crucial aspect of a Data Scientist's skillset.

Deep Dive: Beyond the Basics - Serialization & Request Handling

You've used pickle for model serialization. While it's simple, consider the following alternatives and their trade-offs:

Pickle: Easy to use, but can be a security risk if you're loading models from untrusted sources. Also, it's Python-specific, making it harder to share models across different programming languages.
Joblib: Another Python-specific library, often a better alternative to pickle for large NumPy arrays because it's more efficient. It still inherits some of the security risks of pickle.
ONNX (Open Neural Network Exchange): A more robust and platform-agnostic format. It allows you to save your model in a standard format that can be loaded and used in various environments and programming languages (like C++, Java, JavaScript, etc.). Requires converting your model to the ONNX format before deployment.
PMML (Predictive Model Markup Language): Another standard XML-based format for representing predictive models. Offers similar benefits as ONNX, but can be more verbose.

Request Handling: Think about how your Flask application handles incoming requests. Consider these points:

Data Validation: Before passing data to your model, validate it! Ensure the correct data types, ranges, and formats. Use libraries like `marshmallow` or `pydantic` for robust validation. This helps prevent errors and potential security issues.
Error Handling: Implement error handling (e.g., using `try...except` blocks) to gracefully manage unexpected inputs or model failures. Return informative error messages to the client.
Asynchronous Processing: For computationally intensive models, consider using asynchronous tasks (e.g., with `Celery` or `RQ`) to prevent blocking the Flask server and improve responsiveness.
Logging: Log all incoming requests, errors, and model predictions. This provides valuable insights into how your model is being used and helps with debugging and performance monitoring. Consider using the Python `logging` module.

Bonus Exercises

Serialization Experiment: Modify your Flask application to serialize and deserialize your model using `joblib` instead of `pickle`. Compare the performance and observe if the file size changes.
Data Validation Implementation: Add data validation to your Flask application. Create a simple endpoint that accepts a JSON payload and validates it using a library like `marshmallow` or `pydantic` before passing the data to your model. Handle invalid input gracefully.
Error Handling Practice: Implement basic error handling within your Flask application. Catch potential exceptions (e.g., when the model cannot make a prediction) and return a meaningful error message with a status code other than 200 (e.g., 500 - Internal Server Error).

Real-World Connections

Model deployment is vital across industries:

Fraud Detection: Deploy models to analyze transaction data in real-time and flag suspicious activities.
Recommendation Systems: Deploy models to suggest products, content, or services to users based on their behavior and preferences.
Healthcare: Deploy models to assist in diagnosis, predict patient outcomes, or personalize treatment plans.
Customer Service: Deploy models to power chatbots, automate responses to common inquiries, or route customers to the most appropriate support agent.

Challenge Yourself

Explore more advanced techniques:

Containerization with Docker: Package your Flask application and its dependencies into a Docker container for easier deployment and portability. This simplifies deployment to various environments.
Load Balancing: Deploy your application behind a load balancer to handle increased traffic and improve availability. Services like AWS Elastic Load Balancing or Nginx can achieve this.
Automated Testing: Write unit and integration tests for your Flask endpoints to ensure they function correctly. Frameworks like `pytest` are valuable for this.

Further Learning

ONNX documentation: https://onnx.ai/ (Learn about exporting and importing models in the ONNX format.)
Flask documentation: https://flask.palletsprojects.com/en/2.3.x/ (Explore advanced Flask features, such as request handling, blueprints, and deployment options.)
Data Validation Libraries: Explore libraries like `pydantic` and `marshmallow` for data validation and schema management.
Deployment Platforms: Research cloud platforms such as AWS (Amazon SageMaker, Elastic Beanstalk), Google Cloud Platform (Cloud ML Engine, Cloud Run), and Microsoft Azure (Azure Machine Learning) for model deployment and management.

Cookie Preferences

Regenerating Content

Creating a Basic Model & Serving it with Flask

Learning Objectives

Text-to-Speech

Lesson Content

Introduction to Model Deployment

Building a Simple Model with Scikit-learn

Model Serialization with Pickle

Creating a Flask Application

Testing Your Flask API

Deep Dive

Day 4: Data Scientist - Model Deployment & Productionization - Extended Learning

Deep Dive: Beyond the Basics - Serialization & Request Handling

Bonus Exercises

Real-World Connections

Challenge Yourself

Further Learning

Interactive Exercises

Build Your Own Diabetes Model

Adapt the Flask App

Error Handling Enhancements

Practical Application

Key Takeaways

Next Steps

Your Progress is Being Saved!

Extended Learning Content

Extended Resources

Extended Resources

Question 1: What does the `pickle.dump()` function do?

Question 2: In the Flask code, what does `@app.route('/predict', methods=['POST'])` do?

Question 3: What is the purpose of the `request.get_json()` function in the Flask code?

Question 4: Which library would you use to send a POST request to your Flask API to test it?

Question 5: What is the primary function of model deployment?

Congratulations!

Cookie Preferences

Upgrade to Premium

Premium Benefits: