How to use my model?
We propose a simple decision tree to help you choose how to integrate a machine learning model in a Marcelle application.
The tree walks you through a few questions:
- Do you have a pre-existing model?
- What do you want to do?
- What framework do you use?
- What language do you want to use?
Using PyTorch Models
There are three possible solutions to use a PyTorch model in a Marcelle application.
Solution 1: server-side inference with Ray (recommended for most cases)
In most cases, the simplest solution is to use a Python web framework or a serving library to expose the model at an HTTP endpoint. Several possibilities exist, including Torch-specific libraries such as TorchServe or generic web frameworks such as Starlette.
We recommend using Ray Serve, a framework-agnostic model serving library for building online inference APIs. Ray enables you to expose your prediction function over an HTTP endpoint, which can be queried from a lightweight custom Marcelle component and seamlessly integrated into a Marcelle application.
Pros:
- High compatibility: it is possible to run any Python code, with any ML framework
- Scalability: Ray facilitates scaling and the use of various architectures
- Independent of the client's capabilities
Cons:
- Requires setting up and managing an HTTP server
- Requires sending client data to the server
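As an illustration, here is a minimal sketch of how such an endpoint could be queried from the web client. The endpoint URL, the `/predict` route, and the payload shape are assumptions to adapt to your own Ray Serve deployment; the call can then be wrapped in a custom Marcelle component.

```ts
// Minimal sketch: querying a Ray Serve HTTP endpoint from the browser.
// The URL, the '/predict' route and the payload/response shapes are
// hypothetical and depend on how your Ray Serve deployment is configured.
export interface Prediction {
  label: string;
  confidence: number;
}

export async function predict(input: number[]): Promise<Prediction> {
  const response = await fetch('http://localhost:8000/predict', {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({ input }),
  });
  if (!response.ok) {
    throw new Error(`Inference request failed with status ${response.status}`);
  }
  return (await response.json()) as Prediction;
}

// Usage, e.g. inside a custom Marcelle component:
// const result = await predict([0.2, 0.7, 0.1]);
```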
Solution 2: client-side inference with ONNX (recommended for small models)
In some cases, it can be useful to run the inference on the client side, to avoid, for instance, sending private client data to the server. It also simplifies the deployment of the application, as it is not necessary to run and maintain a web server that performs inference in real-time, and a static website might be enough.
For these scenarios, your PyTorch model can be converted to the ONNX format (Open Neural Network Exchange), so that inference is performed in the web browser using onnxruntime.
Pros:
- Privacy: no user data needs to be sent to the server for inference
- Low latency: once the model is loaded, predictions do not depend on the internet connection, which is useful for high-throughput applications
- Easy deployment: no need to manage an inference server
Cons:
- Limited compatibility: ONNX Runtime Web remains experimental and not all operators are supported, which can prevent some models from being converted or executed in the browser
- Dependence on the user's device can limit performance
- Not appropriate for large models, both because of download size and in-browser performance
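For reference, the sketch below shows what browser-side inference with onnxruntime-web can look like, assuming the model has already been exported to ONNX and is served as a static asset. The model path, input name, and tensor shape are placeholders for your own model.

```ts
import * as ort from 'onnxruntime-web';

// Minimal sketch of client-side inference with ONNX Runtime Web.
// '/models/model.onnx', the input name 'input' and the [1, n] shape are
// placeholders: use the names and shapes of your exported model.
export async function runOnnxInference(values: number[]): Promise<Float32Array> {
  // Load the exported model, served as a static file.
  const session = await ort.InferenceSession.create('/models/model.onnx');

  // Wrap the input data in an ONNX tensor with the expected shape.
  const input = new ort.Tensor('float32', Float32Array.from(values), [1, values.length]);

  // Run inference; the keys of the feeds object must match the model's input names.
  const outputs = await session.run({ input });

  // Return the data of the first output tensor.
  const outputName = session.outputNames[0];
  return outputs[outputName].data as Float32Array;
}
```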
Solution 3: server-side inference through the backend (recommended for long inference)
This solution is recommended when inference is long-running (e.g. for generative tasks) and benefits from real-time monitoring. In that case, a data service on the Marcelle backend can be used to manage and monitor inference runs: predictions are requested by the web client, and the status of each run can be updated from Python over a websocket connection, enabling real-time monitoring in the web client.
Pros:
- Real-time monitoring: the Python code can update the run's status during inference
- Python scripts are websocket clients, so the machine running the Python code does not need to run a server or expose a public endpoint
Cons:
- Complex for simple use cases
- Experimental: stability is not ensured
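On the client side, the pattern could look roughly like the sketch below: the web client creates an inference run through a data service and listens for real-time updates while the Python script, connected over websockets, patches the run's status. The 'inference-runs' service name, the record fields, and the exact data-store calls are assumptions; refer to the Marcelle backend documentation for the actual interface.

```ts
import { dataStore } from '@marcellejs/core';

// Rough sketch only. Assumptions: the 'inference-runs' service name, the
// record fields (status, output), and a Feathers-style service API
// (create/patch + real-time events) exposed by the Marcelle backend.
const store = dataStore('http://localhost:3030');
const runs = store.service('inference-runs');

export async function requestInference(input: unknown): Promise<void> {
  // Create a run; the Python worker, connected as a websocket client,
  // is expected to pick it up and process it.
  const run = await runs.create({ input, status: 'pending' });

  // React to real-time updates pushed over the websocket connection as the
  // Python side patches the run's status and, eventually, its output.
  runs.on('patched', (updated: { id: string; status: string; output?: unknown }) => {
    if (updated.id === run.id) {
      console.log(`Run ${updated.id}: ${updated.status}`, updated.output);
    }
  });
}
```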
Using TensorFlow or Keras Models
There are three possible solutions to use a TensorFlow or Keras model in a Marcelle application.
TODO
Write docs
Solution 1: client-side inference with TensorFlow.js
In some cases, it can be useful to run the inference on the client side, to avoid, for instance, sending private client data to the server. It also simplifies the deployment of the application, as it is not necessary to run and maintain a web server that performs inference in real-time, and a static website might be enough.
For these scenarios, your TensorFlow model can be converted to a web-friendly format to run inference in the web browser using the TensorFlow.js library.
Pros:
- Privacy: no user data needs to be sent to the server for inference
- Low latency: once the model is loaded, predictions do not depend on the internet connection
- Easy deployment: no need to manage an inference server
Cons:
- Dependence on the user's device can limit performance
- Not appropriate for large models, both because of download size and in-browser performance
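As a rough illustration, the sketch below loads a converted Keras model with TensorFlow.js and runs a prediction in the browser. The model URL and input shape are placeholders; a model converted from a SavedModel would use tf.loadGraphModel instead.

```ts
import * as tf from '@tensorflow/tfjs';

// Minimal sketch of client-side inference with TensorFlow.js.
// '/models/model.json' and the [1, n] input shape are placeholders for the
// output of the tensorflowjs_converter step and your model's actual inputs.
export async function runTfjsInference(values: number[]): Promise<number[]> {
  // Load the converted model (a model.json file plus weight shards).
  const model = await tf.loadLayersModel('/models/model.json');

  // Build an input tensor with the shape the model expects.
  const input = tf.tensor2d([values], [1, values.length]);

  // Run the prediction and read the result back into a plain array.
  const output = model.predict(input) as tf.Tensor;
  const data = Array.from(await output.data());

  // Release the memory held by the tensors.
  input.dispose();
  output.dispose();
  return data;
}
```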
Solution 2: server-side inference with Ray
Another solution is to use a Python web framework or a serving library to expose the model at an HTTP endpoint. Several possibilities exist, including TensorFlow-specific libraries such as TFX's TensorFlow Serving or generic web frameworks such as Starlette.
We recommend using Ray Serve, a framework-agnostic model serving library for building online inference APIs. Ray enables you to expose your prediction function over an HTTP endpoint, which can be queried from a lightweight custom Marcelle component and seamlessly integrated into a Marcelle application.
Pros:
- High compatibility: it is possible to run any Python code, with any ML framework
- Scalability: Ray facilitates scaling and the use of various architectures
- Independent of the client's capabilities
Cons:
- Requires setting up and managing an HTTP server
- Requires sending client data to the server
Solution 3: server-side inference through the backend (recommended for long inference)
This solution is recommended when inference is long-running (e.g. for generative tasks) and benefits from real-time monitoring. In that case, a data service on the Marcelle backend can be used to manage and monitor inference runs: predictions are requested by the web client, and the status of each run can be updated from Python over a websocket connection, enabling real-time monitoring in the web client.
Pros:
- Real-time monitoring: the Python code can update the run's status during inference
- Python scripts are websocket clients, so the machine running the Python code does not need to run a server or expose a public endpoint
Cons:
- Complex for simple use cases
- Experimental: stability is not ensured
Using Scikit-Learn Models
There are two possible solutions to use a Scikit-Learn model in a Marcelle application.
Solution 1: server-side inference with Ray (recommended for most cases)
In most cases, the simplest solution is to use a Python web framework or a serving library to expose the model at an HTTP endpoint. Several possibilities exist, including generic web frameworks such as Starlette.
We recommend using Ray Serve, a framework-agnostic model serving library for building online inference APIs. Ray enables you to expose your prediction function over an HTTP endpoint, which can be queried from a lightweight custom Marcelle component and seamlessly integrated into a Marcelle application.
Pros:
- High compatibility: it is possible to run any Python code, with any ML framework
- Scalability: Ray facilitates scaling and the use of various architectures
- Independent of the client's capabilities
Cons:
- Requires setting up and managing an HTTP server
- Requires sending client data to the server
Solution 2: client-side inference with ONNX (recommended for small models)
In some cases, it can be useful to run the inference on the client side, to avoid, for instance, sending private client data to the server. It also simplifies the deployment of the application, as it is not necessary to run and maintain a web server that performs inference in real-time, and a static website might be enough.
For these scenarios, your Scikit-Learn model can be converted to the ONNX format (Open Neural Network Exchange), so that inference is performed in the web browser using onnxruntime.
Pros:
- Privacy: no user data needs to be sent to the server for inference
- Low latency: once the model is loaded, predictions do not depend on the internet connection, which is useful for high-throughput applications
- Easy deployment: no need to manage an inference server
Cons:
- Limited compatibility: ONNX Runtime Web remains experimental and not all operators are supported, which can prevent some models from being converted or executed in the browser
- Dependence on the user's device can limit performance
- Not appropriate for large models, both because of download size and in-browser performance
Using HuggingFace Models
There are four possible solutions to use a HuggingFace model in a Marcelle application.
TODO
Write docs
Solution 1: use a model hosted on Huggingface.co (recommended)
With this solution, the model remains hosted on the Hugging Face Hub, and the Marcelle application sends prediction requests directly to Hugging Face's hosted inference service, so you do not need to deploy or maintain your own inference server.
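Below is a minimal sketch of querying a hosted model over HTTP through Hugging Face's Inference API. The model id and payload are examples; most text models accept a JSON body of the form `{ inputs: ... }`, and a user access token is required for private models or higher rate limits.

```ts
// Minimal sketch: querying a model hosted on the Hugging Face Hub through the
// hosted Inference API. The model id and the { inputs } payload are examples;
// check the model card for the exact input/output format of your model.
const HF_API_URL = 'https://api-inference.huggingface.co/models';

export async function queryHostedModel(
  modelId: string,
  inputs: string,
  token?: string
): Promise<unknown> {
  const response = await fetch(`${HF_API_URL}/${modelId}`, {
    method: 'POST',
    headers: {
      'Content-Type': 'application/json',
      ...(token ? { Authorization: `Bearer ${token}` } : {}),
    },
    body: JSON.stringify({ inputs }),
  });
  if (!response.ok) {
    throw new Error(`Hugging Face Inference API error: ${response.status}`);
  }
  return response.json();
}

// Usage:
// const result = await queryHostedModel(
//   'distilbert-base-uncased-finetuned-sst-2-english',
//   'Marcelle makes ML prototyping fun!'
// );
```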
Solution 2: client-side inference with Transformers.js (recommended for small transformers)
In some cases, it can be useful to run the inference on the client side, to avoid, for instance, sending private client data to the server. It also simplifies the deployment of the application, as it is not necessary to run and maintain a web server that performs inference in real-time, and a static website might be enough.
For these scenarios, your HuggingFace Transformer model can be converted to a web-friendly format to run inference in the web browser using the Transformers.js library.
Pros:
- Privacy: no user data needs to be sent to the server for inference
- Low latency: once the model is loaded, predictions do not depend on the internet connection
- Easy deployment: no need to manage an inference server
Cons:
- Dependence on the user's device can limit performance
- Not appropriate for large models, both because of download size and in-browser performance
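For example, here is a minimal sketch using the Transformers.js pipeline API; the task and model id are examples, and the library downloads and caches the converted weights directly in the browser.

```ts
import { pipeline } from '@xenova/transformers';

// Minimal sketch of client-side inference with Transformers.js.
// The task and model id are examples; see the Transformers.js documentation
// for the list of supported tasks and pre-converted models.
export async function classifySentiment(text: string) {
  // Loading the pipeline downloads and caches the model weights in the browser.
  const classifier = await pipeline(
    'sentiment-analysis',
    'Xenova/distilbert-base-uncased-finetuned-sst-2-english'
  );

  // Run inference entirely on the client.
  // Returns something like [{ label: 'POSITIVE', score: 0.99 }].
  return classifier(text);
}
```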
Solution 3: server-side inference with Ray
Another solution is to use a Python web framework or a serving library to expose the model at an HTTP endpoint. Several possibilities exist, including generic web frameworks such as Starlette.
We recommend using Ray Serve, a framework-agnostic model serving library for building online inference APIs. Ray enables you to expose your prediction function over an HTTP endpoint, which can be queried from a lightweight custom Marcelle component and seamlessly integrated into a Marcelle application.
Pros:
- High compatibility: it is possible to run any Python code, with any ML framework
- Scalability: Ray facilitates scaling and the use of various architectures
- Independent of the client's capabilities
Cons:
- Requires setting up and managing an HTTP server
- Requires sending client data to the server
Solution 4: server-side inference through the backend (recommended for long inference)
This solution is recommended when inference is long-running (e.g. for generative tasks) and benefits from real-time monitoring. In that case, a data service on the Marcelle backend can be used to manage and monitor inference runs: predictions are requested by the web client, and the status of each run can be updated from Python over a websocket connection, enabling real-time monitoring in the web client.
Pros:
- Real-time monitoring: the Python code can update the run's status during inference
- Python scripts are websocket clients, so the machine running the Python code does not need to run a server or expose a public endpoint
Cons:
- Complex for simple use cases
- Experimental: stability is not ensured
Using Machine Learning Models implemented in Python
If your model was developed using Python, there are two options.
Solution 1: server-side inference with Ray (recommended for most cases)
In most cases, the simplest solution is to use a Python web framework or a serving library to expose the model at an HTTP endpoint. Several possibilities exist, including generic web frameworks such as Starlette.
We recommend using Ray Serve, a framework-agnostic model serving library for building online inference APIs. Ray enables you to expose your prediction function over an HTTP endpoint, which can be queried from a lightweight custom Marcelle component and seamlessly integrated into a Marcelle application.
Pros:
- High compatibility: it is possible to run any Python code, with any ML framework
- Scalability: Ray facilitates scaling and the use of various architectures
- Independent of the client's capabilities
Cons:
- Requires setting up and managing an HTTP server
- Requires sending client data to the server
Solution 2: server-side inference through the backend (recommended for long inference)
This solution is recommended when inference is long-running (e.g. for generative tasks) and benefits from real-time monitoring. In that case, a data service on the Marcelle backend can be used to manage and monitor inference runs: predictions are requested by the web client, and the status of each run can be updated from Python over a websocket connection, enabling real-time monitoring in the web client.
Pros:
- Real-time monitoring: the Python code can update the run's status during inference
- Python scripts are websocket clients, so the machine running the Python code does not need to run a server or expose a public endpoint
Cons:
- Complex for simple use cases
- Experimental: stability is not ensured
Using Machine Learning Models implemented in JavaScript
If your model was developed using a JavaScript library, you can integrate it directly into your Marcelle application, for instance by wrapping it in a custom component.
TODO
Tutorial and example with MLJS?
Good for you!
Train your model and come back to the start.