Profiling different implementations of attention modules


Placeholder for a blog post logging the results of profiling vLLM.

Initial observation: huge overhead when the number of requests in the queue is large.
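As a sketch of the kind of comparison this post is meant to log, the snippet below times two illustrative attention implementations: a naive one that materializes the full score matrix, and a blockwise one that uses an online softmax (the idea behind memory-efficient kernels like the ones vLLM builds on). All function names here are my own for illustration; this is not vLLM's actual API, and real profiling would use the CUDA kernels rather than NumPy.

```python
import time
import numpy as np

def naive_attention(q, k, v):
    # Materializes the full (L, L) score matrix: O(L^2) memory.
    scores = q @ k.T / np.sqrt(q.shape[-1])
    scores -= scores.max(axis=-1, keepdims=True)  # numerical stability
    w = np.exp(scores)
    w /= w.sum(axis=-1, keepdims=True)
    return w @ v

def blockwise_attention(q, k, v, block=64):
    # Processes keys/values in blocks with a running (online) softmax,
    # never materializing the full score matrix.
    L, d = q.shape
    out = np.zeros_like(v, dtype=np.float64)
    m = np.full(L, -np.inf)   # running row-wise max of scores
    s = np.zeros(L)           # running softmax denominator
    for start in range(0, k.shape[0], block):
        kb, vb = k[start:start + block], v[start:start + block]
        scores = q @ kb.T / np.sqrt(d)
        m_new = np.maximum(m, scores.max(axis=-1))
        scale = np.exp(m - m_new)          # rescale previous partial sums
        w = np.exp(scores - m_new[:, None])
        out = out * scale[:, None] + w @ vb
        s = s * scale + w.sum(axis=-1)
        m = m_new
    return out / s[:, None]

def profile(fn, *args, repeats=5):
    # Best-of-N wall-clock timing in seconds.
    best = float("inf")
    for _ in range(repeats):
        t0 = time.perf_counter()
        fn(*args)
        best = min(best, time.perf_counter() - t0)
    return best

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    q, k, v = rng.standard_normal((3, 512, 64))
    print("naive:    ", profile(naive_attention, q, k, v))
    print("blockwise:", profile(blockwise_attention, q, k, v))
```

Both implementations produce the same output; the interesting part for a profiling post is how their time and memory scale with sequence length and, in a serving context, with the number of queued requests.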
