
make install insufficient for running llama3-8B-Instruct #484

Open
fozziethebeat opened this issue May 22, 2024 · 4 comments

Labels
documentation Improvements or additions to documentation
@fozziethebeat

System Info

lorax-launcher-env output:

Target: x86_64-unknown-linux-gnu
Cargo version: 1.74.0
Commit sha: 97ede5207a4eeb5a9a03dea33b0fb472b762496d

cargo version output:

cargo 1.74.0 (ecb9851af 2023-10-18)

Model being used: meta-llama/Meta-Llama-3-70B-Instruct

GPUs: 8x A100 on CoreWeave (can't get more details since I accidentally broke my NVIDIA setup).

CUDA is 12.2, I believe.

Information

  • Docker
  • The CLI directly

Tasks

  • An officially supported command
  • My own modifications

Reproduction

  1. Clone Locally
  2. Run make install
  3. Run lorax-launcher --model-id meta-llama/Meta-Llama-3-70B-Instruct --port 8080

The initial failure reports that the dropout_layer_norm module can't be found.
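For reference, a quick way to see which of the optional CUDA kernel modules are importable in the current environment. This is just a diagnostic sketch: the module names are partly assumptions, taken from the error above, the server Makefile targets, and the punica_kernels path mentioned later in this issue.

```bash
# Diagnostic sketch: check which optional kernel modules the Python env can import.
# Module names are assumptions based on the errors in this issue and the
# server Makefile targets, not an official list.
for mod in dropout_layer_norm flash_attn vllm punica_kernels; do
    python -c "import ${mod}" 2>/dev/null \
        && echo "OK      ${mod}" \
        || echo "MISSING ${mod}"
done
```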

From reading the Docker instructions, I believe the full installation is something like:

  1. clone locally
  2. cd lorax
  3. make install
  4. cd server
  5. make install-flash-attn
  6. make install-flash-attn-v2
  7. make install-vllm

However, when doing this, the install-vllm step ran into an issue: it expects torch==2.2.1, while make install actually runs pip install torch==2.2.0, which breaks the vLLM step.
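For completeness, here is that full sequence written out as one script. The torch re-pin before the kernel builds is only a guess at a workaround for the 2.2.0 vs 2.2.1 mismatch, not a documented step.

```bash
# Sketch of the full from-source install, pieced together from the steps above
# and the Dockerfile. The torch re-pin before install-vllm is an assumed
# workaround for the 2.2.0 vs 2.2.1 mismatch, not an officially documented step.

# 1. clone the repo locally and enter it (URL omitted; it is the repo this issue lives in)
cd lorax

# 2. base install (pins torch==2.2.0)
make install

# 3. optional CUDA kernels, built from the server directory
cd server
pip install torch==2.2.1   # assumed workaround: match the version install-vllm expects
make install-flash-attn
make install-flash-attn-v2
make install-vllm
```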

Expected behavior

The following steps work successfully:

  1. Clone Locally
  2. Run make install
  3. Run lorax-launcher --model-id meta-llama/Meta-Llama-3-70B-Instruct --port 8080

Alternatively, step 2 could be something like make install-comprehensive, which would pull in the full vLLM and flash-attention set of dependencies.

@fozziethebeat (Author)

I'll note that the Docker install worked perfectly. I just happen to be testing in an environment where I can't run Docker.

@fozziethebeat (Author)

I successfully got everything working by installing all of the low-level libraries.

Further, I found that some LoRAs trigger a code path that depends on punica_kernels.sgmv_cutlass_tmp_size in all cases. Supporting that required a few additional installation steps:

  1. At the repo root: git submodule sync, then git submodule update --init
  2. cd server/punica_modules
  3. python setup.py install

After that, my rank-256 LoRA ran successfully.
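For anyone repeating this, the same steps as a single sequence. The final import check is only a sanity-check sketch I'm adding here; the module and attribute names come from the error path mentioned above, not from official docs.

```bash
# Sketch of the Punica kernel build steps above, run from the repo root.
git submodule sync
git submodule update --init
cd server/punica_modules
python setup.py install

# Sanity check (assumed names, taken from the error path above):
python -c "import punica_kernels; print(hasattr(punica_kernels, 'sgmv_cutlass_tmp_size'))"
```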

@tgaddair added the documentation label (Improvements or additions to documentation) on May 23, 2024
@magdyksaleh (Collaborator)

Need to update the docs to reflect the steps you took to get it to work. Are you blocked on anything?

@magdyksaleh self-assigned this on May 23, 2024
@fozziethebeat (Author)

Not blocked, but updated docs (or a unified install target) would help a lot.

Now that I've figured it out (a lot of it came from semi-copying the Dockerfile), I've unblocked myself, but I imagine others who want to repeat these steps will probably stumble. I'll definitely try to avoid having to redo this, though.
