bigcode/the-stack-dedup
Viewer • Updated • 237M • 12k • 396
How to use Vipitis/santacoder-finetuned-the-stack-glsl with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-generation", model="Vipitis/santacoder-finetuned-the-stack-glsl", trust_remote_code=True) # Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("Vipitis/santacoder-finetuned-the-stack-glsl", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("Vipitis/santacoder-finetuned-the-stack-glsl", trust_remote_code=True)How to use Vipitis/santacoder-finetuned-the-stack-glsl with vLLM:
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "Vipitis/santacoder-finetuned-the-stack-glsl"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "Vipitis/santacoder-finetuned-the-stack-glsl",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'docker model run hf.co/Vipitis/santacoder-finetuned-the-stack-glsl
How to use Vipitis/santacoder-finetuned-the-stack-glsl with SGLang:
# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
--model-path "Vipitis/santacoder-finetuned-the-stack-glsl" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "Vipitis/santacoder-finetuned-the-stack-glsl",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'docker run --gpus all \
--shm-size 32g \
-p 30000:30000 \
-v ~/.cache/huggingface:/root/.cache/huggingface \
--env "HF_TOKEN=<secret>" \
--ipc=host \
lmsysorg/sglang:latest \
python3 -m sglang.launch_server \
--model-path "Vipitis/santacoder-finetuned-the-stack-glsl" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "Vipitis/santacoder-finetuned-the-stack-glsl",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'How to use Vipitis/santacoder-finetuned-the-stack-glsl with Docker Model Runner:
docker model run hf.co/Vipitis/santacoder-finetuned-the-stack-glsl
Santacoder finetuned on The-Stack-dedup (GLSL subset) for 1000 steps with a batch size of 2 and full sequence length of 2048. adapted finetuning script found here
python3 train.py --model_path "bigcode/santacoder" \
--dataset_name "bigcode/the-stack-dedup" \
--subset "data/glsl" \
--data_column "content" \
--split "train" \
--seq_length 2048 \
--max_steps 1000 \
--batch_size 2 \
--gradient_accumulation_steps 4 \
--learning_rate 5e-5 \
--num_warmup_steps 100 \
--eval_freq 100 \
--save_freq 100 \
--log_freq 1 \
--output_dir "checkpoint_dir" \
--no_fp16
Main purpose of this model is to explore if finetuning models improves performance on ShaderEval, which reached 0.380 with 300 samples.
License carried over from model, and the finetuning dataset holds the same license.
Base model
bigcode/santacoder