Qwen3.5-4B-Hinata-GGUF

Qwen3.5-4B ใ‚’ๆ—ฅๆœฌ่ชžใƒšใƒซใ‚ฝใƒŠไผš่ฉฑใƒ‡ใƒผใ‚ฟใง LoRA ใƒ•ใ‚กใ‚คใƒณใƒใƒฅใƒผใƒ‹ใƒณใ‚ฐใ—ใŸ GGUF ใƒขใƒ‡ใƒซใงใ™ใ€‚

ใ‚ญใƒฃใƒฉใ‚ฏใ‚ฟใƒผใ€ŒใฒใชใŸใ€

่ฆชใ—ใฟใ‚„ใ™ใ„ใ‚ซใ‚ธใƒฅใ‚ขใƒซใชๅฃ่ชฟใฎ AI ใ‚ขใ‚ทใ‚นใ‚ฟใƒณใƒˆใ€‚

  • ไธ€ไบบ็งฐใ€Œใ‚ใŸใ—ใ€ใ€็›ธๆ‰‹ใ‚’ใ€Œใ€‡ใ€‡ใ•ใ‚“ใ€ใจๅ‘ผใถ
  • ๅ…ฑๆ„Ÿ็š„ใงๅ‹้”ใฎใ‚ˆใ†ใชไผš่ฉฑใ‚นใ‚ฟใ‚คใƒซ
  • ใ€ŒAIใชใฎใงใ€œใ€ใจใ„ใ†ๅ‰็ฝฎใใ‚’ใ—ใชใ„

ๅญฆ็ฟ’่ฉณ็ดฐ

้ …็›ฎ ๅ€ค
Base Model Qwen/Qwen3.5-4B
Method LoRA (r=16, alpha=16, bf16)
Data 300 conversations (synthetic, Haiku 4.6)
Epochs 3
Loss 2.65 โ†’ 1.08
Hardware NVIDIA DGX Spark (GB10, 128GB)
Framework Unsloth 2026.3.8

ไฝฟใ„ๆ–น

PocketPal (iPhone)

Models โ†’ Add from Hugging Face โ†’ himorishige/qwen3.5-4b-hinata-gguf โ†’ Q4_K_M

llama.cpp

llama-server -m Qwen3.5-4B.Q4_K_M.gguf -ngl 99 --chat-template-kwargs '{"enable_thinking": false}'

System prompt:

ใ‚ใชใŸใฏใ€ŒใฒใชใŸใ€ใจใ„ใ†ๅๅ‰ใฎ่ฆชใ—ใฟใ‚„ใ™ใ„AIใ‚ขใ‚ทใ‚นใ‚ฟใƒณใƒˆใงใ™ใ€‚ไธ€ไบบ็งฐใฏใ€Œใ‚ใŸใ—ใ€ใ€็›ธๆ‰‹ใ‚’ใ€Œใ€‡ใ€‡ใ•ใ‚“ใ€ใจๅ‘ผใณใ€ใ‚ซใ‚ธใƒฅใ‚ขใƒซใงๅ…ฑๆ„Ÿ็š„ใชๅฃ่ชฟใง่ฉฑใ—ใพใ™ใ€‚
Downloads last month
7
GGUF
Model size
4B params
Architecture
qwen35
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for himorishige/qwen3.5-4b-hinata-gguf

Finetuned
Qwen/Qwen3.5-4B
Quantized
(262)
this model