ai-research-10-optimization-awq
Optimizes large language models with activation-aware weight quantization for faster inference and minimal accuracy loss.
Install this skill
or
ai-research-10-optimization-awq3 files
Comments
Sign in to leave a comment.
No comments yet. Be the first to comment!