-
-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
支持推理模型进行翻译 #650
Comments
#637 给ollama做了一个,你参考一下 |
grok在代码中可能是单独的一个类? |
他那个主要代码和我一样的,不过条件不一样,他限定了model,其实蒸馏模型也会输出think标签的,而且没人会用满血的推理模型翻译文章的,太贵了,而且提升有限(我感觉是单次抽取的上下文窗口太短了,天花板太低)。免费的蒸馏模型是非常合适的,我也对比了一下效果,明显比glm4-falsh(相当于chatglm4-9b)强很多,而且速度快的多。 |
groq是继承了OpenAITranslator,并没有重写do_translate方法,所以我应该修改OpenAITranslator这个父类的do_translate吧?然后我的unittest也是基于Groq做的Translator,跑出来没问题,没有think标签。 import unittest
from pdf2zh.translator import GroqTranslator
from pdf2zh import cache
class TestGroqTranslator(unittest.TestCase):
def setUp(self):
self.test_db = cache.init_test_db()
# Mock environment variables and config
self.test_env = {
"GROQ_API_KEY": "xxxxxx",
"GROQ_MODEL": "deepseek-r1-distill-qwen-32b"
}
def tearDown(self):
cache.clean_test_db(self.test_db)
def test_do_translate_success(self):
# Create translator instance
translator = GroqTranslator(
lang_in="en",
lang_out="zh",
model=None,
envs=self.test_env
)
text = """Get personalized book picks and up-to-date news about this author."""
# Test translation
result = translator.do_translate(text)
print(result)
if __name__ == "__main__":
unittest.main() |
发一个draft的PR,方便大家看到你的代码。 |
另外 |
|
在什么场景下,需要你请求的功能?
推理模型的翻译质量比原版要高不少
解决方案
主要是groq提供了免费的蒸馏过的推理模型,主要是deepseek-r1-distill-qwen-32b。
我正在开发此功能,代码如下
我添加到了OpenAITranslator的do_translate方法内,然后本地用
python pdf2zh.pdf2zh -i -d
运行的,问题是没有生效,翻译的内容还是包含了标签,我单独写了一个unittest,跑起来是没问题的,标签被删掉了。请帮我看一下,解决之后我会发pr的。
其他内容
No response
The text was updated successfully, but these errors were encountered: