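In the opening year of the 15th Five-Year Plan, we must stay anchored to our goals and tasks, implement the new development philosophy fully, accurately, and comprehensively, promote coordinated economic and social development, and safeguard and improve people's livelihoods through development. The aim is to make livelihoods warmer and happiness more tangible, to make the momentum of development stronger and its quality higher, and to write a new chapter for the era in which high-quality development and improved livelihoods reinforce each other.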
00:27, 8 March 2026 | World
Logging memory usage, it seems like it starts the forward pass, memory starts increasing on GPU 0, and then it OOMs. I wonder if it’s trying to be smart, planning ahead and dequantizing multiple layers at a time. Dequantizing each layer uses ~36 GB of memory, so if it were doing this, that could cause it to use too much memory. Maybe putting each layer on alternating GPUs would help.
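To make the alternating-GPU idea concrete, here is a minimal sketch in plain Python. It only builds a layer-to-GPU mapping and does the back-of-the-envelope memory arithmetic; it does not load or run a model. The module prefix `model.layers`, the two-GPU default, and both helper names are hypothetical. The ~36 GB figure is the per-layer estimate from above. Whether the loader actually dequantizes several layers ahead of time is still only a guess, so `layers_in_flight` is an assumption to vary, not a measured value.

```python
def alternating_device_map(num_layers, num_gpus=2, prefix="model.layers"):
    """Assign layer i to GPU (i % num_gpus), so neighbouring layers sit on different GPUs.

    `prefix` is a hypothetical module path; adjust it to the real model's layer names.
    """
    if num_layers < 0 or num_gpus < 1:
        raise ValueError("num_layers must be >= 0 and num_gpus must be >= 1")
    return {f"{prefix}.{i}": i % num_gpus for i in range(num_layers)}


def peak_dequant_gb_per_gpu(layers_in_flight, num_gpus=2, gb_per_layer=36.0):
    """Worst-case transient dequantization memory on a single GPU.

    Assumes `layers_in_flight` consecutive layers are dequantized at the same
    time and that layers alternate across GPUs as in alternating_device_map.
    """
    # Ceiling division: the busiest GPU holds ceil(layers_in_flight / num_gpus) layers.
    layers_on_busiest_gpu = -(-layers_in_flight // num_gpus)
    return layers_on_busiest_gpu * gb_per_layer


if __name__ == "__main__":
    print(alternating_device_map(4))
    # Two layers in flight: everything on one GPU vs. alternating across two.
    print(peak_dequant_gb_per_gpu(2, num_gpus=1))
    print(peak_dequant_gb_per_gpu(2, num_gpus=2))
```

Under these assumptions, if two consecutive layers are dequantized at once, alternating halves the transient load on GPU 0 (one layer's worth instead of two). If only one layer is ever dequantized at a time, alternating changes nothing about the peak, which would point to a different cause for the OOM.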
The analysis got put on ice for a month or two because I was very busy at work and dealing with some other higher-priority stuff, but yesterday I came across the bad TCXO while cleaning up the lab and decided I had a bit of time to spend poking at it. (That’s a risk you take when you send samples to a back-room FA lab: I might forget about them for a while…)