songsenand
  • Joined on 2024-01-17
songsenand pushed to main at songsenand/SUimeModelTraner 2026-04-07 08:01:26 +08:00
d14fd09f41 feat(reverse_proxy): 添加 Apache 和 Nginx 反向代理配置支持 WebSocket 和 CORS
songsenand pushed to main at songsenand/SUimeModelTraner 2026-04-06 22:53:38 +08:00
3da8ae8876 feat(docs): 添加HTTP静态文件服务与远程监控说明
songsenand pushed to main at songsenand/SUimeModelTraner 2026-04-06 12:29:33 +08:00
2f0166c8ce feat: 添加模型权重检查与推理调试工具脚本
songsenand pushed to main at songsenand/SUimeModelTraner 2026-04-06 06:13:09 +08:00
a203e67aff fix(trainer): 修正总训练步数计算逻辑以支持多轮训练
songsenand pushed to main at songsenand/SUimeModelTraner 2026-04-06 06:06:12 +08:00
6ad003133c fix(dataset): 添加异常捕获防止标签生成失败
songsenand pushed to main at songsenand/SUimeModelTraner 2026-04-05 23:19:21 +08:00
493bfdec1a refactor(MoELayer): 并行化前向传播以兼容 torch.compile 和 AMP
songsenand pushed to main at songsenand/SUimeModelTraner 2026-04-05 22:17:00 +08:00
7143896f4d fix: 启用 TensorFloat32 加速矩阵乘法并解决 UserWarning
songsenand pushed to main at songsenand/SUimeModelTraner 2026-04-05 22:06:49 +08:00
59bb29e4fd feat(benchmark): 添加性能基准测试脚本用于诊断模型训练瓶颈
songsenand pushed to main at songsenand/SUimeModelTraner 2026-04-05 19:38:41 +08:00
c0489e538c feat(docs): 添加基于JSON旁路记录法的移动端监控方案文档
songsenand pushed to main at songsenand/SUimeModelTraner 2026-04-05 10:38:25 +08:00
c31ec3990f fix(trainer): 添加键盘中断处理以保存训练进度
songsenand pushed to main at songsenand/SUimeModelTraner 2026-04-05 10:30:03 +08:00
f838ec9b22 docs: 更新 README 中代码示例和训练说明
songsenand pushed to main at songsenand/SUimeModelTraner 2026-04-05 08:08:31 +08:00
1e9f1e8bd6 fix(dependency): 降级 torch 版本要求至 2.10.0
songsenand pushed to main at songsenand/SUimeModelTraner 2026-04-05 07:54:36 +08:00
310a926c98 refactor(trainer): 优化检查点保存逻辑以支持定期覆盖保存
songsenand pushed to main at songsenand/SUimeModelTraner 2026-04-05 01:36:07 +08:00
3e529d805f fix(trainer): 调整模型保存频率以避免频繁写盘
songsenand pushed to main at songsenand/SUimeModelTraner 2026-04-05 00:59:03 +08:00
369424be28 feat(trainer): 添加GPU可用性日志并调整pinyin_ids处理逻辑
songsenand pushed to main at songsenand/SUimeModelTraner 2026-04-05 00:32:26 +08:00
b3055656d1 fix(trainer): 检测GPU可用性并提示用户退回CPU训练
songsenand pushed to main at songsenand/SUimeModelTraner 2026-04-05 00:17:26 +08:00
4823afcd1f fix(dependencies): 移除不必要的开发依赖项
songsenand pushed to main at songsenand/SUimeModelTraner 2026-04-05 00:08:45 +08:00
69349a88a6 feat(train): 添加训练脚本并重构模型输入处理逻辑
songsenand pushed to main at songsenand/SUimeModelTraner 2026-04-03 17:04:58 +08:00
1af85a36bc feat: 更新输入法模型架构设计文档并重构核心组件代码
songsenand pushed to main at songsenand/SUimeModelTraner 2026-04-03 08:19:54 +08:00
fd49058764 feat(dataset): 添加四段式文本编码方法并优化拼音处理逻辑