songsenand
  • Joined on 2024-01-17
songsenand pushed to main at songsenand/SUInput 2026-02-26 00:58:27 +08:00
dc718cde5b fix(dataset): 添加 token_type_ids 到 collate 函数的 hint 字段
songsenand pushed to main at songsenand/SUInput 2026-02-26 00:48:34 +08:00
7c90633ebc refactor(model): 使用注意力池化替换 span pooling 并支持 token_type_ids
songsenand pushed to main at songsenand/SUInput 2026-02-25 16:56:43 +08:00
93dced50c7 feat(model): 更新模型结构,使用 GELU 激活函数并优化专家网络参数
songsenand pushed to main at songsenand/SUInput 2026-02-24 01:06:17 +08:00
db90516fcf fix(encoder): 修复 encoder 调用时缺少 src_key_padding_mask 参数
songsenand pushed to main at songsenand/SUInput 2026-02-24 00:53:01 +08:00
b1f78668dc fix(model): 修正池化层输入源以确保正确计算特征向量
songsenand pushed to main at songsenand/SUInput 2026-02-24 00:49:04 +08:00
be6b686bd1 refactor(model): 移除 encoder 的 padding_mask 参数调用
songsenand pushed to main at songsenand/SUInput 2026-02-24 00:10:38 +08:00
63efc49aa6 feat(model): 添加训练完成通知功能,通过ServerChan发送微信消息
songsenand pushed to main at songsenand/autocommit 2026-02-23 22:52:35 +08:00
058c09698f docs(config): 优化 commit message 生成提示内容
songsenand pushed to main at songsenand/SUInput 2026-02-23 22:40:52 +08:00
019fa2d23d feat(dataset): 优化拼音处理逻辑并增强代码注释
songsenand pushed to main at songsenand/SUInput 2026-02-22 15:36:20 +08:00
a82279b02a 添加拼音丢弃率参数并根据该参数决定是否丢弃拼音
songsenand pushed to main at songsenand/SUInput 2026-02-22 15:20:12 +08:00
398155721d feat: 调整拼音输入数据集处理逻辑及模型结构参数
songsenand pushed to main at songsenand/SUInput 2026-02-22 13:00:45 +08:00
350cab20c5 调整拼音分组及模型参数以优化性能
songsenand pushed to main at songsenand/SUInput 2026-02-22 12:16:46 +08:00
5857c90be7 重构代码结构并优化注释格式
songsenand pushed to main at songsenand/SUInput 2026-02-22 11:02:26 +08:00
3bb44f1d73 feat(model): 添加共享残差块并调整池化层输出维度
songsenand pushed to main at songsenand/SUInput 2026-02-22 10:20:38 +08:00
96706abb93 feat(model): 修改池化层输出维度并添加线性变换层
songsenand pushed to main at songsenand/SUInput 2026-02-22 09:31:07 +08:00
fc71124484 feat(suinput): 引入拼音分组配置并优化上下文采样逻辑
songsenand pushed to main at songsenand/SUInput 2026-02-21 22:03:08 +08:00
2219f6530d 更新评估数据集样本文件
songsenand pushed to main at songsenand/SUInput 2026-02-21 22:01:45 +08:00
51f9ddbc70 修复拼音组处理逻辑,避免未处理拼音导致的索引错误
songsenand pushed to main at songsenand/SUInput 2026-02-21 21:56:05 +08:00
8f58917d13 调整拼音分组与采样逻辑,优化模型结构及专家路由策略
songsenand pushed to main at songsenand/SUInput 2026-02-21 00:56:23 +08:00
917c9f4256 调整数据采样逻辑以提升模型训练效果