This website requires JavaScript.
Explore
Help
Sign In
songsenand
0 Followers
·
0 Following
Joined on
2024-01-17
Repositories
12
Projects
Packages
Public Activity
Starred Repositories
songsenand
pushed to
main
at
songsenand/autocommit
2026-02-23 22:52:35 +08:00
058c09698f
docs(config): 优化 commit message 生成提示内容
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-23 22:40:52 +08:00
019fa2d23d
feat(dataset): 优化拼音处理逻辑并增强代码注释
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-22 15:36:20 +08:00
a82279b02a
添加拼音丢弃率参数并根据该参数决定是否丢弃拼音
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-22 15:20:12 +08:00
398155721d
feat: 调整拼音输入数据集处理逻辑及模型结构参数
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-22 13:00:45 +08:00
350cab20c5
调整拼音分组及模型参数以优化性能
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-22 12:16:46 +08:00
5857c90be7
重构代码结构并优化注释格式
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-22 11:02:26 +08:00
3bb44f1d73
feat(model): 添加共享残差块并调整池化层输出维度
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-22 10:20:38 +08:00
96706abb93
feat(model): 修改池化层输出维度并添加线性变换层
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-22 09:31:07 +08:00
fc71124484
feat(suinput): 引入拼音分组配置并优化上下文采样逻辑
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-21 22:03:08 +08:00
2219f6530d
更新评估数据集样本文件
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-21 22:01:45 +08:00
51f9ddbc70
修复拼音组处理逻辑,避免未处理拼音导致的索引错误
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-21 21:56:05 +08:00
8f58917d13
调整拼音分组与采样逻辑,优化模型结构及专家路由策略
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-21 00:56:23 +08:00
917c9f4256
调整数据采样逻辑以提升模型训练效果
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-20 23:30:44 +08:00
17324ffa10
修复初始步骤损失计算逻辑
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-20 23:28:47 +08:00
4560a9ed06
移除 global_step 自增逻辑并调整至循环末尾
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-20 23:21:46 +08:00
558d7f9fc9
调整模型结构及参数以优化性能
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-16 10:27:01 +08:00
ae414bae6b
feat(trainer): 添加残差块以增强模型表达能力
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-15 23:01:58 +08:00
ab2dbc378b
修复损失权重计算逻辑,修正平方根次数以提升稳定性
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-15 21:51:39 +08:00
cd25349d90
删除旧的 MoE 模型文件
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-15 01:48:50 +08:00
0d529c0c89
调整损失权重计算并优化训练循环终止条件
First
Previous
...
4
5
6
7
8
...
Next
Last