This website requires JavaScript.
Explore
Help
Sign In
songsenand
0 Followers
·
0 Following
Joined on
2024-01-17
Repositories
12
Projects
Packages
Public Activity
Starred Repositories
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-26 23:03:57 +08:00
4031a668da
feat(model): 添加 tensorboard 依赖并重构训练监控逻辑
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-26 14:36:46 +08:00
43c8349d51
fix(trainer): 移除固定总步数,使用实际停止批次计算 warmup 步数
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-26 14:30:51 +08:00
1178f87713
fix(dataset): 添加6%概率返回None以增强数据多样性
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-26 14:14:03 +08:00
dfcce1f1ed
feat(dataset): 调整拼音输入数据集的采样和处理逻辑以提升效果
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-26 01:19:35 +08:00
66c2f78dda
fix(model): 移除梯度 NaN 检查,直接执行优化器步骤
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-26 01:00:37 +08:00
b0a4ce9ac8
fix(model): 修正评估损失计算以避免除零错误
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-26 00:58:27 +08:00
dc718cde5b
fix(dataset): 添加 token_type_ids 到 collate 函数的 hint 字段
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-26 00:48:34 +08:00
7c90633ebc
refactor(model): 使用注意力池化替换 span pooling 并支持 token_type_ids
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-25 16:56:43 +08:00
93dced50c7
feat(model): 更新模型结构,使用 GELU 激活函数并优化专家网络参数
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-24 01:06:17 +08:00
db90516fcf
fix(encoder): 修复 encoder 调用时缺少 src_key_padding_mask 参数
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-24 00:53:01 +08:00
b1f78668dc
fix(model): 修正池化层输入源以确保正确计算特征向量
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-24 00:49:04 +08:00
be6b686bd1
refactor(model): 移除 encoder 的 padding_mask 参数调用
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-24 00:10:38 +08:00
63efc49aa6
feat(model): 添加训练完成通知功能,通过ServerChan发送微信消息
songsenand
pushed to
main
at
songsenand/autocommit
2026-02-23 22:52:35 +08:00
058c09698f
docs(config): 优化 commit message 生成提示内容
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-23 22:40:52 +08:00
019fa2d23d
feat(dataset): 优化拼音处理逻辑并增强代码注释
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-22 15:36:20 +08:00
a82279b02a
添加拼音丢弃率参数并根据该参数决定是否丢弃拼音
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-22 15:20:12 +08:00
398155721d
feat: 调整拼音输入数据集处理逻辑及模型结构参数
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-22 13:00:45 +08:00
350cab20c5
调整拼音分组及模型参数以优化性能
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-22 12:16:46 +08:00
5857c90be7
重构代码结构并优化注释格式
songsenand
pushed to
main
at
songsenand/SUInput
2026-02-22 11:02:26 +08:00
3bb44f1d73
feat(model): 添加共享残差块并调整池化层输出维度
First
Previous
...
4
5
6
7
8
...
Next
Last