arxiv:2405.13929
Wortega PRO
AlexWortega
AI & ML interests
Ебать обучать больше чем ты
Recent Activity
new activity about 17 hours ago
AlexWortega/SIQ-1-35B:mtp missing tensor new activity about 17 hours ago
AlexWortega/SIQ-1-35B:context window length updated a Space 1 day ago
AlexWortega/same-data-different-losses