GassiGiuseppe
|
e76dbeb9a7
|
typo
|
2025-10-10 22:26:06 +02:00 |
|
GassiGiuseppe
|
96610612fe
|
Batcher added
|
2025-10-10 20:10:08 +02:00 |
|
Christian Risi
|
92ae40013d
|
Added a way to detach models and create them standalone
|
2025-10-10 18:43:20 +02:00 |
|
Christian Risi
|
15f203cad5
|
Added boe 16k tokens vocabulary
|
2025-10-10 18:43:02 +02:00 |
|
Christian Risi
|
31c8541dfb
|
Co-authored-by: GassiGiuseppe <GassiGiuseppe@users.noreply.github.com>
|
2025-10-10 16:28:09 +02:00 |
|
Christian Risi
|
bed9718f27
|
Added BPE small vocabulary
|
2025-10-10 11:40:39 +02:00 |
|
GassiGiuseppe
|
93865bee8a
|
typo
|
2025-10-09 22:26:17 +02:00 |
|
Christian Risi
|
1c0ddb8753
|
Merge branch 'dev.embedder' of https://repositories.communitynotfound.work/PoliBa-DeepLearning/NanoSocrates into dev.embedder
|
2025-10-09 22:23:36 +02:00 |
|
Christian Risi
|
51399f9dc9
|
commit of toy dataset with whole batch
|
2025-10-09 22:22:42 +02:00 |
|
GassiGiuseppe
|
d1ba4ae026
|
last update for collab
( we are gonna run it on a 100 yey)
|
2025-10-09 21:57:05 +02:00 |
|
Christian Risi
|
db0090981c
|
Merge branch 'dev.embedder' of https://repositories.communitynotfound.work/PoliBa-DeepLearning/NanoSocrates into dev.embedder
|
2025-10-09 21:53:45 +02:00 |
|
Christian Risi
|
e1c5649d67
|
updated to overfit over toy dataset
|
2025-10-09 21:53:42 +02:00 |
|
GassiGiuseppe
|
0bca241662
|
update environment yaml
|
2025-10-09 20:53:45 +02:00 |
|
GassiGiuseppe
|
005d7af6a0
|
lil update of requirements
|
2025-10-09 20:30:06 +02:00 |
|
GassiGiuseppe
|
9068db550e
|
Merge branch 'dev.embedder' of https://repositories.communitynotfound.work/PoliBa-DeepLearning/NanoSocrates into dev.embedder
|
2025-10-09 19:44:46 +02:00 |
|
GassiGiuseppe
|
d8f81e1a47
|
that god can have mercy upon us
|
2025-10-09 19:43:50 +02:00 |
|
Christian Risi
|
a67df9724e
|
Merge branch 'dev.embedder' of https://repositories.communitynotfound.work/PoliBa-DeepLearning/NanoSocrates into dev.embedder
|
2025-10-09 18:14:33 +02:00 |
|
Christian Risi
|
c5fd57d854
|
Updated train playground
|
2025-10-09 18:14:29 +02:00 |
|
GassiGiuseppe
|
ee253c39f4
|
Merge branch 'dev.embedder' of https://repositories.communitynotfound.work/PoliBa-DeepLearning/NanoSocrates into dev.embedder
|
2025-10-09 13:31:37 +02:00 |
|
GassiGiuseppe
|
2036b4015f
|
added logistic collector
|
2025-10-09 13:31:16 +02:00 |
|
Christian Risi
|
aac7675b30
|
Pipeline fix and added a util to decode
|
2025-10-09 13:24:48 +02:00 |
|
GassiGiuseppe
|
d2fdeb18a2
|
bla bla doctor
|
2025-10-09 12:41:47 +02:00 |
|
Christian Risi
|
f3b83eda3d
|
Rework
|
2025-10-09 11:37:46 +02:00 |
|
Christian Risi
|
0158db2dce
|
Fixed a bug where I took encoder embeddings rather than encoder output
|
2025-10-09 11:37:21 +02:00 |
|
Christian Risi
|
ba592c3480
|
Disabled Softmax
|
2025-10-09 11:36:56 +02:00 |
|
Christian Risi
|
1f9c30b531
|
Added Custom Learning Rate
|
2025-10-09 11:36:40 +02:00 |
|
GassiGiuseppe
|
b805dc538e
|
learning sheduler as torch one
|
2025-10-08 16:05:22 +02:00 |
|
Christian Risi
|
c2e13bc9c6
|
Quick fix to architecture
|
2025-10-08 12:34:09 +02:00 |
|
GassiGiuseppe
|
14c3914571
|
Merge branch 'dev.embedder' of https://repositories.communitynotfound.work/PoliBa-DeepLearning/NanoSocrates into dev.embedder
|
2025-10-08 12:14:05 +02:00 |
|
Christian Risi
|
b9273b95e2
|
Merge branch 'dev.embedder' of https://repositories.communitynotfound.work/PoliBa-DeepLearning/NanoSocrates into dev.embedder
|
2025-10-08 12:13:46 +02:00 |
|
GassiGiuseppe
|
c263e2cf13
|
Custom learning rate sheduler from Attention is all you need
|
2025-10-08 12:13:02 +02:00 |
|
GassiGiuseppe
|
c9a50d50b7
|
typo in Batcher
|
2025-10-08 11:39:08 +02:00 |
|
GassiGiuseppe
|
9b0c57c238
|
Batcher ended, attention it returns list of tokenId, which later needs to be embedded
|
2025-10-08 11:26:47 +02:00 |
|
Christian Risi
|
24ea4d3ba4
|
Added a training model for NanoSocrates
|
2025-10-08 11:18:05 +02:00 |
|
GassiGiuseppe
|
e353c200d7
|
updated special token
|
2025-10-08 11:02:18 +02:00 |
|
GassiGiuseppe
|
159266a603
|
WIP Batcher added class to fourth task
|
2025-10-08 00:39:16 +02:00 |
|
GassiGiuseppe
|
7027414342
|
Merge branch 'dev.embedder' of https://repositories.communitynotfound.work/PoliBa-DeepLearning/NanoSocrates into dev.embedder
|
2025-10-07 23:16:20 +02:00 |
|
GassiGiuseppe
|
fc44929a7b
|
moved spanned mask variables in init for better reliability, also tested
|
2025-10-07 23:15:50 +02:00 |
|
Christian Risi
|
0560bc439a
|
Merge branch 'dev.embedder' of https://repositories.communitynotfound.work/PoliBa-DeepLearning/NanoSocrates into dev.embedder
|
2025-10-07 20:45:10 +02:00 |
|
Christian Risi
|
8adacdb08c
|
Added new playgrounds
|
2025-10-07 20:45:04 +02:00 |
|
Christian Risi
|
533347ee22
|
Added new special token
|
2025-10-07 20:44:54 +02:00 |
|
Christian Risi
|
d1ff88da82
|
Added small dataset
|
2025-10-07 20:44:40 +02:00 |
|
Christian Risi
|
3f465991f0
|
Added toy dataset
|
2025-10-07 20:44:11 +02:00 |
|
GassiGiuseppe
|
96cbf4eabb
|
wip Batcher
|
2025-10-07 20:09:51 +02:00 |
|
GassiGiuseppe
|
f801afe0e4
|
Merge branch 'dev.embedder' of https://repositories.communitynotfound.work/PoliBa-DeepLearning/NanoSocrates into dev.embedder
|
2025-10-07 17:42:21 +02:00 |
|
GassiGiuseppe
|
b4ee8362a2
|
WIP training Batching
|
2025-10-07 17:41:53 +02:00 |
|
Christian Risi
|
3021a51961
|
Merge branch 'dev.embedder' of https://repositories.communitynotfound.work/PoliBa-DeepLearning/NanoSocrates into dev.embedder
|
2025-10-07 16:38:12 +02:00 |
|
Christian Risi
|
99b5198c9a
|
WIP
|
2025-10-07 16:38:08 +02:00 |
|
Christian Risi
|
b97282179d
|
Fixed a bug about sequence normalizations
|
2025-10-07 16:37:43 +02:00 |
|
Christian Risi
|
fdece42462
|
Made model Batch ready
|
2025-10-07 16:37:20 +02:00 |
|