Christian Risi | 4ca1d0a189 | Activated Dropout to avoid overfitting | 2025-10-12 12:28:06 +02:00
Christian Risi | 160b7dbfc0 | V0.0.1 Athene | 2025-10-11 19:35:43 +02:00
GassiGiuseppe | 443f54fffd | WIP decoder with prefix mask | 2025-10-11 15:31:43 +02:00
GassiGiuseppe | 5e3878ea17 | Merge branch 'dev' into dev.train | 2025-10-11 11:51:58 +02:00
Christian Risi | 625f79f7c3 | Fixed imports | 2025-10-11 11:18:44 +02:00
GassiGiuseppe | 96610612fe | Batcher added | 2025-10-10 20:10:08 +02:00
Christian Risi | 92ae40013d | Added a way to detach models and create them standalone | 2025-10-10 18:43:20 +02:00
Christian Risi | aac7675b30 | Pipeline fix and added a util to decode | 2025-10-09 13:24:48 +02:00
Christian Risi | ba592c3480 | Disabled Softmax | 2025-10-09 11:36:56 +02:00
Christian Risi | 1f9c30b531 | Added Custom Learning Rate | 2025-10-09 11:36:40 +02:00
Christian Risi | c2e13bc9c6 | Quick fix to architecture | 2025-10-08 12:34:09 +02:00
GassiGiuseppe | fc44929a7b | moved spanned mask variables in init for better reliability, also tested | 2025-10-07 23:15:50 +02:00
Christian Risi | 99b5198c9a | WIP | 2025-10-07 16:38:08 +02:00
Christian Risi | fdece42462 | Made model Batch ready | 2025-10-07 16:37:20 +02:00
Christian Risi | 109ad9f36b | Changed Imports | 2025-10-07 16:36:59 +02:00
Christian Risi | f9545aca1d | Deleted MultiHeadAttention | 2025-10-07 16:36:11 +02:00
GassiGiuseppe | 56d438f01a | WIP NanoSocratesCore | 2025-10-06 18:21:27 +02:00
GassiGiuseppe | e1549d4458 | Modified decoder and decoder for sequential architecture | 2025-10-06 18:20:46 +02:00
Christian Risi | e93710af08 | Fixed illegal tokens being added in target output | 2025-10-06 16:16:47 +02:00
Christian Risi | b1e7af0607 | Merge branch 'dev.embedder' of https://repositories.communitynotfound.work/PoliBa-DeepLearning/NanoSocrates into dev.embedder | 2025-10-06 15:55:44 +02:00
Christian Risi | c217f5dec9 | Added 2 types of masking | 2025-10-06 15:45:45 +02:00
Christian Risi | 49f0beb6ea | Updated imports | 2025-10-06 15:45:28 +02:00
GassiGiuseppe | 948c3fd7ac | update to batch attention mask | 2025-10-06 13:03:03 +02:00
GassiGiuseppe | 7e40a36701 | wip: NanoSocratesCore | 2025-10-05 22:58:06 +02:00
GassiGiuseppe | 0f243eaac2 | added padding_mask entry to decoder and encoder | 2025-10-05 18:46:06 +02:00
GassiGiuseppe | 6f219f634f | Added attention_mask | 2025-10-05 17:49:01 +02:00
Christian Risi | c60da8ba82 | Refactoring | 2025-10-05 15:40:29 +02:00