Christian Risi
|
ba3a718480
|
Merge branch 'dev.etl' into dev
|
2025-10-05 11:16:54 +02:00 |
|
GassiGiuseppe
|
69fba7c3e9
|
new utility to generate a csv debug file of the output of the pipeline
|
2025-10-04 21:33:09 +02:00 |
|
GassiGiuseppe
|
bbadd4c521
|
update cleaning pipeline with a new method to filter also by number of films,
also updated the signature of the pipeline
|
2025-10-04 19:00:05 +02:00 |
|
GassiGiuseppe
|
64e355e80c
|
Added regex to delete new lines and * from ObjectURI
|
2025-09-30 15:00:07 +02:00 |
|
GassiGiuseppe
|
8167c9d435
|
Added Toy Dataset entry point into the Pipeline class
Before it was forced into the sql_endpoint,
now all the pipeline can be managed in the Pipeline class
|
2025-09-29 16:03:49 +02:00 |
|
GassiGiuseppe
|
bd72ad3571
|
Added file to execute the complete cleaning pipeline
|
2025-09-29 15:21:26 +02:00 |
|