NanoSocrates/Project_Model/Libs/Embedder/Classes/NanoSocratesEmbedder.py


import torch
from ..Utils import fixed_positional_encoding
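
# --- Illustrative sketch (not part of the original file) -------------------
# fixed_positional_encoding lives in ..Utils and is not shown here. Under the
# assumption that it computes the standard sinusoidal encoding from
# "Attention Is All You Need", a minimal reference version might look like
# the hypothetical helper below (named differently so it does not shadow the
# real import above; assumes an even embedding size).
import math


def _sinusoidal_encoding_sketch(length: int, size: int) -> torch.Tensor:
    position = torch.arange(length).unsqueeze(1)  # (length, 1)
    div_term = torch.exp(
        torch.arange(0, size, 2) * (-math.log(10000.0) / size)
    )  # (size / 2,)
    encoding = torch.zeros(length, size)
    encoding[:, 0::2] = torch.sin(position * div_term)  # even dimensions
    encoding[:, 1::2] = torch.cos(position * div_term)  # odd dimensions
    return encoding
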
# WIP FOR BATCHING
class NanoSocratesEmbedder(torch.nn.Module):
    def __init__(self, vocabulary_size: int, embedding_size: int) -> None:
        super().__init__()

        self.__embedder = torch.nn.Embedding(vocabulary_size, embedding_size)
    def forward(self, tokenized_sentence: list[list[int]]) -> torch.Tensor:
        # torch.tensor() requires a rectangular batch: every inner list of
        # token ids must have the same length.
        TOKENIZED_TENSOR = torch.tensor(tokenized_sentence)
        computed_embeddings: torch.Tensor = self.__embedder(TOKENIZED_TENSOR)

        # Shape is (batch, sentence_length, embedding_size); the batch
        # dimension is discarded because the encoding is shared across rows.
        _, SENTENCE_LENGTH, EMBEDDING_SIZE = computed_embeddings.shape
        POSITIONAL_ENCODINGS = fixed_positional_encoding(
            SENTENCE_LENGTH, EMBEDDING_SIZE
        )

        # The (sentence_length, embedding_size) encoding broadcasts over the
        # batch dimension.
        computed_embeddings = computed_embeddings + POSITIONAL_ENCODINGS
        return computed_embeddings
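

# --- Usage sketch (illustrative, not part of the original file) ------------
# How this module might be driven, assuming a hypothetical vocabulary of
# 1000 tokens and 64-dimensional embeddings. The batch must be rectangular
# (equal-length token lists) for torch.tensor() to accept it.
if __name__ == "__main__":
    embedder = NanoSocratesEmbedder(vocabulary_size=1000, embedding_size=64)
    batch = [[1, 5, 9, 2], [4, 7, 3, 0]]  # two sequences of equal length
    embeddings = embedder(batch)
    print(embeddings.shape)  # torch.Size([2, 4, 64])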