You must log in or register to comment.
Those are not LLMs though.
True, they are probably not even transformers, but they are also trained with gradient descent.
Those are not LLMs though.
True, they are probably not even transformers, but they are also trained with gradient descent.