This is a really clever way to build self-learning!
I'm not sure how we move into RL-type LLM enhancement (like what Andrej Karpathy talks about here: https://www.youtube.com/watch?v=c3b-JASoPi0&t=1521s )
But this seems like a reasonable first step.
This is a really clever way to build self-learning!
I'm not sure how we move into RL-type LLM enhancement (like what Andrej Karpathy talks about here: https://www.youtube.com/watch?v=c3b-JASoPi0&t=1521s )
But this seems like a reasonable first step.