HNNewShowAskJobs
Built with Tanstack Start
Show HN: PILF, The ultimate solution to catastrophic oblivion on AI models(github.com)
8 points by NetRunnerSu 5 hours ago | 2 comments
  • Ifkaluva43 minutes ago

    It’s an interesting idea, I have two questions.

    - Surprise is detected by the norm of the gradients. So, doesn’t this suggest that the model already has a way of adjusting to surprise?

    - Is there a danger of model instability when the gradients become larger and the learning rate is also increased?

    • NetRunnerSu23 minutes ago |parent

      1. an overly strong surprise is like PTSD in humans - it changes the model's previously learned experience forever, this is what we want to avoid

      2. it's bound to happen, and our PILR-S is designed to keep the learning rate within the bell curve and decreasing as the surprise decreases (less new information, less learning).