Angles of attack for continual learning safety

·LessWrong··

This is the fourth post in the sequence Implications of Continual Learning for LLM Agents.SummaryContinual learning is a capability that largely doesn’t exist yet in LLMs. We first want to acknowledge that this may make it difficult to identify tractable angles of attack for making CL safer: it may be too difficult to predict how the development of CL will play out to find good opportunities to positively influence that development. Differential development is one way to get around this issue, b...

Read full article →

Related Articles

Apple is about to make Hide My Email useless
SXX · Hacker News · 14h ago
TIL: You can make HTTP requests without curl using Bash /dev/TCP
mrshu · Hacker News · 16h ago
A backdoor in a LinkedIn job offer
lwhsiao · Hacker News · 1d ago
Mechanical Watch (2022)
razin · Hacker News · 21h ago
Google Chrome's Next Update Will Mark the End of Popular Ad Blockers
arnejenssen · Hacker News · 17h ago