Plenty of progress in models that can use tools and search. Would love to see how one of these tool/search-enabled models do at this kind of a task. In my experience, they don't fabricate things anymore, just sometimes occasionally misrepresent the content of citations (put a citation somewhere where it doesn't actually support what is written).
A few days ago I asked GPT 5 for links to news on the Charlotte murder before the story got reported by the mainstream media. It gave me five different links, including AP and Reuters. Every one, five out of five, was a hallucination.
It hallucinated complete documentation to the tech we asked it about just 2 weeks ago. Completely made up documentation with only vague relationship.to how it really works.
403 for me - which makes me wonder how anyone else is commenting on the actual content of the link, rather than just recycling general comments. without knowing the details.
It supposed to search for actual documents and then process them (extract content, summarize, giving you the links, and so on).