Evaluating the factuality of verifiable claims in long-form text generation (aclanthology.org)
9 points by gone35 5 days ago | 1 comment
  • ggm 2 days ago

    As a non-expert, I found this fascinating. They compared mechanistic verification against 3 assessors, and seem to be saying it's at least as good.

    They also make their scripting available.

    On the whole, I'd be more interested in how they rank the various consumer services (Grok especially) for "truthiness", but perhaps establishing how mechanistic verification performs is a necessary precursor.

    Essay writers beware! Those cites are going to be checked more than before.