Discussion about this post

Brendon Johnson

Great read. I'm currently working on a crisis informatics project that uses BERTopic, and I have seen the "beautifully coherent" but useless topics you describe at the end of the article. Comparing general-purpose and domain-specific (crisis-text) encoders yielded some interesting results.

The domain-specific encoder yielded clusters that scored lower on standard topic metrics (NPMI/coherence) but better represented the actionable intelligence and themes of a given crisis. Some care needs to go into how these new topic models are evaluated. In our case, an "actionability" score built from human and LLM judges better reflected which extracted themes were actually useful. Modern LLMs struggle to cluster themes at scale, but they can serve as good judges of topics extracted with LDA or BERTopic.
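For anyone unfamiliar with the NPMI metric mentioned above, here is a minimal sketch of how it can be computed for one topic's top words. It assumes document-level co-occurrence over a tokenized corpus. The function name `npmi_coherence` and the toy data are illustrative only, not the project's actual evaluation code, and library implementations typically use sliding windows and smoothing instead.

```python
import math
from itertools import combinations


def npmi_coherence(top_words, documents):
    """Mean NPMI over all pairs of top_words.

    Probabilities are estimated from document-level co-occurrence:
    P(w) is the fraction of documents that contain w.
    """
    if len(top_words) < 2:
        raise ValueError("need at least two top words to form a pair")

    doc_sets = [set(doc) for doc in documents]
    n_docs = len(doc_sets)

    def prob(*words):
        # Fraction of documents containing every word in `words`.
        return sum(all(w in d for w in words) for d in doc_sets) / n_docs

    scores = []
    for w1, w2 in combinations(top_words, 2):
        p1, p2, p12 = prob(w1), prob(w2), prob(w1, w2)
        if p12 == 0:
            scores.append(-1.0)  # the pair never co-occurs: NPMI lower bound
        elif p12 == 1:
            # Assumption: the pair appears in every document, so NPMI is 0/0;
            # it is scored as 0 here because it carries no information.
            scores.append(0.0)
        else:
            pmi = math.log(p12 / (p1 * p2))
            scores.append(pmi / -math.log(p12))
    return sum(scores) / len(scores)


# Toy corpus (made up for illustration).
docs = [
    ["flood", "rescue"],
    ["flood", "rescue"],
    ["donate", "shelter"],
    ["donate", "shelter"],
]
coherent = npmi_coherence(["flood", "rescue"], docs)    # words always co-occur
incoherent = npmi_coherence(["flood", "donate"], docs)  # words never co-occur
```

A score like this only measures how often a topic's words appear together. It says nothing about whether the topic is useful to a responder, which is the gap an actionability score is meant to fill.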
