Skip to content

v0.7.0: Manage events without agendas in the summarisation workflow

Compare
Choose a tag to compare
@bakaburg1 bakaburg1 released this 11 Apr 13:05
· 47 commits to main since this release
5311889

This pull request includes a series of enhancements and fixes that improve the transcript summarization workflow, add new functionality for entity extraction, and ensure better support for various transcript formats. The changes also address code quality and documentation to provide a more robust and user-friendly experience.

Breaking:

  • Replaced event_audience with audience as argument name for consistency across the framework. Before, some functions used the first and some the second term (Commit: 644fb29).

Enhancements:

  • Workflow Enhancement: Added support for summarizing meetings without an agenda in the workflow. Before, the full workflow function was designed to only work with long meetings organized in sub-talks described by an agenda. (Commit: 644fb29).
  • Entity Extraction Functionality: Introduced the entity_extractor function to identify and extract entities such as people, acronyms, organizations, and concepts from a text, which can be particularly useful for building vocabularies for LLMs from event descriptions or transcripts (Commit: ae4fc3c).
  • Agenda Management: Added the ability to manage deviations from the expected agenda, allowing the LLM to add talks not described in the expected agenda, enhancing the flexibility of the summarization process (Commit: 40f7620).
  • Support for MS Teams VTT Files: Implemented support for importing transcripts from MS Teams VTT files, which do not follow the standard VTT specification (Commit: cfa9673).
  • Output Quality Improvements: Utilized the styler package to enhance the readability of generated agendas and unformatted summary outputs, contributing to better readability and user experience (Commit: 194b8c8).

Fixes:

  • Agenda Generation Bug: Resolved an issue where the agenda generation was creating infinite unnamed speaker lists, exhausting the context window (Commit: bfc5597).

Dependencies:

  • Styler Package Addition: Added the styler package and its dependencies to the project, which is used to improve the formatting of the generated outputs (Commit: e88a6bd).

Haiku:
🐇✨
In the code's gentle weave, a change was made,
With careful paws, the paths were laid.
From VTT to summaries so bright,
A rabbit hopped through code, by night.
Celebrate each line, for through the maze,
We find our way, in the code's embrace.
🌟📜

What's Changed

  • v0.7.0: Manage events without agendas in the summarisation workflow by @bakaburg1 in #19

Full Changelog: v0.6.0...v0.7.0