0.9.0-beta

@maciejmajek maciejmajek released this 18 Sep 19:29
· 86 commits to development since this release
da5ac2e

Summary of changes

This release brings a considerable refactoring of RAI's agent system. The RAI architecture now consists of two separate agents:

  • HMI agent (for communication, both voice and text)
  • RAI agent (for solving tasks/missions)

The default workflow is now to add a task to the RAI agent through an HMI such as text_hmi (a Streamlit app). The RAI agent picks up the task, reports feedback on every iteration, and returns a task report at the very end (success or failure).
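
The workflow above can be pictured with a minimal sketch. It is not RAI's implementation: `TaskReport`, `run_task_agent`, and the in-process queue are hypothetical names chosen for illustration, and the task is always marked successful because no real work is performed.

```python
from dataclasses import dataclass, field
from queue import Queue
from typing import Callable, List


@dataclass
class TaskReport:
    """Hypothetical final report returned once a task finishes."""
    task: str
    success: bool
    feedback: List[str] = field(default_factory=list)


def run_task_agent(
    tasks: Queue,
    on_feedback: Callable[[str], None],
    max_iterations: int = 3,
) -> List[TaskReport]:
    """Drain the queue; emit feedback on every iteration, then one report per task."""
    reports = []
    while not tasks.empty():
        task = tasks.get()
        feedback = []
        for i in range(1, max_iterations + 1):
            message = f"[{task}] iteration {i}/{max_iterations}"
            on_feedback(message)  # per-iteration feedback sent back to the HMI
            feedback.append(message)
        # Placeholder outcome: a real agent would decide success or failure here.
        reports.append(TaskReport(task=task, success=True, feedback=feedback))
    return reports


# HMI side: submit a task, collect feedback, read the final report.
tasks = Queue()
tasks.put("inspect the warehouse")
seen = []
reports = run_task_agent(tasks, seen.append)
```

Here the HMI only talks to the agent through the queue and the feedback callback, which mirrors the separation between the two agents described above.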

We have also added a new, refactored version of the voice interface with a much better voice detection system. Our ASR now supports (possibly) any recording device, regardless of its sampling rate, as well as wake-word detection using OpenWakeWord.
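
One common way to make a pipeline independent of the device's sampling rate is to resample incoming audio to a single target rate before detection and transcription. The sketch below illustrates that idea only; it is not taken from RAI's code. The function name `resample_linear` and the 16 kHz default are assumptions, and plain linear interpolation is used for brevity (a production pipeline would normally use a proper resampler with anti-aliasing).

```python
def resample_linear(samples, source_rate, target_rate=16000):
    """Resample a mono signal by linear interpolation (illustrative, no anti-aliasing)."""
    if source_rate <= 0 or target_rate <= 0:
        raise ValueError("sampling rates must be positive")
    if not samples or source_rate == target_rate:
        return list(samples)

    n_out = max(1, round(len(samples) * target_rate / source_rate))
    step = source_rate / target_rate  # input samples advanced per output sample
    last = len(samples) - 1
    out = []
    for i in range(n_out):
        pos = i * step
        left = min(int(pos), last)
        right = min(left + 1, last)
        frac = pos - left if left < last else 0.0
        # Blend the two neighbouring input samples.
        out.append(samples[left] * (1 - frac) + samples[right] * frac)
    return out


# One second of silence recorded at 48 kHz becomes one second at 16 kHz.
one_second = resample_linear([0.0] * 48000, 48000)
```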

Additionally, other enhancements were introduced:

  • RAI What I See (a node for intelligent image capture based on heuristics, e.g. whether anything has changed since the last capture)
  • Extended tests
  • CI license checking
  • NoMaD transformer-based navigation system
  • Overall documentation improvements
  • Global configuration file (config.toml) for setting the default model used across RAI's modules
  • Local ASR (Whisper)
  • CI has been extended to ROS 2 Humble
  • Pydantic V2 compatibility with the latest LangChain version
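
To illustrate the kind of "did anything change since the last time?" heuristic mentioned for RAI What I See, here is a minimal sketch. It is an assumption about how such a check could work, not the node's actual logic: `frame_changed`, `ChangeGatedCapture`, and the threshold value are hypothetical, and frames are modelled as flat lists of grayscale pixel values.

```python
def frame_changed(previous, current, threshold=10.0):
    """Return True when the mean absolute pixel difference exceeds the threshold."""
    if previous is None:
        return True  # nothing captured yet, so the first frame always counts
    if len(previous) != len(current):
        raise ValueError("frames must have the same number of pixels")
    if not current:
        return False
    total = sum(abs(a - b) for a, b in zip(previous, current))
    return total / len(current) > threshold


class ChangeGatedCapture:
    """Keep a frame only if it differs enough from the last kept frame."""

    def __init__(self, threshold=10.0):
        self.threshold = threshold
        self.last_kept = None
        self.kept = []

    def offer(self, frame):
        if frame_changed(self.last_kept, frame, self.threshold):
            self.last_kept = list(frame)
            self.kept.append(self.last_kept)
            return True
        return False


gate = ChangeGatedCapture()
first = gate.offer([0, 0, 0, 0])           # kept: no previous frame
second = gate.offer([1, 1, 1, 1])          # skipped: mean difference is 1
third = gate.offer([100, 100, 100, 100])   # kept: mean difference is 100
```

Comparing against the last kept frame, rather than the immediately preceding one, prevents slow drift from going unnoticed.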

What's Changed

New Contributors

Thank you for your contributions!

Full Changelog: 0.8.1-beta...0.9.0-beta