Releases · nrl-ai/llama-assistant
AnyLearning v0.1.41 - Context handling improvements
Demo video: Llama.Assistant.RAG.Demo.-.1080p.mp4
🔧 Changes
- Utilize llama.cpp's KV cache mechanism for faster inference. See (1).
- Summarize the chat history when it is about to exceed the context length.
- Recursively check for and fill in missing settings from the DEFAULT CONFIG.
- Add validators (type, min, max value) for input fields in the settings dialog.
(1) llama.cpp's KV cache checks the prefix of your chat history and reuses the cached Key-Value vectors for the part that matches. For example:
- Generated sequence so far = "ABCDEF"
- If we modify the chat history to something like "ABCDXT", the prefix "ABCD" matches, so its cache is reused; the Key and Value vectors for "XT" are computed fresh, and new responses are generated from there.
-> So we should make the most of this mechanism by keeping the history prefix as fixed as possible.
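A minimal sketch of how an app might lean on this behaviour, assuming the llama-cpp-python bindings (`Llama` and its completion call) and a purely illustrative model path and prompt format:

```python
from llama_cpp import Llama

# Illustrative model path and context size; not the app's actual configuration.
llm = Llama(model_path="model.gguf", n_ctx=4096)

history = "System: You are a helpful assistant.\n"  # fixed prefix, never rewritten

def ask(user_message: str) -> str:
    """Append a turn and generate a reply, keeping earlier turns untouched."""
    global history
    # Because the history is only appended to, each call shares the longest
    # possible prefix with the previous call, so llama.cpp can reuse the KV
    # cache for that prefix and only compute K/V for the newly added tokens.
    history += f"User: {user_message}\nAssistant:"
    out = llm(history, max_tokens=256, stop=["User:"])
    answer = out["choices"][0]["text"]
    history += answer + "\n"
    return answer

print(ask("What is a KV cache?"))
print(ask("And why does the prompt prefix matter?"))
```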
AnyLearning v0.1.40 - RAG Support with LlamaIndex
🔧 Changes
- 💬 Added support for continuing conversations.
- 🔍 Added RAG support with LlamaIndex (see the sketch below).
- ⚙️ Added model settings.
- 📝 Added markdown support.
- ⌛ Added loading text animation while downloading the models and generating answers.
- 🔄 Fixed chatbox scrolling issue.
Thanks to @gallegi for adding these features! 👍
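As a rough illustration of the RAG flow, here is a minimal LlamaIndex sketch, assuming a recent llama-index release (where the classes live under `llama_index.core`) and its default embedding/LLM settings; the `docs/` folder and the query are purely illustrative:

```python
from llama_index.core import SimpleDirectoryReader, VectorStoreIndex

# Load local files and build a vector index over them (default settings).
documents = SimpleDirectoryReader("docs").load_data()
index = VectorStoreIndex.from_documents(documents)

# Ask a question grounded in the indexed documents.
query_engine = index.as_query_engine()
response = query_engine.query("What does this project do?")
print(response)
```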
Llama Assistant v0.1.32 - Build for macOS, Linux, Windows
🔧 Changes
- 🔄 Replace the Whisper implementation with pywhispercpp (see the sketch below).
- 🖥️ Add built versions:
- 🍎 macOS
- 🪟 Windows
- 🐧 Linux
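For reference, transcription via pywhispercpp looks roughly like the sketch below; the model name and audio path are illustrative and not necessarily what the app uses:

```python
from pywhispercpp.model import Model

# Load a whisper.cpp model and transcribe an audio file.
model = Model("base.en")
segments = model.transcribe("speech.wav")
text = " ".join(segment.text for segment in segments)
print(text)
```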
Llama Assistant v0.1.28/29
Llama Assistant v0.1.26
Llama Assistant v0.1.24
🐛 Bugfixes
- 🖱️ Fixed: wrong cursor position when inserting text.
- 💥 Fixed: crash when changing the shortcut.
- ⌨️ Fixed: wrong shortcut keys on macOS.
- 🙈 Hide "Copy Result" or "Clear" after clearing the chat.
- 🔄 Handle the last response correctly.
🔧 Changes
- 🧹 Clear results + input when clicking "Clear".
- ⌨️ Type "clear" or "cls" to clear.
- 🚫 Disable action buttons when there is no input.