Commit Graph

11 Commits

Author SHA1 Message Date
Charles Packer
653693c398 feat: chatml-noforce-roles wrapper + cli fix (#738)
* added new wrapper option to turn system style messages into system role messages

* added multirole wrapper

* unrelated issue with cli print due to previous PR (not showing core memory edits)
2023-12-28 22:12:52 -08:00
Charles Packer
43ed0ff714 feat: added new 'hint' wrappers that inject hints into the pre-prefix (#707)
* added new 'hint' wrappers that inject hints into the pre-prefix

* modified basic search functions with extra input sanitization

* updated first message prefix
2023-12-25 11:29:42 -08:00
Charles Packer
4f23934e04 feat: Add new wrapper defaults (#656) 2023-12-21 17:05:38 +04:00
Charles Packer
b65980e2b3 add back dotdict for backcompat (#572) 2023-12-04 23:02:22 -08:00
Charles Packer
5e7676e133 Remove openai package and migrate to requests (#534) 2023-11-30 13:00:13 -08:00
Charles Packer
f92d8dfc8b add a longer prefix that to the default wrapper (#510)
* add a longer prefix that to the default wrapper (not just opening brace, but up to 'function: ' part since that is always present)

* drop print
2023-11-26 19:59:49 -08:00
Sarah Wooders
ec2bda4966 Refactor config + determine LLM via config.model_endpoint_type (#422)
* mark depricated API section

* CLI bug fixes for azure

* check azure before running

* Update README.md

* Update README.md

* bug fix with persona loading

* remove print

* make errors for cli flags more clear

* format

* fix imports

* fix imports

* add prints

* update lock

* update config fields

* cleanup config loading

* commit

* remove asserts

* refactor configure

* put into different functions

* add embedding default

* pass in config

* fixes

* allow overriding openai embedding endpoint

* black

* trying to patch tests (some circular import errors)

* update flags and docs

* patched support for local llms using endpoint and endpoint type passed via configs, not env vars

* missing files

* fix naming

* fix import

* fix two runtime errors

* patch ollama typo, move ollama model question pre-wrapper, modify question phrasing to include link to readthedocs, also have a default ollama model that has a tag included

* disable debug messages

* made error message for failed load more informative

* don't print dynamic linking function warning unless --debug

* updated tests to work with new cli workflow (disabled openai config test for now)

* added skips for tests when vars are missing

* update bad arg

* revise test to soft pass on empty string too

* don't run configure twice

* extend timeout (try to pass against nltk download)

* update defaults

* typo with endpoint type default

* patch runtime errors for when model is None

* catching another case of 'x in model' when model is None (preemptively)

* allow overrides to local llm related config params

* made model wrapper selection from a list vs raw input

* update test for select instead of input

* Fixed bug in endpoint when using local->openai selection, also added validation loop to manual endpoint entry

* updated error messages to be more informative with links to readthedocs

* add back gpt3.5-turbo

---------

Co-authored-by: cpacker <packercharles@gmail.com>
2023-11-14 15:58:19 -08:00
Charles Packer
fe2d8b2b2f add ollama support (#314)
* untested

* patch

* updated

* clarified using tags in docs

* tested ollama, working

* fixed template issue by creating dummy template, also added missing context length indicator

* moved count_tokens to utils.py

* clean
2023-11-06 15:11:22 -08:00
Charles Packer
e90c00ad63 Add grammar-based sampling (for webui, llamacpp, and koboldcpp) (#293)
* add llamacpp server support

* use gbnf loader

* cleanup and warning about grammar when not using llama.cpp

* added memgpt-specific grammar file

* add grammar support to webui api calls

* black

* typo

* add koboldcpp support

* no more defaulting to webui, should error out instead

* fix grammar

* patch kobold (testing, now working) + cleanup log messages

Co-Authored-By: Drake-AI <drake-ai@users.noreply.github.com>
2023-11-04 12:02:44 -07:00
danx0r
2f56e0eaf5 FIx #261 (#300)
* should fix issue 261 - pickle fail on DotDict class

* black patch

---------

Co-authored-by: cpacker <packercharles@gmail.com>
2023-11-03 23:33:59 -07:00
Charles Packer
faaa9a04fa refactored + updated the airo wrapper a bit 2023-10-23 00:41:10 -07:00