* add llamacpp server support
* use gbnf loader
* cleanup and warning about grammar when not using llama.cpp
* added memgpt-specific grammar file
* add grammar support to webui api calls
* black
* typo
* add koboldcpp support
* no more defaulting to webui, should error out instead
* fix grammar
* patch kobold (testing, now working) + cleanup log messages
Co-Authored-By: Drake-AI <drake-ai@users.noreply.github.com>