Issues using the godot-llama-cpp plugin

Godot Version

4.2

Question

Hello, I made a simple chatbox in Godot, which takes as input the text typed by the user and outputs replies generated by a large language model. To run the model, I use Ollama. The project code is available here.

Because replies from Ollama are slow, I would like to use the godot-llama-cpp plugin, which is based on llama.cpp. I followed the installation notes listed in the repo, but I got stuck when adding the LlamaContext node to my scene (point 6). Could anyone explain in detail what I should do? Thanks heaps!