
$MODEL API Reference

Complete API reference for the $MODEL data type in Grapa.

Constructor

$MODEL()

Creates a new model instance.

Returns: New $MODEL instance

Example:

model = $MODEL();

Methods

.load(path, method)

Loads a model from the specified file path.

Parameters:

  • path (string): Path to the model file (GGUF format)
  • method (string, optional): Method to use. If not specified, the method is auto-detected from the file extension or magic bytes.

Returns:

  • 0: Success
  • -1: Model context creation failed
  • -2: Model loading failed
  • Other negative values: Various error conditions

Example:

/* Load OpenAI text generation model */
result = model.load("gpt-4o", "openai");
if (result == 0) {
    "Model loaded successfully".echo();
}

/* Load OpenAI embedding model */
result = model.load("text-embedding-3-small", "openai-embedding");

Error Codes:

  • -1: Failed to create model context (usually memory or file issues)
  • -2: Failed to load model (file not found, invalid format, etc.)

.gen(prompt, params)

Generates text using the loaded model.

Parameters:

  • prompt (string): Input text prompt for generation
  • params (object, optional): Generation parameters that override the defaults

Returns: Generated text string

Example:

/* Basic generation */
response = model.gen("Hello, how are you?");

/* Generation with custom parameters */
response = model.gen("Explain AI", {
    "temperature": 0.8,
    "max_tokens": 100
});

Generation Parameters:

  • temperature (float): Controls randomness (0.0 = deterministic, 1.0+ = more random)
  • max_tokens (int): Maximum number of tokens to generate
  • top_k (int): Number of top tokens to consider
  • top_p (float): Nucleus sampling threshold
  • repeat_penalty (float): Penalty for repeating tokens
  • seed (int): Random seed (-1 for random)
  • context_size (int): Context window size
  • verbose (int): Logging verbosity level
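As a minimal sketch of how these parameters interact (assuming seed and temperature behave as documented above), a fixed seed with temperature 0.0 should make a generation repeatable across runs:

```
/* Sketch: deterministic generation via fixed seed and zero temperature */
model.params({
    "temperature": 0.0,  /* no sampling randomness */
    "seed": 1234         /* fixed seed instead of -1 (random) */
});
response = model.gen("Name the capital of France");
response.echo();
```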

.info()

Returns information about the current model state.

Returns: Object containing:

  • loaded (boolean): Whether the model is currently loaded
  • method (string): Method being used
  • model_path (string): Full path to the loaded model file
  • model (string): Model filename only
  • model_size_bytes (int): Size of the model file in bytes

Example:

info = model.info();
("Model loaded: " + info."loaded".str()).echo();
("Method: " + info."method".str()).echo();
("Model path: " + info."model_path".str()).echo();
("Model name: " + info."model".str()).echo();
("Model size: " + info."model_size_bytes".str() + " bytes").echo();

.params()

Returns current generation parameters.

Returns: Object containing all current parameters:

  • temperature (float): Current temperature setting
  • max_tokens (int): Current max tokens setting
  • top_k (int): Current top_k setting
  • top_p (float): Current top_p setting
  • repeat_penalty (float): Current repeat penalty setting
  • seed (int): Current seed setting
  • context_size (int): Current context size setting
  • verbose (int): Current verbose level setting

Example:

params = model.params();
("Current temperature: " + params."temperature".str()).echo();
("Max tokens: " + params."max_tokens".str()).echo();

.params(parameters)

Sets generation parameters using a $GOBJ collection.

Parameters:

  • parameters ($GOBJ): Collection containing parameter name-value pairs

Returns: None

Example:

model.params({
    "temperature": 0.7,
    "max_tokens": 50,
    "verbose": 2
});

Supported Parameters:

  • "temperature" (float): 0.0 to 2.0+
  • "max_tokens" (int): 1 to 4096+
  • "top_k" (int): 1 to 100+
  • "top_p" (float): 0.0 to 1.0
  • "repeat_penalty" (float): 0.0 to 2.0+
  • "seed" (int): -1 for random, or any integer
  • "context_size" (int): 512 to 131072+
  • "verbose" (int): 0-4 (0=silent, 1=errors, 2=warnings, 3=info, 4=debug)
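A common pattern is raising verbosity temporarily while debugging and restoring silent operation afterwards (a sketch; that parameters omitted from the collection keep their current values is an assumption based on the examples above):

```
/* Raise verbosity while debugging a prompt, then restore silence */
model.params({"verbose": 3});   /* info-level logging */
response = model.gen("Test prompt");
model.params({"verbose": 0});   /* back to silent operation */
```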

.context()

Gets the current context state of the model.

Returns: Object containing context information:

  • text (string): Current text context
  • tokens (list): Current token context (for efficient method processing)
  • method (string): Method being used
  • model (string): Model filename

Example:

context = model.context();
("Context text: " + context."text".str()).echo();
("Context tokens: " + context."tokens".len().str() + " tokens").echo();
("Method: " + context."method".str()).echo();

.context(contextData)

Sets the context for the model.

Parameters:

  • contextData (string, list, or object): Context data to set
      • If string: Sets the text context
      • If list: Sets the token context (for efficient processing)
      • If object: Can contain both "text" and "tokens" fields

Returns: None

Example:

/* Set text context */
model.context("Previous conversation: Hello, how are you?");

/* Set token context (more efficient) */
tokenList = [1, 2, 3, 4, 5];
model.context(tokenList);

/* Set both text and tokens */
model.context({
    "text": "Previous conversation",
    "tokens": [1, 2, 3, 4, 5]
});
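Putting the getter and setter together, a text context set here should be visible through .context() (a sketch; the exact token list depends on the loaded model and method):

```
/* Round-trip: set a text context, then inspect the stored state */
model.context("The capital of France is");
ctx = model.context();
("Stored text: " + ctx."text".str()).echo();
("Token count: " + ctx."tokens".len().str()).echo();
```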

.load()

Calling .load() with no arguments unloads the current model and frees its memory.

Returns:

  • 0: Success
  • -1: Error during unload

Example:

result = model.load();
if (result == 0) {
    "Model unloaded successfully".echo();
}

Default Parameters

When a new $MODEL instance is created, the following default parameters are set:

| Parameter      | Default Value | Description                        |
|----------------|---------------|------------------------------------|
| temperature    | 0.7           | Moderate creativity                |
| max_tokens     | 10            | Short responses (good for testing) |
| top_k          | 40            | Balanced token selection           |
| top_p          | 0.9           | Nucleus sampling threshold         |
| repeat_penalty | 1.1           | Slight penalty for repetition      |
| seed           | -1            | Random seed                        |
| context_size   | 2048          | Standard context window            |
| verbose        | 0             | Silent operation                   |
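The defaults above can be confirmed by reading them back from a fresh instance (a sketch following the .params() example earlier in this page):

```
/* Inspect defaults on a newly created instance */
fresh = $MODEL();
defaults = fresh.params();
("Default temperature: " + defaults."temperature".str()).echo();   /* 0.7 */
("Default max_tokens: " + defaults."max_tokens".str()).echo();     /* 10 */
("Default context_size: " + defaults."context_size".str()).echo(); /* 2048 */
```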

Error Handling

Common Error Scenarios

  1. Model Loading Errors:

    result = model.load("nonexistent.gguf");
    if (result != 0) {
        ("Load failed: " + result.str()).echo();
        /* Handle error appropriately */
    }
    

  2. Generation Errors:

    response = model.gen("Hello");
    if (response.type() == $ERR) {
        ("Generation error: " + response.str()).echo();
    } else if (response.len() == 0) {
        "Warning: Empty response".echo();
    }
    

  3. Parameter Validation:

    /* Validate parameter ranges */
    if (temperature >= 0.0 && temperature <= 2.0) {
        model.params({"temperature": temperature});
    } else {
        "Invalid temperature value".echo();
    }
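The three scenarios above can be combined into one guarded workflow (a sketch; the model filename is hypothetical and the checks follow the examples in this section):

```
/* Guarded load-and-generate workflow */
model = $MODEL();
result = model.load("qwen2.5-7b-instruct-q2_k.gguf");
if (result != 0) {
    ("Load failed with code: " + result.str()).echo();
} else {
    model.params({"temperature": 0.7, "max_tokens": 50});
    response = model.gen("Summarize thread safety in one sentence");
    if (response.len() == 0) {
        "Warning: Empty response".echo();
    }
    model.load();  /* Free memory when done */
}
```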
    

Thread Safety

The $MODEL type is designed to be thread-safe:

  • Multiple model instances can be used concurrently
  • Each instance maintains its own state
  • No shared global state between instances

Example:

/* Safe to use multiple instances */
model1 = $MODEL();
model2 = $MODEL();

model1.load("model.gguf");
model2.load("model.gguf");

/* These can run concurrently */
response1 = model1.gen("Hello");
response2 = model2.gen("Hi");

Memory Management

Best Practices

  1. Always unload models when done:

    model = $MODEL();
    model.load("large_model.gguf");
    
    /* Use model... */
    
    model.load();  /* Free memory */
    

  2. Check the result when loading large models (remote methods such as OpenAI avoid local memory pressure entirely):

    /* Load OpenAI model */
    model = $MODEL();
    result = model.load("gpt-4o", "openai");
    if (result != 0) {
        "Failed to load OpenAI model".echo();
    }
    

  3. Use appropriate context sizes:

    /* Smaller context for memory-constrained environments */
    model.params({"context_size": 1024});
    

Performance Considerations

Optimization Tips

  1. Set appropriate parameters for your use case:

    /* For fast, short responses */
    model.params({
        "max_tokens": 20,
        "temperature": 0.3
    });
    
    /* For creative, longer responses */
    model.params({
        "max_tokens": 200,
        "temperature": 1.0
    });
    

  2. Disable verbose output for production:

    model.params({"verbose": 0});  /* Silent operation */
    

  3. Use smaller models for faster inference:

    /* Q2_K or Q3_K_M for speed */
    model.load("qwen2.5-7b-instruct-q2_k.gguf");
    

  4. Method characteristics to keep in mind:

      • Hardware acceleration: CPU-optimized
      • Thread safety: Yes
      • Auto-detection: From the .gguf file extension or "GGUF" magic bytes
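Because the GGUF method is auto-detected, the method argument can usually be omitted for local model files (a sketch; the filename is hypothetical):

```
/* Method auto-detected from the .gguf extension */
model = $MODEL();
result = model.load("qwen2.5-7b-instruct-q2_k.gguf");  /* no method argument */
info = model.info();
("Detected method: " + info."method".str()).echo();
```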

Future Methods

The $MODEL type is designed to support multiple methods:

  • TensorFlow: For TensorFlow models (auto-detected from .tflite extension)
  • PyTorch: For PyTorch models (via Python integration)
  • scikit-learn: For traditional ML models (auto-detected from .pkl extension)

See Also