Quotas, Limitations, and Technical FAQ

Quota Limits
Known Limitations
Technical Frequently Asked Questions
Next Steps

Quota Limits

The following are the quota for users based, stratified by the three Subscription Plans offered by Fermyon Cloud:

* Signifies a resource that is not an explicit line item in the Fermyon Cloud Subscription Plan but is eligible for a quota increase for Growth and Enterprise Plan users who can reach out to Fermyon team for assistance.

	Starter Plan	Growth Plan	Enterprise Plan
Application Quota
Spin application count	5	100	Contact us
Spin application package size (MB)*	100	100	Contact us
Spin application deployments per minute*	10	10	Contact us
Request handler duration (seconds)*	30	30	Contact us
HTTP body size limit (MB)*	10	10	Contact us
Rate Limiting Quota
Request execution count	100,000	1,000,000	Contact us
Request executions per second*	1,000	1,000	Contact us
Serverless AI inferencing requests per hour*	75	75	Contact us
Serverless AI embedding requests per hour*	200	200	Contact us
Spin application deployments per hour*	100	100	Contact us
Outbound requests per hour per Spin app*	500	500	Contact us
Networking
Custom domains *	5	100	Contact us
Custom Fermyon subdomain character count*	62	62	Contact us
Bandwidth egress (GB)	5	50	Contact us
Storage
Key value store key size (bytes)*	255	255	Contact us
Key value store value size (MB)*	1	1	Contact us
Key value maximum keys*	1,024	1,024	Contact us
Key value store count	5	100	Contact us
Key value storage size (GB)	1	2	Contact us
SQLite database count	1	1	Contact us
SQLite database size (GB)	1	2	Contact us
Serverless AI
Token count per request (response only)*	1,024	1,024	Contact us
Regions
Region count	1	1	Contact us

Note: The above quota numbers are per month and are spread across all apps for a single Cloud account.

Known Limitations

Fermyon Cloud Limitations

A custom Fermyon subdomain must be unique
A user can execute a maximum of 3,000 requests per hour toward the Cloud API. This includes API requests from the CLI (spin) and navigating the Fermyon Cloud website.
The device and browser token lifetime for Fermyon Cloud Dashboard authentication is 7 days
A Spin application can have a maximum of 1 key value store

Custom Domains Limitations

Additional records for custom domains are not supported on Fermyon Cloud at this time. This means any additional records (such as TXT or MX) that you have already set through your registrar will become inactive once top-level (apex) domain delegation to Fermyon Cloud is completed. Updating your domain’s nameserver records will cause any additional records you’ve set through your registrar to no longer take effect e.g. TXT or MX records for services like Google Workspace. If you need to assign your Spin application to an apex domain (example.com rather than a subdomain of the apex domain app.example.com) and you also need to set additional DNS records associated with that domain, please share this feedback here. In the meantime, please consider delegating a subdomain or using a Fermyon custom subdomain for your Spin application instead.

Spin Limitations

Fermyon Cloud supports Spin CLI v0.6.0 or newer. That being said, there are certain Spin SDK triggers and APIs that are not yet supported on Fermyon Cloud. Please review the table below to see what is supported today on Fermyon Cloud:

Feature	SDK Supported?
Triggers
HTTP	Supported
Redis	Not supported
APIs
Outbound HTTP	Supported
Configuration Variables	Supported
Key Value Storage	Supported
SQLite Storage	Supported
Serverless AI	Supported
Service Chaining	Not supported
MySQL	Supported
PostgreSQL	Supported
Outbound Redis	Supported
wasi-cloud-core Drafts
wasi-blobstore	Not supported in Spin
wasi-config	Supported (2024-09-27 snapshot)
wasi-keyvalue	`store` interface supported (2024-10-17 snapshot); `atomics` and `batch` not supported
wasi-messaging	Not supported in Spin
Features
Component dependencies	Supported
Extensibility
Custom Triggers	Not supported

To learn more about what feature support looks like for various programming languages, visit the Spin Language Support Guide.

Other Limitations

You cannot communicate between Spin applications using local name resolution
Runtime configuration and secrets are not supported at this time

Technical Frequently Asked Questions

Why do I see mixed replies from my service during an upgrade?
- When doing an upgrade of an application, there is a gradual roll-out happening. This means that requests will hit both the existing and new modules, as the upgrade completes. You will see a pattern like the one below, showing the body reply from an HTTP request:
Can I export data from my SQLite Database? At this time, Fermyon Cloud does not have a data export feature. We’d love to hear your thoughts on what an ideal experience would look like at github.com/fermyon/feedback
If I upgrade my Spin application, can I still access the same instance of my SQLite Database? Yes, as long as you haven’t deleted your database, the instance will be running and stateful.
If I delete and redeploy my Spin application, can I still access the same instance of my SQLite Database? Yes, Fermyon Cloud uses abstract labels and links to enable seamless database sharing and management among Spin applications.

11:08:13 : Hello from Rust
11:08:18 : Hello from Rust - updated
11:08:19 : Hello from Rust
11:08:23 : Hello from Rust - updated
11:08:24 : Bad Gateway
11:08:26 : Hello from Rust - updated
11:08:27 : Bad Gateway
11:08:29 : Hello from Rust - updated

Q: It’s been over 72 hours and my custom domain hasn’t successfully verified, now what?
- If you’re a Growth or Enterprise plan user, please reach out to support@fermyon.com. Otherwise, please go to Discord’s #cloud channel for assistance.
Q: How do I add a new record?
- At this time, you can only add a maximum of 1 custom domain to your Spin application. We do not support additional records at this time. To file a feedback request, please visit github.com/fermyon/feedback.
Can I configure external DNS for my custom domain on Fermyon Cloud?
- Fermyon Cloud only supports Fermyon DNS at this time for custom domains. To file a feedback request, please visit github.com/fermyon/feedback.
Unable to redeploy an app on Fermyon Cloud?
- If spin deploy fails with the error No channel with app_id xxxx and name spin-deploy, but the app works locally (via spin build --up), please delete the app from your Fermyon Cloud dashboard and try again.
How does Fermyon Serverless AI compare to OpenAI? In what situations am I better off using Fermyon Serverless AI?
- With Fermyon Serverless AI, you don’t need to be worried about vendor lock-in. With Fermyon Serverless AI you can run your inferencing workloads locally with Spin or on your infrastructure of choice with Spin install. In fact, Fermyon Serverless AI is built using open source models that can be used completely outside the context of Spin in case you decide to host your own infrastructure end to end.
Which specific model(s) is Fermyon Serverless AI using for inferencing? [i.e. 7B, 13B, 70B] Why was this model chosen?
- Fermyon Serverless AI is using the 13B model for inferencing, both for llama2-chat and codellama-instruct, as it strikes an excellent balance of performance and accuracy. If you’re interested in another model, please share this feedback at github.com/fermyon/feedback.
How do I choose which model I’m using?
- llama2-chat is trained for chat use cases, although it can be used for general language model tasks as well. code llama is meant for generating code based on instructions. However, these are just high-level guideposts. Please visit Meta AI’s documentation for more specific instructions and to learn more about the tradeoffs between llama2 and Code Llama.
Do you support Fine Tuning? Bring your own Model? Any models other than LLaMa2 and CodeLlama? Vector Databases?
- At this time, we do not support Fine Tuning or bringing your own model. Fermyon Serverless AI supports llama2 and CodeLlama. If you’re interested in another model, please share that feedback with our team by raising an issue here. Vector databases, backed by our SQLite Database.
I read somewhere that I’ll receive data about my AI inferencing usage — where can I find that data? Where can I find a reference as to the definition(s) of each of those data elements?
- In the request response body, you will see the number of prompt tokens and the number of generated tokens for inferencing requests. For embedding requests, you’ll be presented with the number of prompt tokens.
What programming languages / SDKs can I use to invoke Serverless AI?
- Please visit the API Guide for this information.

Next Steps

Learn how to engage with Fermyon to get support

Spin CNCF

Wasm Functions

Cloud

SpinKube CNCF

Platform for Kubernetes