Scope of service
RCP provides the following:
- Hosting of models – there are already some models available, which can be viewed from the portal
- API – We provide an openAI-style API, which is typically a RESTful API designed for model inference, allowing users to send requests to a hosted AI model and receive responses. These APIs follow a client-server model, where the model runs on the RCP infrastructure, and clients interact with it using HTTP requests.
Prerequisites to use the service
Either pre-existing models can be used kindly send an email to rcpsupport@epfl.ch
Please note that models are not always active. They are loaded only when a request is made so as to use resources in the most efficient way. This means that the first time a model is used, it may take between 30 seconds and 4 minutes to respond, as it needs to be loaded first. Once loaded, subsequent responses will be much faster. Availability of the models is not an issue, a slight delay on the first request may be expected.
If such a delay is not acceptable for your application, for example, if you need an immediate response all of the time, you can contact us and we will be happy to discuss and work towards an acceptable solution.
