Cloud LoRA

Spin up your Llama LoRA and get millions of tokens in minutes.



Here's the deal:
        

    # create your Llama model and apply your LoRA adapter
    peft_model = ...

    # create a cloud LoRA model
    cloud_model = CloudLora.create(peft_model)

    # scalable remote inference
    cloud_model.remote().get_completion(...)
        
    

© 2023 Morph Labs. All rights reserved.