Changelog

  1. Web

    • Launched official models
    • Added latest predictions to the deployment detail page
    • Fixed an issue with displaying certain pages on Replicate when logged out
    • Updated the model detail API playground to make it possible to train with inputs that aren’t .zip files
    • Made it clearer when creating a deployment that Flux fine-tunes shouldn’t be deployed
    • Fixed a bug where numeric inputs could increment on scroll on the model detail API playground
    • Fixed a bug that could cause inaccessible models to show up in collections
    • Improved the experience of sharing predictions via web
    • Improved clarity of autoscaling charts for deployments
    • Made it easier to copy the model identifier (for official models) or the model identifier and SHA (for community models)
    • Updated chart for GPU metrics to make values easier to differentiate

    Playground

    • Added an icon to thumbnails to differentiate video content from other types
    • Made it easier to see when errors occur and why
    • Added the ability to remove multiple items from the grid on the playground
    • Fixed a bug when selecting individual items that could cause all items to be deselected
    • Added Vercel AI SDK snippets to playground
    • Improved the loading and handling of large numbers of videos

    Docs

  2. Official models are always on and have predictable pricing.
  3. A collection of updates to Replicate for the week ending January 17, 2025.
  4. A collection of updates to Replicate for the week ending December 20, 2024.
  5. We've stopped our experimental support for language model training.
  6. You can now monitor how much GPU memory your deployments are using so you can make sure you’re using your GPUs efficiently.
  7. A collection of updates to Replicate for the week ending December 6, 2024.
  8. A collection of updates to Replicate for the week ending November 22, 2024.
  9. A collection of updates to Replicate for the week ending November 8, 2024.
  10. Python 3.7 model support ends on November 15th.
  11. A collection of updates to Replicate for the week ending October 25, 2024.
  12. Finally. You can switch it in the footer.
  13. Playground is a way to quickly try out and compare the output of models on Replicate.
  14. Documentation now has a cleaner design, better navigation, and lots of new content to help you build with Replicate.
  15. Our client libraries and API are now much faster at running models, particularly if a file is being returned.
  16. Large logs may be truncated to manage platform load.
  17. Cleaning old predictions to improve API performance and speed.
  18. The training detail page now includes a JSON metadata tab.
  19. Programmatically search for public models using the API.
  20. You no longer need to pass `"stream": true` with your prediction request to get back a stream URL - if the model supports streaming, you'll always get one back.
  21. Securely pass sensitive values to models.
  22. Disable API tokens from the web to prevent unauthorized use.
  23. We publish automated feeds about product updates and platform incidents.
  24. Delete models, versions, and deployments via web or HTTP API.
  25. Validate incoming webhooks so you know they're really coming from Replicate.
  26. You can now search for your existing deployments on the website.
  27. We've added new UI features for viewing and tracking webhooks.
  28. The API now validates JSON request body fields.
  29. T4 models now get up to 16 GB of RAM, with no changes to existing prices.
  30. Store API tokens in config, not code, for security.
  31. Bearer token used in HTTP Authorization header for access.
  32. Manage models with more control using our new deployments feature.
  33. We've added a web-based viewer for 3D model outputs.
  34. We've updated our model collections to be more task-oriented.
  35. Use webhooks to receive real-time updates about your predictions and trainings.
  36. Create custom SDXL image generation models without coding.
  37. New slider for comparing image inputs to outputs.
  38. Code snippets for various languages now available on models.
  39. You can now create models programmatically using the API.
  40. Improved training pages for fine-tuning models.
  41. View prediction parameters as JSON on the prediction detail page.
  42. New API endpoint for listing public models.
  43. Create deployments for controlled model running and customization.
  44. Prediction UUID added as query parameter on web refresh.
  45. View training logs in full-screen mode.
  46. Browser tab favicon shows prediction status.
  47. Our API now supports live language model output with SSE streams.
  48. Create multiple personal API tokens for your user account.
  49. NVIDIA A40 GPUs now supported.
  50. Model pages now show training hardware type and cost.
  51. We've launched a fine-tuning API for training custom language models.
  52. See Git commits and tags in model versions on Replicate.
  53. Website download mechanism improved for multiple and single outputs.
  54. View detailed invoice summaries with cost and model breakdowns.
  55. We've published a Swift client library for AI-powered app development.
  56. Collaborate with your team on Replicate using organizations.
  57. We've released a Node.js client library for model predictions.
  58. The "get a model" API now returns more metadata.
  59. All models have OpenAPI schema objects for their inputs and outputs.
  60. HTTP API reference is now also available as a structured OpenAPI JSON schema.
  61. Browse models on our new Explore page.
  62. API sends webhook events at different prediction lifecycle stages.
  63. API now provides complete Python example code snippets.
  64. Cancel predictions even after navigating away from the page.
  65. Install Cog on macOS with Homebrew using brew command.
  66. DreamBooth API now supports img2img with prompt and image input.
  67. Delete prediction button available on the prediction detail page.
  68. API prediction data is automatically removed after one hour.
  69. We are now publishing a dedicated changelog for product updates.
  70. Stable Diffusion now has release notes for version changes.
  71. Increased default rate limits for predictions up to 600 per second.
  72. Infrastructure improvements increase reliability and speed.
  73. Run models on NVIDIA A100s via the hardware upgrade option.
  74. Set a monthly spend limit to avoid surprise bills.
  75. API now supports webhooks for prediction completion notifications.
  76. Curated collections of models for similar tasks, starting with style transfer.
  77. Scrub through model predictions to see how they evolved.
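
Entries 30 and 31 above describe keeping API tokens in configuration rather than code, and sending them as a bearer token in the HTTP Authorization header. A minimal Python sketch of how the two might fit together follows. The environment variable name `REPLICATE_API_TOKEN`, the base URL, and the helper `build_request` are assumptions made for illustration, not details taken from this changelog. The request is only constructed, never sent.

```python
import os
import urllib.request

# Assumed base URL for the HTTP API; adjust if your setup differs.
API_BASE = "https://api.replicate.com/v1"


def build_request(path, token=None):
    """Build (but do not send) a GET request carrying a bearer token.

    The token is read from the environment unless passed explicitly,
    so it never needs to appear in source code. `build_request` is a
    hypothetical helper, not part of any official client library.
    """
    token = token or os.environ.get("REPLICATE_API_TOKEN")  # assumed variable name
    if not token:
        raise RuntimeError("Set REPLICATE_API_TOKEN in the environment")
    req = urllib.request.Request(f"{API_BASE}/{path.lstrip('/')}")
    # Bearer scheme in the Authorization header, as entry 31 describes.
    req.add_header("Authorization", f"Bearer {token}")
    return req
```

Passing the result to `urllib.request.urlopen` would send it. Keeping construction separate from sending makes it easy to check the headers without touching the network.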