Model Demos

Demos that demonstrate inference on a particular model.

284-openvoice.png

Voice tone cloning with OpenVoice and OpenVINO.

284-openvoice

GithubBinderColab
283-photo-maker.gif

Text-to-image generation using PhotoMaker and OpenVINO.

283-photo-maker

Github
228-clip-zero-shot-convert.png

Zero-shot Image Classification with SigLIP.

282-siglip-zero-shot-image-classification

GithubColab
281-kosmos2-multimodal-large-language-model.png

Kosmos-2: Multimodal Large Language Model and OpenVINO.

281-kosmos2-multimodal-large-language-model

Github
280-depth-anything.gif

Depth estimation with DepthAnything and OpenVINO.

280-depth-anything

GithubBinderColab
notebook_eye.png

Mobile language assistant with MobileVLM and OpenVINO.

279-mobilevlm-language-assistant

Github
278-stable-diffusion-ip-adapter.png

Image Generation with Stable Diffusion and IP-Adapter.

278-stable-diffusion-ip-adapter

Github
277-amused-lightweight-text-to-image.png

Lightweight image generation with aMUSEd and OpenVINO.

277-amused-lightweight-text-to-image

GithubColab
276-stable-diffusion-torchdynamo-backend.png

Image Generation with Stable Diffusion using OpenVINO TorchDynamo backend.

276-stable-diffusion-torchdynamo-backend

Github
notebook_eye.png

LLM Instruction-following pipeline with OpenVINO.

275-llm-question-answering

Github
274-efficient-sam.png

Object segmentations with EfficientSAM and OpenVINO.

274-efficient-sam

Github
notebook_eye.png

LLM-powered chatbot using Stable-Zephyr-3b and OpenVINO.

273-stable-zephyr-3b-chatbot

Github
272-paint-by-example.png

Paint by Example using Stable Diffusion and OpenVINO.

272-paint-by-example

Github
271-sdxl-turbo.png

Single step image generation using SDXL-turbo and OpenVINO.

271-sdxl-turbo

Github
269-film-slowmo.gif

Frame interpolation using FILM and OpenVINO.

269-film-slowmo

Github
notebook_eye.png

Table Question Answering using TAPAS and OpenVINO.

268-table-question-answering

GithubColab
notebook_eye.png

Automatic speech recognition using Distil-Whisper and OpenVINO.

267-distil-whisper-asr

Github
notebook_eye.png

Text Generation via Speculative Sampling, KV Caching, and OpenVINO.

266-speculative-sampling

Github
265-wuerstchen-image-generation.png

Image generation with Würstchen and OpenVINO.

265-wuerstchen-image-generation

Github
264-qrcode-monster.png

Generate creative QR codes with ControlNet QR Code Monster and OpenVINO.

264-qrcode-monster

Github
263-latent-consistency-models-image-generation.png

Image generation with Latent Consistency Model and OpenVINO.

263-latent-consistency-models-image-generation

Github
284292122-f146e16d-7233-49f7-a401-edcb714b5288.png

Text-to-Image Generation with LCM LoRA and ControlNet Conditioning.

263-lcm-lora-controlnet

Github
notebook_eye.png

SoftVC VITS Singing Voice Conversion and OpenVINO.

262-softvc-voice-conversion

Github
261-fast-segment-anything.gif

Object segmentation with FastSAM and OpenVINO.

261-fast-segment-anything

GithubBinderColab
259-decidiffusion-image-generation.png

Image generation with DeciDiffusion and OpenVINO.

259-decidiffusion-image-generation

Github
258-blip-diffusion-subject-generation.png

Subject-driven image generation and editing using BLIP Diffusion and OpenVINO.

258-blip-diffusion-subject-generation

Github
257-llava-multimodal-chatbot.png

Visual-language assistant with LLaVA and OpenVINO.

257-llava-multimodal-chatbot

Github
notebook_eye.png

Visual-language assistant with Video-LLaVA and OpenVINO.

257-videollava-multimodal-chatbot.ipynb

256-bark-text-to-audio.png

Text-to-speech generation using Bark and OpenVINO.

256-bark-text-to-audio

Github
notebook_eye.png

Create an LLM-powered RAG system using OpenVINO.

254-rag-chatbot

Github
notebook_eye.png

Create an LLM-powered Chatbot using OpenVINO.

254-llm-chatbot

Github
253-zeroscope-text2video.gif

Text-to video synthesis with ZeroScope and OpenVINO™.

253-zeroscope-text2video

Github
notebook_eye.png

Image generation with FastComposer and OpenVINO™.

252-fastcomposer-image-generation

Github
251-tiny-sd-image-generation.png

Image Generation with Tiny-SD and OpenVINO™.

251-tiny-sd-image-generation

GithubColab
250-music-generation.png

Controllable Music Generation with MusicGen and OpenVINO™.

250-music-generation

GithubBinderColab
249-oneformer-segmentation.png

Universal segmentation with OneFormer and OpenVINO™.

249-oneformer-segmentation

Github
248-stable-diffusion-xl.png

High-resolution image generation with Segmind-VegaRT and OpenVINO.

248-segmind-vegart

Github
248-stable-diffusion-xl.png

Image generation with Stable Diffusion XL and OpenVINO™.

248-ssd-b1

Github
248-stable-diffusion-xl.png

Image generation with Stable Diffusion XL and OpenVINO™.

248-stable-diffusion-xl

Github
notebook_eye.png

Identify the programming language used in an arbitrary code snippet.

247-code-language-id

GithubBinder
246-depth-estimation-videpth.png

Monocular Visual-Inertial Depth Estimation with OpenVINO™.

246-depth-estimation-videpth

Github
245-typo-detector.png

English Typo Detection in sentences with OpenVINO™.

245-typo-detector

Github
notebook_eye.png

Named entity recognition with OpenVINO™.

244-named-entity-recognition

GithubColab
243-tflite-selfie-segmentation.gif

Selfie Segmentation using TFLite and OpenVINO™.

243-tflite-selfie-segmentation

GithubBinderColab
notebook_eye.png

High-Quality Text-Free One-Shot Voice Conversion with FreeVC and OpenVINO™

242-freevc-voice-conversion

Github
241-riffusion-text-to-music.png

Text-to-Music generation using Riffusion and OpenVINO™.

241-riffusion-text-to-music

Github
notebook_eye.png

Instruction following using Databricks Dolly 2.0 and OpenVINO™.

240-dolly-2-instruction-following

Github
239-image-bind-convert.png

Binding multimodal data, using ImageBind and OpenVINO™.

239-image-bind-convert

Github
238-deep-floyd-if-optimize.png

Text-to-image generation with DeepFloyd IF and OpenVINO™.

238-deep-floyd-if-optimize

Github
237-segment-anything.png

Prompt based object segmentation mask generation, using Segment Anything and OpenVINO™.

237-segment-anything

Github
236-stable-diffusion-v2-optimum-demo.png

Text-to-image generation with Stable Diffusion v2 and OpenVINO™.

236-stable-diffusion-v2-text-to-image

Github
236-stable-diffusion-v2-optimum-demo.png

Stable Diffusion Text-to-Image Demo.

236-stable-diffusion-v2-text-to-image-demo

Github
236-stable-diffusion-v2-optimum-demo.png

Stable Diffusion v2.1 using Optimum-Intel OpenVINO.

236-stable-diffusion-v2-optimum-demo

Github
236-stable-diffusion-v2-optimum-demo.png

Stable Diffusion v2.1 using Optimum-Intel OpenVINO and multiple Intel Hardware

236-stable-diffusion-v2-optimum-demo-comparison

Github
236-stable-diffusion-v2-infinite-zoom.gif

Text-to-image generation and Infinite Zoom with Stable Diffusion v2 and OpenVINO™.

236-stable-diffusion-v2-infinite-zoom

Github
235-controlnet-stable-diffusion.png

A text-to-image generation with ControlNet Conditioning and OpenVINO™.

235-controlnet-stable-diffusion

Github
234-encodec-audio-compression.png

Audio compression with EnCodec and OpenVINO™.

234-encodec-audio-compression

Github
233-blip-convert.png

Visual Question Answering and Image Captioning using BLIP and OpenVINO.

233-blip-convert

Github
233-blip-convert.png

Post-Training Quantization and Weights Compression of OpenAI BLIP model with NNCF.

233-blip-optimize

Github
232-clip-language-saliency-map.png

Language-visual saliency with CLIP and OpenVINO™.

232-clip-language-saliency-map

GithubColab
231-instruct-pix2pix-image-editing.png

Image editing with InstructPix2Pix.

231-instruct-pix2pix-image-editing

Github
230-yolov8-object-detection.png

Optimize YOLOv8, using NNCF PTQ API.

230-yolov8-optimization

229-distilbert-sequence-classification.png

Sequence classification with OpenVINO.

229-distilbert-sequence-classification

GithubBinderColab
228-clip-zero-shot-quantize.png

Post-Training Quantization of OpenAI CLIP model with NNCF.

228-clip-zero-shot-quantize

Github
228-clip-zero-shot-convert.png

Zero-shot Image Classification with OpenAI CLIP and OpenVINO™.

228-clip-zero-shot-convert

Github
227-whisper-convert.png

Generate subtitles for video with OpenAI Whisper and OpenVINO.

227-whisper-subtitles-generation

226-yolov7-optimization.png

Optimize YOLOv7, using NNCF PTQ API.

226-yolov7-optimization

Github
225-stable-diffusion-text-to-image.png

Text-to-image generation with Stable Diffusion method.

225-stable-diffusion-text-to-image

Github
224-3D-segmentation-point-clouds.png

Process point cloud data and run 3D Part Segmentation with OpenVINO.

224-3D-segmentation-point-clouds

GithubColab
notebook_eye.png

Use pre-trained models to perform text prediction on an input sequence.

223-text-prediction

GithubColab
222-vision-image-colorization.png

Use pre-trained models to colorize black & white images using OpenVINO.

222-vision-image-colorization

GithubBinder
notebook_eye.png

Real-time translation from English to German.

221-machine-translation

GithubBinderColab
220-cross-lingual-books-alignment.png

Cross-lingual Books Alignment With Transformers and OpenVINO™

220-cross-lingual-books-alignment

GithubBinderColab
notebook_eye.png

Optimize the knowledge graph embeddings model (ConvE) with OpenVINO.

219-knowledge-graphs-conve

GithubBinderColab
218-vehicle-detection-and-recognition.png

Use pre-trained models to detect and recognize vehicles and their attributes with OpenVINO.

218-vehicle-detection-and-recognition

GithubBinder
notebook_eye.png

The attention center model with OpenVINO™

216-attention-center

GithubColab
215-image-inpainting.gif

Fill missing pixels with image in-painting.

215-image-inpainting

GithubBinder
notebook_eye.png

Grammatical error correction with OpenVINO.

214-grammar-correction

Github
213-question-answering.png

Answer your questions basing on a context.

213-question-answering

GithubBinderColab
212-pyannote-speaker-diarization.png

Run inference on speaker diarization pipeline.

212-pyannote-speaker-diarization

Github
notebook_eye.png

Run inference on speech-to-text recognition model.

211-speech-to-text

GithubBinderColab
210-slowfast-video-recognition.gif

Video Recognition using SlowFast and OpenVINO™

210-slowfast-video-recognition

GithubBinder
209-handwritten-ocr.png

OCR for handwritten simplified Chinese and Japanese.

209-handwritten-ocrn

208-optical-character-recognition.png

Annotate text on images using text recognition resnet.

208-optical-character-recognition

GithubColab
207-vision-paddlegan-superresolution.png

Upscale small images with superresolution using a PaddleGAN model.

207-vision-paddlegan-superresolution

GithubColab
206-vision-paddlegan-anime.png

Turn an image into anime using a GAN.

206-vision-paddlegan-anime

GithubColab
205-vision-background-removal.png

Background Removal Demo.

205-vision-background-removal

GithubBinderColab
204-segmenter-semantic-segmentation.png

Semantic segmentation with OpenVINO™ using Segmenter.

204-segmenter-semantic-segmentation

GithubColab
203-meter-reader.png

PaddlePaddle pre-trained models to read industrial meter’s value.

203-meter-reader

GithubBinder
202-vision-superresolution-video.gif

Turn 360p into 1080p video using a super resolution model.

202-vision-superresolution-video

GithubBinderColab
202-vision-superresolution-image.png

Upscale raw images with a super resolution model.

202-vision-superresolution-image

GithubBinderColab
201-vision-monodepth.gif

Monocular depth estimation with images and video.

201-vision-monodepth

GithubBinderColab