Model Demos

Demos that show inference on a particular model.


Frame interpolation using FILM and OpenVINO.

269-film-slowmo

Github

Table Question Answering using TAPAS and OpenVINO.

268-table-question-answering

Github | Colab

Automatic speech recognition using Distil-Whisper and OpenVINO.

267-distil-whisper-asr

Github

Text Generation via Speculative Sampling, KV Caching, and OpenVINO.

266-speculative-sampling

Github

Image generation with Würstchen and OpenVINO.

265-wuerstchen-image-generation

Github

Generate creative QR codes with ControlNet QR Code Monster and OpenVINO.

264-qrcode-monster

Github

Image generation with Latent Consistency Model and OpenVINO.

263-latent-consistency-models-image-generation

Github

SoftVC VITS Singing Voice Conversion and OpenVINO.

262-softvc-voice-conversion

Github

Object segmentation with FastSAM and OpenVINO.

261-fast-segment-anything

Github | Binder | Colab

Image generation with DeciDiffusion and OpenVINO.

259-decidiffusion-image-generation

Github

Subject-driven image generation and editing using BLIP Diffusion and OpenVINO.

258-blip-diffusion-subject-generation

Github

Visual-language assistant with LLaVA and OpenVINO.

257-llava-multimodal-chatbot

Github

Text-to-speech generation using Bark and OpenVINO.

256-bark-text-to-audio

Github

Create an LLM-powered Chatbot using OpenVINO.

254-llm-chatbot

Github

Text-to-video synthesis with ZeroScope and OpenVINO™.

253-zeroscope-text2video

Github

Image generation with FastComposer and OpenVINO™.

252-fastcomposer-image-generation

Github

Image Generation with Tiny-SD and OpenVINO™.

251-tiny-sd-image-generation

Github | Colab

Controllable Music Generation with MusicGen and OpenVINO™.

250-music-generation

Github | Binder | Colab

Universal segmentation with OneFormer and OpenVINO™.

249-oneformer-segmentation

Github

Image generation with Stable Diffusion XL and OpenVINO™.

248-stable-diffusion-xl

Github

Identify the programming language used in an arbitrary code snippet.

247-code-language-id

Github | Binder

Monocular Visual-Inertial Depth Estimation with OpenVINO™.

246-depth-estimation-videpth

Github

English Typo Detection in sentences with OpenVINO™.

245-typo-detector

Github

Named entity recognition with OpenVINO™.

244-named-entity-recognition

Github | Colab

Selfie Segmentation using TFLite and OpenVINO™.

243-tflite-selfie-segmentation

Github | Binder | Colab

High-Quality Text-Free One-Shot Voice Conversion with FreeVC and OpenVINO™.

242-freevc-voice-conversion

Github

Text-to-Music generation using Riffusion and OpenVINO™.

241-riffusion-text-to-music

Github

Instruction following using Databricks Dolly 2.0 and OpenVINO™.

240-dolly-2-instruction-following

Github

Binding multimodal data using ImageBind and OpenVINO™.

239-image-bind-convert

Github

Text-to-image generation with DeepFloyd IF and OpenVINO™.

238-deep-floyd-if-optimize

Github

Prompt-based object segmentation mask generation using Segment Anything and OpenVINO™.

237-segment-anything

Github

Text-to-image generation with Stable Diffusion v2 and OpenVINO™.

236-stable-diffusion-v2-text-to-image

Github

Stable Diffusion Text-to-Image Demo.

236-stable-diffusion-v2-text-to-image-demo

Github

Stable Diffusion v2.1 using Optimum-Intel OpenVINO.

236-stable-diffusion-v2-optimum-demo

Github

Stable Diffusion v2.1 using Optimum-Intel OpenVINO and multiple Intel hardware devices.

236-stable-diffusion-v2-optimum-demo-comparison

Github

Text-to-image generation and Infinite Zoom with Stable Diffusion v2 and OpenVINO™.

236-stable-diffusion-v2-infinite-zoom

Github

Text-to-image generation with ControlNet conditioning and OpenVINO™.

235-controlnet-stable-diffusion

Github

Audio compression with EnCodec and OpenVINO™.

234-encodec-audio-compression

Github

Visual Question Answering and Image Captioning using BLIP and OpenVINO.

233-blip-convert

Github

Post-Training Quantization and Weights Compression of OpenAI BLIP model with NNCF.

233-blip-optimize

Github

Language-visual saliency with CLIP and OpenVINO™.

232-clip-language-saliency-map

Github | Colab

Image editing with InstructPix2Pix.

231-instruct-pix2pix-image-editing

Github

Optimize YOLOv8 using the NNCF PTQ API.

230-yolov8-optimization


Sequence classification with OpenVINO.

229-distilbert-sequence-classification

Github | Binder | Colab

Post-Training Quantization of OpenAI CLIP model with NNCF.

228-clip-zero-shot-quantize

Github

Zero-shot Image Classification with OpenAI CLIP and OpenVINO™.

228-clip-zero-shot-convert

Github

Generate subtitles for video with OpenAI Whisper and OpenVINO.

227-whisper-subtitles-generation


Optimize YOLOv7 using the NNCF PTQ API.

226-yolov7-optimization

Github

Text-to-image generation with Stable Diffusion method.

225-stable-diffusion-text-to-image

Github

Process point cloud data and run 3D Part Segmentation with OpenVINO.

224-3D-segmentation-point-clouds

Github | Colab

Use pre-trained models to perform text prediction on an input sequence.

223-text-prediction

Github | Colab

Use pre-trained models to colorize black & white images using OpenVINO.

222-vision-image-colorization

Github | Binder

Real-time translation from English to German.

221-machine-translation

Github | Binder | Colab

Cross-lingual Books Alignment with Transformers and OpenVINO™.

220-cross-lingual-books-alignment

Github | Binder | Colab

Optimize the knowledge graph embeddings model (ConvE) with OpenVINO.

219-knowledge-graphs-conve

Github | Binder | Colab

Use pre-trained models to detect and recognize vehicles and their attributes with OpenVINO.

218-vehicle-detection-and-recognition

Github | Binder

Deblur images with DeblurGAN-v2.

217-vision-deblur

Github | Binder

The attention center model with OpenVINO™.

216-attention-center

Github | Colab

Fill missing pixels with image in-painting.

215-image-inpainting

Github | Binder

Grammatical error correction with OpenVINO.

214-grammar-correction

Github

Answer questions based on a given context.

213-question-answering

Github | Binder | Colab

Run inference on a speaker diarization pipeline.

212-pyannote-speaker-diarization

Github

Run inference on a speech-to-text recognition model.

211-speech-to-text

Github | Binder | Colab

Video Recognition using SlowFast and OpenVINO™.

210-slowfast-video-recognition

Github | Binder

OCR for handwritten simplified Chinese and Japanese.

209-handwritten-ocr


Annotate text on images using a text recognition ResNet model.

208-optical-character-recognition

Github | Colab

Upscale small images with super resolution using a PaddleGAN model.

207-vision-paddlegan-superresolution

Github | Colab

Turn an image into anime using a GAN.

206-vision-paddlegan-anime

Github | Colab

Semantic segmentation with OpenVINO™ using Segmenter.

204-segmenter-semantic-segmentation

Github | Colab

Use PaddlePaddle pre-trained models to read an industrial meter's value.

203-meter-reader

Github | Binder

Turn 360p into 1080p video using a super resolution model.

202-vision-superresolution-video

Github | Binder | Colab

Upscale raw images with a super resolution model.

202-vision-superresolution-image

Github | Binder | Colab

Monocular depth estimation with images and video.

201-vision-monodepth

Github | Binder | Colab