尊敬的 微信汇率:1円 ≈ 0.046166 元 支付宝汇率:1円 ≈ 0.046257元 [退出登录]
SlideShare a Scribd company logo
Video 
Indexing and Retrieval 
Rachmat Wahid Saleh Insani, S.Kom 
Multimedia Database Management System - Chapter 7
Video 
A complete video consist of: 
• Subtitle. 
• Sound track. 
• Images recorded or played out continuously at a 
fixed rate. 
Multimedia Database Management System - Chapter 7
Video IR Method 
• Metadata-based method 
Structured metadata using traditional DBMS. 
• Text-based method 
Associated subtitles using IR technique. 
• Audio-based method 
Associated soundtracks using Audio IR. 
• Content-based method 
Video as collection of independent images. Video sequences 
divides into groups of similar frames. 
• Integrated approach 
Combination of two or more above methods. 
Multimedia Database Management System - Chapter 7
Shot-based Video IR 
Video is made of video shot. A shot is a sequence of 
contiguous frames have one or more following features: 
• The frames depict the same scene. 
• The frames signify a single camera operation. 
• The frames contain a distinct event. 
• The frames are chosen as a single indexable entity 
by the user. 
Multimedia Database Management System - Chapter 7
Shot-based Video IR 
Shot-based video IR consist of few steps: 
• Segment the video into shots 
This is called video temporal segmentation 
• Index each shot 
The steps: identify keyframes for each shot, and use image IR 
• Apply a similarity measurement between queries 
and video shots, then retrieve shots with high 
similarities 
Use image IR based on feature vectors in 2nd step 
Multimedia Database Management System - Chapter 7
Video Shot Detection or 
Segmentation 
Segmentation is a process for dividing a video 
sequence into shots. 
Require suitable quantitative measure that captures the 
differences between a pair of frames. 
If the differences exceeds a threshold, it may be 
interpreted as indicating a segment boundary. 
Establishing suitable differences metrics and technique 
for applying them are the key issues in automatic 
partitioning. 
Multimedia Database Management System - Chapter 7
Basic Video Segment 
Techniques 
The key issue is how to measure the frame-to-frame 
differences. There are two basic techniques: 
• Sum pixel-to-pixel differences between neighbouring 
frames 
If the sum is larger than a press threshold, a shot boundary exist 
between these two frames 
• Measures colour histogram distance between 
neighbouring frames 
Object motion causes histogram differences. If large differences is 
found, a camera break occurred 
Multimedia Database Management System - Chapter 7
Video Indexing and 
Retrieval 
Shots are needs to be represented and indexed, in 
order to locate and retrieved the shots quickly. 
We represents each shot by one or more keyframes or r 
frames (representative frames). 
Video retrieval is based on similarity between the query 
and r frames. 
Multimedia Database Management System - Chapter 7
IR Based on r Frames of 
Video Shots 
How to represent and index each shot in IR based on r 
frames? 
1. Use r frames (representative frames) to represent each 
shot. 
2. During retrieval, features on the r frames are 
compared with queries. 
3. If the frames is similar/relevant, it is presented to the 
user. 
How many representative frame(s) should be used in a 
shot, and how we select the representative frames? 
Multimedia Database Management System - Chapter 7
How Many R Frames 
Should be Used in a Shot? 
There are few methods: 
• Uses one r frames per shot. 
• Assigns the number of r frames to shots according to 
their length. 
• Divides a shot into subshots, and assign one r frame 
into each subshot. Subshot are detected based on 
changes on the content. 
Multimedia Database Management System - Chapter 7
How to Select the R 
Frames? 
There are few methods: 
• Use first frame for each segment/shot as the r frame. 
• Use a frame which is similar with average frame. 
Each pixel in this frame is the average of pixel values at the same 
grid point in all frames of the segment. 
• Use a frame whose its histogram is closest to the 
average histogram. 
The histograms of all the frames in the segment are averaged. 
Multimedia Database Management System - Chapter 7
IR Based on Motion 
Information 
• Motion information is derived from optical flow or 
motion vectors. Parameters used are: 
• Motion content 
Total amount of motion within a given video 
• Motion uniformity 
Smoothness of the motion within a video as a function of time 
• Motion panning 
Left-to-right or Right-to-left motion of the camera 
• Motion tilting 
Vertical motion component of the motion within a video 
sequence 
Multimedia Database Management System - Chapter 7
IR Based on Objects 
Object is a group of pixels that move together. 
To index the video, use the segmented objects. 
Object motion can help to construct a description of 
that motion for use in subsequent retrieval of the video 
shot. 
Object-based video IR is easy when the video is 
compressed using the MPEG-4 object-based coding 
standard. 
Multimedia Database Management System - Chapter 7
MPEG-4 Object based 
Coding Standard 
An MPEG-4 video session (VS) is a collection of one or 
more VOs. 
A VO consists of one or more video object layers (VOLs). 
Each VOL consists of an ordered sequence of snapshots 
in time called the video object planes (VOP) 
A VOP is a semantic object in the scene containing 
shape and motion information. 
Accompanying each VOP is the composition that 
indicates where and when each VOP is to be displayed 
Multimedia Database Management System - Chapter 7
MPEG-4 Object based 
Coding Standard 
To index MPEG-4 compressed video, we use the 
following parameters: 
• The birth and death frames of individual objects. 
• Global motion characteristics/camera operations 
observed in the scene. 
• Representative key frames that capture the major 
transformations each object undergoes. 
• The dominant motion characteristics of each object 
throughout its lifetime. 
Multimedia Database Management System - Chapter 7
IR Based on Metadata 
Video IR based on metadata is using conventional 
DBMS. 
Metadata such as: title, video type, directors, genre, 
etc. 
Multimedia Database Management System - Chapter 7
IR Based on Annotation 
Video IR based on annotation is using IR technique. 
Annotation is obtained in ways: 
• Video is manually interpreted and annotated. 
A time consuming task, automatic high-level video content 
understanding is currently not possible for general video. 
• Videos have associated transcripts and subtitles. 
• Speech recognition within video. 
Extract spoken words for indexing and retrieval. 
Multimedia Database Management System - Chapter 7
Integrated Video Indexing 
and Retrieval 
Combining all techniques discussed before. 
Multimedia Database Management System - Chapter 7
Effective Video Representation 
and Abstraction 
Video representation and abstraction tools have three 
main applications: 
• First, the application is video browsing 
• Second, the application is for presentation of video 
retrieval results. 
• Third, the application is to reduce network 
bandwidth requirements and delay. 
How do we organize and compactly represents the 
video? 
Multimedia Database Management System - Chapter 7
Topical or Subject 
Classification 
Organize based on subject classification. 
Multimedia Database Management System - Chapter 7
Storyboard 
A storyboard is a collection of representative frames that 
faithfully represent the main events and action in a video. 
The main difference is that a clipmap shows three-dimensional 
micons but a storyboard shows only 
representative frames. 
A storyboard is 2 orders of magnitude smaller (in storage 
requirements) than the corresponding compressed entire 
video. 
This greatly reduces the bandwidth and delay required to 
deliver the information over a network for quick preview or 
browsing. 
Multimedia Database Management System - Chapter 7
Scene Transition Graph 
The scene transition graph (STG) is a directed graph 
structure that compactly captures both the content and 
temporal flow of video. 
An STG consists of a number of nodes connected by 
directed edges. 
Each node is represented by a typical image and 
represents one or more video shots. 
Directed edges indicate the content and temporal flow 
of the video. 
Multimedia Database Management System - Chapter 7

More Related Content

What's hot

Efficient video indexing for fast motion video
Efficient video indexing for fast motion videoEfficient video indexing for fast motion video
Efficient video indexing for fast motion video
ijcga
 
Multimedia Information Retrieval: What is it, and why isn't ...
Multimedia Information Retrieval: What is it, and why isn't ...Multimedia Information Retrieval: What is it, and why isn't ...
Multimedia Information Retrieval: What is it, and why isn't ...
webhostingguy
 
Content based image retrieval using clustering Algorithm(CBIR)
Content based image retrieval using clustering Algorithm(CBIR)Content based image retrieval using clustering Algorithm(CBIR)
Content based image retrieval using clustering Algorithm(CBIR)
Raja Sekar
 
Content-based Image Retrieval
Content-based Image RetrievalContent-based Image Retrieval
Content-based Image Retrieval
University of Zurich
 
Content based image retrieval
Content based image retrievalContent based image retrieval
Content based image retrieval
rubaiyat11
 
Content-Based Image Retrieval Features: A Survey
Content-Based Image Retrieval Features: A SurveyContent-Based Image Retrieval Features: A Survey
Content-Based Image Retrieval Features: A Survey
Eswar Publications
 
CBIR For Medical Imaging...
CBIR For Medical  Imaging...CBIR For Medical  Imaging...
CBIR For Medical Imaging...
Isha Sharma
 
Ijaems apr-2016-16 Active Learning Method for Interactive Image Retrieval
Ijaems apr-2016-16 Active Learning Method for Interactive Image RetrievalIjaems apr-2016-16 Active Learning Method for Interactive Image Retrieval
Ijaems apr-2016-16 Active Learning Method for Interactive Image Retrieval
INFOGAIN PUBLICATION
 
A brief introduction to extracting information from images
A brief introduction to extracting information from imagesA brief introduction to extracting information from images
A brief introduction to extracting information from images
Jonathon Hare
 
Image Indexing and Retrieval
Image Indexing and RetrievalImage Indexing and Retrieval
Image Indexing and Retrieval
Rachmat Wahid Saleh Insani
 
Image and video annotation
Image and video annotationImage and video annotation
Image and video annotation
NityJaiswal1
 
CBIR MIni project2
CBIR MIni project2CBIR MIni project2
CBIR MIni project2
YASWANTH KALEPU
 
Content based image retrieval (cbir) using
Content based image retrieval (cbir) usingContent based image retrieval (cbir) using
Content based image retrieval (cbir) using
ijcsity
 
CONTENT BASED IMAGE RETRIEVAL SYSTEM
CONTENT BASED IMAGE RETRIEVAL SYSTEMCONTENT BASED IMAGE RETRIEVAL SYSTEM
CONTENT BASED IMAGE RETRIEVAL SYSTEM
Vamsi IV
 
OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...
OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...
OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...
Jonathon Hare
 
Content Based Image Retrieval
Content Based Image RetrievalContent Based Image Retrieval
Content Based Image Retrieval
Prem kumar
 
Shallow vs. Deep Image Representations: A Comparative Study with Enhancements...
Shallow vs. Deep Image Representations: A Comparative Study with Enhancements...Shallow vs. Deep Image Representations: A Comparative Study with Enhancements...
Shallow vs. Deep Image Representations: A Comparative Study with Enhancements...
CSCJournals
 
CBIR by deep learning
CBIR by deep learningCBIR by deep learning
CBIR by deep learning
Vigen Sahakyan
 
Color and texture based image retrieval
Color and texture based image retrievalColor and texture based image retrieval
Color and texture based image retrieval
eSAT Journals
 
Content based image retrieval(cbir)
Content based image retrieval(cbir)Content based image retrieval(cbir)
Content based image retrieval(cbir)
paddu123
 

What's hot (20)

Efficient video indexing for fast motion video
Efficient video indexing for fast motion videoEfficient video indexing for fast motion video
Efficient video indexing for fast motion video
 
Multimedia Information Retrieval: What is it, and why isn't ...
Multimedia Information Retrieval: What is it, and why isn't ...Multimedia Information Retrieval: What is it, and why isn't ...
Multimedia Information Retrieval: What is it, and why isn't ...
 
Content based image retrieval using clustering Algorithm(CBIR)
Content based image retrieval using clustering Algorithm(CBIR)Content based image retrieval using clustering Algorithm(CBIR)
Content based image retrieval using clustering Algorithm(CBIR)
 
Content-based Image Retrieval
Content-based Image RetrievalContent-based Image Retrieval
Content-based Image Retrieval
 
Content based image retrieval
Content based image retrievalContent based image retrieval
Content based image retrieval
 
Content-Based Image Retrieval Features: A Survey
Content-Based Image Retrieval Features: A SurveyContent-Based Image Retrieval Features: A Survey
Content-Based Image Retrieval Features: A Survey
 
CBIR For Medical Imaging...
CBIR For Medical  Imaging...CBIR For Medical  Imaging...
CBIR For Medical Imaging...
 
Ijaems apr-2016-16 Active Learning Method for Interactive Image Retrieval
Ijaems apr-2016-16 Active Learning Method for Interactive Image RetrievalIjaems apr-2016-16 Active Learning Method for Interactive Image Retrieval
Ijaems apr-2016-16 Active Learning Method for Interactive Image Retrieval
 
A brief introduction to extracting information from images
A brief introduction to extracting information from imagesA brief introduction to extracting information from images
A brief introduction to extracting information from images
 
Image Indexing and Retrieval
Image Indexing and RetrievalImage Indexing and Retrieval
Image Indexing and Retrieval
 
Image and video annotation
Image and video annotationImage and video annotation
Image and video annotation
 
CBIR MIni project2
CBIR MIni project2CBIR MIni project2
CBIR MIni project2
 
Content based image retrieval (cbir) using
Content based image retrieval (cbir) usingContent based image retrieval (cbir) using
Content based image retrieval (cbir) using
 
CONTENT BASED IMAGE RETRIEVAL SYSTEM
CONTENT BASED IMAGE RETRIEVAL SYSTEMCONTENT BASED IMAGE RETRIEVAL SYSTEM
CONTENT BASED IMAGE RETRIEVAL SYSTEM
 
OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...
OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...
OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...
 
Content Based Image Retrieval
Content Based Image RetrievalContent Based Image Retrieval
Content Based Image Retrieval
 
Shallow vs. Deep Image Representations: A Comparative Study with Enhancements...
Shallow vs. Deep Image Representations: A Comparative Study with Enhancements...Shallow vs. Deep Image Representations: A Comparative Study with Enhancements...
Shallow vs. Deep Image Representations: A Comparative Study with Enhancements...
 
CBIR by deep learning
CBIR by deep learningCBIR by deep learning
CBIR by deep learning
 
Color and texture based image retrieval
Color and texture based image retrievalColor and texture based image retrieval
Color and texture based image retrieval
 
Content based image retrieval(cbir)
Content based image retrieval(cbir)Content based image retrieval(cbir)
Content based image retrieval(cbir)
 

Similar to Video Indexing and Retrieval

VIDEO SUMMARIZATION: CORRELATION FOR SUMMARIZATION AND SUBTRACTION FOR RARE E...
VIDEO SUMMARIZATION: CORRELATION FOR SUMMARIZATION AND SUBTRACTION FOR RARE E...VIDEO SUMMARIZATION: CORRELATION FOR SUMMARIZATION AND SUBTRACTION FOR RARE E...
VIDEO SUMMARIZATION: CORRELATION FOR SUMMARIZATION AND SUBTRACTION FOR RARE E...
Journal For Research
 
IRJET-Feature Extraction from Video Data for Indexing and Retrieval
IRJET-Feature Extraction from Video Data for Indexing and Retrieval IRJET-Feature Extraction from Video Data for Indexing and Retrieval
IRJET-Feature Extraction from Video Data for Indexing and Retrieval
IRJET Journal
 
Key frame extraction methodology for video annotation
Key frame extraction methodology for video annotationKey frame extraction methodology for video annotation
Key frame extraction methodology for video annotation
IAEME Publication
 
Key Frame Extraction in Video Stream using Two Stage Method with Colour and S...
Key Frame Extraction in Video Stream using Two Stage Method with Colour and S...Key Frame Extraction in Video Stream using Two Stage Method with Colour and S...
Key Frame Extraction in Video Stream using Two Stage Method with Colour and S...
ijtsrd
 
Video Content Identification using Video Signature: Survey
Video Content Identification using Video Signature: SurveyVideo Content Identification using Video Signature: Survey
Video Content Identification using Video Signature: Survey
IRJET Journal
 
Dynamic Threshold in Clip Analysis and Retrieval
Dynamic Threshold in Clip Analysis and RetrievalDynamic Threshold in Clip Analysis and Retrieval
Dynamic Threshold in Clip Analysis and Retrieval
CSCJournals
 
Flip invariant video copy detection using sparse-coded features
Flip invariant video copy detection using sparse-coded featuresFlip invariant video copy detection using sparse-coded features
Flip invariant video copy detection using sparse-coded features
eSAT Publishing House
 
Key frame extraction for video summarization using motion activity descriptors
Key frame extraction for video summarization using motion activity descriptorsKey frame extraction for video summarization using motion activity descriptors
Key frame extraction for video summarization using motion activity descriptors
eSAT Journals
 
Key frame extraction for video summarization using motion activity descriptors
Key frame extraction for video summarization using motion activity descriptorsKey frame extraction for video summarization using motion activity descriptors
Key frame extraction for video summarization using motion activity descriptors
eSAT Publishing House
 
Optimal Repeated Frame Compensation Using Efficient Video Coding
Optimal Repeated Frame Compensation Using Efficient Video  CodingOptimal Repeated Frame Compensation Using Efficient Video  Coding
Optimal Repeated Frame Compensation Using Efficient Video Coding
IOSR Journals
 
Parking Surveillance Footage Summarization
Parking Surveillance Footage SummarizationParking Surveillance Footage Summarization
Parking Surveillance Footage Summarization
IRJET Journal
 
C1 mala1 akila
C1 mala1 akilaC1 mala1 akila
C1 mala1 akila
Jasline Presilda
 
Recent advances in content based video copy detection (IEEE)
Recent advances in content based video copy detection (IEEE)Recent advances in content based video copy detection (IEEE)
Recent advances in content based video copy detection (IEEE)
PACE 2.0
 
Real-Time Video Copy Detection in Big Data
Real-Time Video Copy Detection in Big DataReal-Time Video Copy Detection in Big Data
Real-Time Video Copy Detection in Big Data
IRJET Journal
 
L0956974
L0956974L0956974
L0956974
IOSR Journals
 
A Segmentation Based Sequential Pattern Matching for Efficient Video Copy Det...
A Segmentation Based Sequential Pattern Matching for Efficient Video Copy Det...A Segmentation Based Sequential Pattern Matching for Efficient Video Copy Det...
A Segmentation Based Sequential Pattern Matching for Efficient Video Copy Det...
Best Jobs
 
A Novel Approach for Compressing Surveillance System Videos
A Novel Approach for Compressing Surveillance System VideosA Novel Approach for Compressing Surveillance System Videos
A Novel Approach for Compressing Surveillance System Videos
INFOGAIN PUBLICATION
 
Video Key-Frame Extraction using Unsupervised Clustering and Mutual Comparison
Video Key-Frame Extraction using Unsupervised Clustering and Mutual ComparisonVideo Key-Frame Extraction using Unsupervised Clustering and Mutual Comparison
Video Key-Frame Extraction using Unsupervised Clustering and Mutual Comparison
CSCJournals
 
VISUAL ATTENTION BASED KEYFRAMES EXTRACTION AND VIDEO SUMMARIZATION
VISUAL ATTENTION BASED KEYFRAMES EXTRACTION AND VIDEO SUMMARIZATIONVISUAL ATTENTION BASED KEYFRAMES EXTRACTION AND VIDEO SUMMARIZATION
VISUAL ATTENTION BASED KEYFRAMES EXTRACTION AND VIDEO SUMMARIZATION
cscpconf
 
Video and animation
Video and animationVideo and animation
Video and animation
Gem WeBlog
 

Similar to Video Indexing and Retrieval (20)

VIDEO SUMMARIZATION: CORRELATION FOR SUMMARIZATION AND SUBTRACTION FOR RARE E...
VIDEO SUMMARIZATION: CORRELATION FOR SUMMARIZATION AND SUBTRACTION FOR RARE E...VIDEO SUMMARIZATION: CORRELATION FOR SUMMARIZATION AND SUBTRACTION FOR RARE E...
VIDEO SUMMARIZATION: CORRELATION FOR SUMMARIZATION AND SUBTRACTION FOR RARE E...
 
IRJET-Feature Extraction from Video Data for Indexing and Retrieval
IRJET-Feature Extraction from Video Data for Indexing and Retrieval IRJET-Feature Extraction from Video Data for Indexing and Retrieval
IRJET-Feature Extraction from Video Data for Indexing and Retrieval
 
Key frame extraction methodology for video annotation
Key frame extraction methodology for video annotationKey frame extraction methodology for video annotation
Key frame extraction methodology for video annotation
 
Key Frame Extraction in Video Stream using Two Stage Method with Colour and S...
Key Frame Extraction in Video Stream using Two Stage Method with Colour and S...Key Frame Extraction in Video Stream using Two Stage Method with Colour and S...
Key Frame Extraction in Video Stream using Two Stage Method with Colour and S...
 
Video Content Identification using Video Signature: Survey
Video Content Identification using Video Signature: SurveyVideo Content Identification using Video Signature: Survey
Video Content Identification using Video Signature: Survey
 
Dynamic Threshold in Clip Analysis and Retrieval
Dynamic Threshold in Clip Analysis and RetrievalDynamic Threshold in Clip Analysis and Retrieval
Dynamic Threshold in Clip Analysis and Retrieval
 
Flip invariant video copy detection using sparse-coded features
Flip invariant video copy detection using sparse-coded featuresFlip invariant video copy detection using sparse-coded features
Flip invariant video copy detection using sparse-coded features
 
Key frame extraction for video summarization using motion activity descriptors
Key frame extraction for video summarization using motion activity descriptorsKey frame extraction for video summarization using motion activity descriptors
Key frame extraction for video summarization using motion activity descriptors
 
Key frame extraction for video summarization using motion activity descriptors
Key frame extraction for video summarization using motion activity descriptorsKey frame extraction for video summarization using motion activity descriptors
Key frame extraction for video summarization using motion activity descriptors
 
Optimal Repeated Frame Compensation Using Efficient Video Coding
Optimal Repeated Frame Compensation Using Efficient Video  CodingOptimal Repeated Frame Compensation Using Efficient Video  Coding
Optimal Repeated Frame Compensation Using Efficient Video Coding
 
Parking Surveillance Footage Summarization
Parking Surveillance Footage SummarizationParking Surveillance Footage Summarization
Parking Surveillance Footage Summarization
 
C1 mala1 akila
C1 mala1 akilaC1 mala1 akila
C1 mala1 akila
 
Recent advances in content based video copy detection (IEEE)
Recent advances in content based video copy detection (IEEE)Recent advances in content based video copy detection (IEEE)
Recent advances in content based video copy detection (IEEE)
 
Real-Time Video Copy Detection in Big Data
Real-Time Video Copy Detection in Big DataReal-Time Video Copy Detection in Big Data
Real-Time Video Copy Detection in Big Data
 
L0956974
L0956974L0956974
L0956974
 
A Segmentation Based Sequential Pattern Matching for Efficient Video Copy Det...
A Segmentation Based Sequential Pattern Matching for Efficient Video Copy Det...A Segmentation Based Sequential Pattern Matching for Efficient Video Copy Det...
A Segmentation Based Sequential Pattern Matching for Efficient Video Copy Det...
 
A Novel Approach for Compressing Surveillance System Videos
A Novel Approach for Compressing Surveillance System VideosA Novel Approach for Compressing Surveillance System Videos
A Novel Approach for Compressing Surveillance System Videos
 
Video Key-Frame Extraction using Unsupervised Clustering and Mutual Comparison
Video Key-Frame Extraction using Unsupervised Clustering and Mutual ComparisonVideo Key-Frame Extraction using Unsupervised Clustering and Mutual Comparison
Video Key-Frame Extraction using Unsupervised Clustering and Mutual Comparison
 
VISUAL ATTENTION BASED KEYFRAMES EXTRACTION AND VIDEO SUMMARIZATION
VISUAL ATTENTION BASED KEYFRAMES EXTRACTION AND VIDEO SUMMARIZATIONVISUAL ATTENTION BASED KEYFRAMES EXTRACTION AND VIDEO SUMMARIZATION
VISUAL ATTENTION BASED KEYFRAMES EXTRACTION AND VIDEO SUMMARIZATION
 
Video and animation
Video and animationVideo and animation
Video and animation
 

More from Rachmat Wahid Saleh Insani

01 Mengenal Struktur Data
01 Mengenal Struktur Data01 Mengenal Struktur Data
01 Mengenal Struktur Data
Rachmat Wahid Saleh Insani
 
#2 LIST | PEMROGRAMAN PYTHON
#2 LIST | PEMROGRAMAN PYTHON#2 LIST | PEMROGRAMAN PYTHON
#2 LIST | PEMROGRAMAN PYTHON
Rachmat Wahid Saleh Insani
 
#1 PENGENALAN PYTHON
#1 PENGENALAN PYTHON#1 PENGENALAN PYTHON
#1 PENGENALAN PYTHON
Rachmat Wahid Saleh Insani
 
Text Indexing and Retrieval
Text Indexing and RetrievalText Indexing and Retrieval
Text Indexing and Retrieval
Rachmat Wahid Saleh Insani
 
Indexing and Retrieval of Audio
Indexing and Retrieval of AudioIndexing and Retrieval of Audio
Indexing and Retrieval of Audio
Rachmat Wahid Saleh Insani
 
Teori Probabilitas
Teori ProbabilitasTeori Probabilitas
Teori Probabilitas
Rachmat Wahid Saleh Insani
 
Certainty Factor Theory
Certainty Factor TheoryCertainty Factor Theory
Certainty Factor Theory
Rachmat Wahid Saleh Insani
 
DNS (Domain Name System)
DNS (Domain Name System)DNS (Domain Name System)
DNS (Domain Name System)
Rachmat Wahid Saleh Insani
 

More from Rachmat Wahid Saleh Insani (8)

01 Mengenal Struktur Data
01 Mengenal Struktur Data01 Mengenal Struktur Data
01 Mengenal Struktur Data
 
#2 LIST | PEMROGRAMAN PYTHON
#2 LIST | PEMROGRAMAN PYTHON#2 LIST | PEMROGRAMAN PYTHON
#2 LIST | PEMROGRAMAN PYTHON
 
#1 PENGENALAN PYTHON
#1 PENGENALAN PYTHON#1 PENGENALAN PYTHON
#1 PENGENALAN PYTHON
 
Text Indexing and Retrieval
Text Indexing and RetrievalText Indexing and Retrieval
Text Indexing and Retrieval
 
Indexing and Retrieval of Audio
Indexing and Retrieval of AudioIndexing and Retrieval of Audio
Indexing and Retrieval of Audio
 
Teori Probabilitas
Teori ProbabilitasTeori Probabilitas
Teori Probabilitas
 
Certainty Factor Theory
Certainty Factor TheoryCertainty Factor Theory
Certainty Factor Theory
 
DNS (Domain Name System)
DNS (Domain Name System)DNS (Domain Name System)
DNS (Domain Name System)
 

Recently uploaded

ScyllaDB Leaps Forward with Dor Laor, CEO of ScyllaDB
ScyllaDB Leaps Forward with Dor Laor, CEO of ScyllaDBScyllaDB Leaps Forward with Dor Laor, CEO of ScyllaDB
ScyllaDB Leaps Forward with Dor Laor, CEO of ScyllaDB
ScyllaDB
 
CNSCon 2024 Lightning Talk: Don’t Make Me Impersonate My Identity
CNSCon 2024 Lightning Talk: Don’t Make Me Impersonate My IdentityCNSCon 2024 Lightning Talk: Don’t Make Me Impersonate My Identity
CNSCon 2024 Lightning Talk: Don’t Make Me Impersonate My Identity
Cynthia Thomas
 
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdfLee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
leebarnesutopia
 
Communications Mining Series - Zero to Hero - Session 2
Communications Mining Series - Zero to Hero - Session 2Communications Mining Series - Zero to Hero - Session 2
Communications Mining Series - Zero to Hero - Session 2
DianaGray10
 
Cyber Recovery Wargame
Cyber Recovery WargameCyber Recovery Wargame
Cyber Recovery Wargame
Databarracks
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
Pablo Gómez Abajo
 
Session 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdfSession 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdf
UiPathCommunity
 
Day 4 - Excel Automation and Data Manipulation
Day 4 - Excel Automation and Data ManipulationDay 4 - Excel Automation and Data Manipulation
Day 4 - Excel Automation and Data Manipulation
UiPathCommunity
 
Northern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving | Modern Metal Trim, Nameplates and Appliance PanelsNorthern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving
 
Facilitation Skills - When to Use and Why.pptx
Facilitation Skills - When to Use and Why.pptxFacilitation Skills - When to Use and Why.pptx
Facilitation Skills - When to Use and Why.pptx
Knoldus Inc.
 
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
DanBrown980551
 
An All-Around Benchmark of the DBaaS Market
An All-Around Benchmark of the DBaaS MarketAn All-Around Benchmark of the DBaaS Market
An All-Around Benchmark of the DBaaS Market
ScyllaDB
 
ThousandEyes New Product Features and Release Highlights: June 2024
ThousandEyes New Product Features and Release Highlights: June 2024ThousandEyes New Product Features and Release Highlights: June 2024
ThousandEyes New Product Features and Release Highlights: June 2024
ThousandEyes
 
Introduction to ThousandEyes AMER Webinar
Introduction  to ThousandEyes AMER WebinarIntroduction  to ThousandEyes AMER Webinar
Introduction to ThousandEyes AMER Webinar
ThousandEyes
 
TrustArc Webinar - Your Guide for Smooth Cross-Border Data Transfers and Glob...
TrustArc Webinar - Your Guide for Smooth Cross-Border Data Transfers and Glob...TrustArc Webinar - Your Guide for Smooth Cross-Border Data Transfers and Glob...
TrustArc Webinar - Your Guide for Smooth Cross-Border Data Transfers and Glob...
TrustArc
 
Guidelines for Effective Data Visualization
Guidelines for Effective Data VisualizationGuidelines for Effective Data Visualization
Guidelines for Effective Data Visualization
UmmeSalmaM1
 
Call Girls Kochi 💯Call Us 🔝 7426014248 🔝 Independent Kochi Escorts Service Av...
Call Girls Kochi 💯Call Us 🔝 7426014248 🔝 Independent Kochi Escorts Service Av...Call Girls Kochi 💯Call Us 🔝 7426014248 🔝 Independent Kochi Escorts Service Av...
Call Girls Kochi 💯Call Us 🔝 7426014248 🔝 Independent Kochi Escorts Service Av...
dipikamodels1
 
Containers & AI - Beauty and the Beast!?!
Containers & AI - Beauty and the Beast!?!Containers & AI - Beauty and the Beast!?!
Containers & AI - Beauty and the Beast!?!
Tobias Schneck
 
Demystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through StorytellingDemystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through Storytelling
Enterprise Knowledge
 
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeckPoznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
FilipTomaszewski5
 

Recently uploaded (20)

ScyllaDB Leaps Forward with Dor Laor, CEO of ScyllaDB
ScyllaDB Leaps Forward with Dor Laor, CEO of ScyllaDBScyllaDB Leaps Forward with Dor Laor, CEO of ScyllaDB
ScyllaDB Leaps Forward with Dor Laor, CEO of ScyllaDB
 
CNSCon 2024 Lightning Talk: Don’t Make Me Impersonate My Identity
CNSCon 2024 Lightning Talk: Don’t Make Me Impersonate My IdentityCNSCon 2024 Lightning Talk: Don’t Make Me Impersonate My Identity
CNSCon 2024 Lightning Talk: Don’t Make Me Impersonate My Identity
 
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdfLee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
 
Communications Mining Series - Zero to Hero - Session 2
Communications Mining Series - Zero to Hero - Session 2Communications Mining Series - Zero to Hero - Session 2
Communications Mining Series - Zero to Hero - Session 2
 
Cyber Recovery Wargame
Cyber Recovery WargameCyber Recovery Wargame
Cyber Recovery Wargame
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
 
Session 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdfSession 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdf
 
Day 4 - Excel Automation and Data Manipulation
Day 4 - Excel Automation and Data ManipulationDay 4 - Excel Automation and Data Manipulation
Day 4 - Excel Automation and Data Manipulation
 
Northern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving | Modern Metal Trim, Nameplates and Appliance PanelsNorthern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
 
Facilitation Skills - When to Use and Why.pptx
Facilitation Skills - When to Use and Why.pptxFacilitation Skills - When to Use and Why.pptx
Facilitation Skills - When to Use and Why.pptx
 
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
 
An All-Around Benchmark of the DBaaS Market
An All-Around Benchmark of the DBaaS MarketAn All-Around Benchmark of the DBaaS Market
An All-Around Benchmark of the DBaaS Market
 
ThousandEyes New Product Features and Release Highlights: June 2024
ThousandEyes New Product Features and Release Highlights: June 2024ThousandEyes New Product Features and Release Highlights: June 2024
ThousandEyes New Product Features and Release Highlights: June 2024
 
Introduction to ThousandEyes AMER Webinar
Introduction  to ThousandEyes AMER WebinarIntroduction  to ThousandEyes AMER Webinar
Introduction to ThousandEyes AMER Webinar
 
TrustArc Webinar - Your Guide for Smooth Cross-Border Data Transfers and Glob...
TrustArc Webinar - Your Guide for Smooth Cross-Border Data Transfers and Glob...TrustArc Webinar - Your Guide for Smooth Cross-Border Data Transfers and Glob...
TrustArc Webinar - Your Guide for Smooth Cross-Border Data Transfers and Glob...
 
Guidelines for Effective Data Visualization
Guidelines for Effective Data VisualizationGuidelines for Effective Data Visualization
Guidelines for Effective Data Visualization
 
Call Girls Kochi 💯Call Us 🔝 7426014248 🔝 Independent Kochi Escorts Service Av...
Call Girls Kochi 💯Call Us 🔝 7426014248 🔝 Independent Kochi Escorts Service Av...Call Girls Kochi 💯Call Us 🔝 7426014248 🔝 Independent Kochi Escorts Service Av...
Call Girls Kochi 💯Call Us 🔝 7426014248 🔝 Independent Kochi Escorts Service Av...
 
Containers & AI - Beauty and the Beast!?!
Containers & AI - Beauty and the Beast!?!Containers & AI - Beauty and the Beast!?!
Containers & AI - Beauty and the Beast!?!
 
Demystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through StorytellingDemystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through Storytelling
 
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeckPoznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
 

Video Indexing and Retrieval

  • 1. Video Indexing and Retrieval Rachmat Wahid Saleh Insani, S.Kom Multimedia Database Management System - Chapter 7
  • 2. Video A complete video consist of: • Subtitle. • Sound track. • Images recorded or played out continuously at a fixed rate. Multimedia Database Management System - Chapter 7
  • 3. Video IR Method • Metadata-based method Structured metadata using traditional DBMS. • Text-based method Associated subtitles using IR technique. • Audio-based method Associated soundtracks using Audio IR. • Content-based method Video as collection of independent images. Video sequences divides into groups of similar frames. • Integrated approach Combination of two or more above methods. Multimedia Database Management System - Chapter 7
  • 4. Shot-based Video IR Video is made of video shot. A shot is a sequence of contiguous frames have one or more following features: • The frames depict the same scene. • The frames signify a single camera operation. • The frames contain a distinct event. • The frames are chosen as a single indexable entity by the user. Multimedia Database Management System - Chapter 7
  • 5. Shot-based Video IR Shot-based video IR consist of few steps: • Segment the video into shots This is called video temporal segmentation • Index each shot The steps: identify keyframes for each shot, and use image IR • Apply a similarity measurement between queries and video shots, then retrieve shots with high similarities Use image IR based on feature vectors in 2nd step Multimedia Database Management System - Chapter 7
  • 6. Video Shot Detection or Segmentation Segmentation is a process for dividing a video sequence into shots. Require suitable quantitative measure that captures the differences between a pair of frames. If the differences exceeds a threshold, it may be interpreted as indicating a segment boundary. Establishing suitable differences metrics and technique for applying them are the key issues in automatic partitioning. Multimedia Database Management System - Chapter 7
  • 7. Basic Video Segment Techniques The key issue is how to measure the frame-to-frame differences. There are two basic techniques: • Sum pixel-to-pixel differences between neighbouring frames If the sum is larger than a press threshold, a shot boundary exist between these two frames • Measures colour histogram distance between neighbouring frames Object motion causes histogram differences. If large differences is found, a camera break occurred Multimedia Database Management System - Chapter 7
  • 8. Video Indexing and Retrieval Shots are needs to be represented and indexed, in order to locate and retrieved the shots quickly. We represents each shot by one or more keyframes or r frames (representative frames). Video retrieval is based on similarity between the query and r frames. Multimedia Database Management System - Chapter 7
  • 9. IR Based on r Frames of Video Shots How to represent and index each shot in IR based on r frames? 1. Use r frames (representative frames) to represent each shot. 2. During retrieval, features on the r frames are compared with queries. 3. If the frames is similar/relevant, it is presented to the user. How many representative frame(s) should be used in a shot, and how we select the representative frames? Multimedia Database Management System - Chapter 7
  • 10. How Many R Frames Should be Used in a Shot? There are few methods: • Uses one r frames per shot. • Assigns the number of r frames to shots according to their length. • Divides a shot into subshots, and assign one r frame into each subshot. Subshot are detected based on changes on the content. Multimedia Database Management System - Chapter 7
  • 11. How to Select the R Frames? There are few methods: • Use first frame for each segment/shot as the r frame. • Use a frame which is similar with average frame. Each pixel in this frame is the average of pixel values at the same grid point in all frames of the segment. • Use a frame whose its histogram is closest to the average histogram. The histograms of all the frames in the segment are averaged. Multimedia Database Management System - Chapter 7
  • 12. IR Based on Motion Information • Motion information is derived from optical flow or motion vectors. Parameters used are: • Motion content Total amount of motion within a given video • Motion uniformity Smoothness of the motion within a video as a function of time • Motion panning Left-to-right or Right-to-left motion of the camera • Motion tilting Vertical motion component of the motion within a video sequence Multimedia Database Management System - Chapter 7
  • 13. IR Based on Objects Object is a group of pixels that move together. To index the video, use the segmented objects. Object motion can help to construct a description of that motion for use in subsequent retrieval of the video shot. Object-based video IR is easy when the video is compressed using the MPEG-4 object-based coding standard. Multimedia Database Management System - Chapter 7
  • 14. MPEG-4 Object based Coding Standard An MPEG-4 video session (VS) is a collection of one or more VOs. A VO consists of one or more video object layers (VOLs). Each VOL consists of an ordered sequence of snapshots in time called the video object planes (VOP) A VOP is a semantic object in the scene containing shape and motion information. Accompanying each VOP is the composition that indicates where and when each VOP is to be displayed Multimedia Database Management System - Chapter 7
  • 15. MPEG-4 Object based Coding Standard To index MPEG-4 compressed video, we use the following parameters: • The birth and death frames of individual objects. • Global motion characteristics/camera operations observed in the scene. • Representative key frames that capture the major transformations each object undergoes. • The dominant motion characteristics of each object throughout its lifetime. Multimedia Database Management System - Chapter 7
  • 16. IR Based on Metadata Video IR based on metadata is using conventional DBMS. Metadata such as: title, video type, directors, genre, etc. Multimedia Database Management System - Chapter 7
  • 17. IR Based on Annotation Video IR based on annotation is using IR technique. Annotation is obtained in ways: • Video is manually interpreted and annotated. A time consuming task, automatic high-level video content understanding is currently not possible for general video. • Videos have associated transcripts and subtitles. • Speech recognition within video. Extract spoken words for indexing and retrieval. Multimedia Database Management System - Chapter 7
  • 18. Integrated Video Indexing and Retrieval Combining all techniques discussed before. Multimedia Database Management System - Chapter 7
  • 19. Effective Video Representation and Abstraction Video representation and abstraction tools have three main applications: • First, the application is video browsing • Second, the application is for presentation of video retrieval results. • Third, the application is to reduce network bandwidth requirements and delay. How do we organize and compactly represents the video? Multimedia Database Management System - Chapter 7
  • 20. Topical or Subject Classification Organize based on subject classification. Multimedia Database Management System - Chapter 7
  • 21. Storyboard A storyboard is a collection of representative frames that faithfully represent the main events and action in a video. The main difference is that a clipmap shows three-dimensional micons but a storyboard shows only representative frames. A storyboard is 2 orders of magnitude smaller (in storage requirements) than the corresponding compressed entire video. This greatly reduces the bandwidth and delay required to deliver the information over a network for quick preview or browsing. Multimedia Database Management System - Chapter 7
  • 22. Scene Transition Graph The scene transition graph (STG) is a directed graph structure that compactly captures both the content and temporal flow of video. An STG consists of a number of nodes connected by directed edges. Each node is represented by a typical image and represents one or more video shots. Directed edges indicate the content and temporal flow of the video. Multimedia Database Management System - Chapter 7
  翻译: