Información del autor
Autor Cheng, Wen-Huang |
Documentos disponibles escritos por este autor (7)



25th International Conference, MMM 2019, Thessaloniki, Greece, January 8–11, 2019, Proceedings, Part I / Kompatsiaris, Ioannis ; Huet, Benoit ; Mezaris, Vasileios ; Gurrin, Cathal ; Cheng, Wen-Huang ; Vrochidis, Stefanos
![]()
TÃtulo : 25th International Conference, MMM 2019, Thessaloniki, Greece, January 8–11, 2019, Proceedings, Part I Tipo de documento: documento electrónico Autores: Kompatsiaris, Ioannis, ; Huet, Benoit, ; Mezaris, Vasileios, ; Gurrin, Cathal, ; Cheng, Wen-Huang, ; Vrochidis, Stefanos, Mención de edición: 1 ed. Editorial: [s.l.] : Springer Fecha de publicación: 2019 Número de páginas: XXVI, 721 p. 260 ilustraciones, 233 ilustraciones en color. ISBN/ISSN/DL: 978-3-030-05710-7 Nota general: Libro disponible en la plataforma SpringerLink. Descarga y lectura en formatos PDF, HTML y ePub. Descarga completa o por capítulos. Idioma : Inglés (eng) Palabras clave: Sistemas multimedia Visión por computador Inteligencia artificial Sistemas de reconocimiento de patrones Sistemas de almacenamiento y recuperación de información. Software de la aplicacion Sistemas de información multimedia Reconocimiento de patrones automatizado Almacenamiento y recuperación de información Aplicaciones informáticas y de sistemas de información Clasificación: 006.7 Resumen: El conjunto de dos volúmenes LNCS 11295 y 11296 constituye las actas minuciosamente arbitradas de la 25.ª Conferencia Internacional sobre Modelado Multimedia, MMM 2019, celebrada en Salónica, Grecia, en enero de 2019. De los 172 artÃculos completos presentados, 49 fueron seleccionados para presentación oral y 47 para presentación de carteles; Además, se aceptaron 6 artÃculos de demostración, 5 artÃculos de la industria, 6 artÃculos de talleres y 6 artÃculos para Video Browser Showdown 2019. Todos los artÃculos presentados fueron cuidadosamente revisados ​​y seleccionados entre 204 presentaciones. Nota de contenido: Sentiment-aware Multi-modal Recommendation on Tourist Attractions -- SCOD: Dynamical Spatial Constraints for Object Detection -- STMP: Spatial Temporal Multi-level Proposal Network for Activity Detection -- Hierarchical Vision-Language Alignment for Video Captioning -- Task-Driven Biometric Authentication of Users in Virtual Reality (VR) Environments -- Deep Neural Network Based 3D Articulatory Movement Prediction Using Both Text and Audio Inputs -- Subjective Visual Quality Assessment of Immersive 3D Media Compressed by Open-Source Static 3D Mesh Codecs -- Joint EPC and RAN Caching of Tiled VR Videos for Mobile Networks -- Foveated Ray Tracing for VR Headsets -- Preferred Model of Adaptation to Dark for Virtual Reality Headsets -- From Movement to Events: Improving Soccer Match Annotations -- Multimodal Video Annotation for Retrieval and Discovery of Newsworthy Video in a News Verification Scenario -- Integration of Exploration and Search: A Case Study of the M^3 Model -- Face Swapping for SolvingCollateral Privacy Issues in Multimedia Analytics -- Exploring the Impact of Training Data Bias on Automatic Generation of Video Captions -- Fashion Police: Towards Semantic Indexing of Clothing Information In Surveillance Data -- CNN-Based Non-Contact Detection of Food Level in Bottles from RGB Images -- Personalized Recommendation of Photography Based on Deep Learning -- Two-level Attention with Multi-task Learning for Facial Emotion Estimation -- User Interaction for Visual Lifelog Retrieval in a Virtual Environment -- Query-by-Dancing: A Dance Music Retrieval System Based on Body-Motion Similarity -- Joint Visual-Textual Sentiment Analysis Based on Cross-modality Attention Mechanism -- Deep Hashing with Triplet Labels and Unification Binary Code Selection for Fast Image Retrieval -- Incremental Training for Face Recognition -- Character Prediction in TV Series via a Semantic Projection Network -- A Test Collection for Interactive Lifelog Retrieval -- SEPHLA: Challenges and Opportunities withinEnvironment – Personal Health Archives -- Athens Urban Soundscape (ATHUS): A dataset for urban soundscape quality recognition -- V3C - a Research Video Collection -- Image Aesthetics Assessment using Fully Convolutional Neural Networks -- Detecting tampered videos with multimedia forensics and deep learning -- Improving Robustness of Image Tampering Detection for Compression -- Audiovisual annotation procedure for multi-view field recordings -- A Robust Multi-Athlete Tracking Algorithm by Exploiting Discriminant Features and Long-Term Dependencies -- Early Identification of Oil Spills in Satellite Images Using Deep CNNs -- Point Cloud Colorization Based on Densely Annotated 3D Shape Dataset -- evolve2vec: Learning Network Representations Using Temporal Unfolding -- The Impact of Packet Loss and Google Congestion Control on QoE for WebRTC-based Mobile Multiparty Audiovisual Telemeetings -- Hierarchical Temporal Pooling for Efficient Online Action Recognition -- Generative Adversarial Networks withEnhanced Symmetric Residual Units for Single Image Super-Resolution -- 3D ResNets for 3D object classification -- Four Models for Automatic Recognition of Left and Right Eye in Fundus Images -- On the unsolved problem of Shot Boundary Detection for Music Videos -- Enhancing Scene Text Detection via Fused Semantic Segmentation Network with Attention -- Exploiting Incidence Relation Between Subgroups for Improving Clustering-Based Recommendation Model -- Hierarchical Bayesian Network based Incremental Model for Flood Prediction -- A New Female Body Segmentation and Feature Localisation Method for Image-based Anthropometry -- Greedy Salient Dictionary Learning For Activity Video Summarization -- Accelerating Topic Detection on Web for a Large-Scale Data Set via Stochastic Poisson Deconvolution -- Automatic Segmentation of Brain Tumor Images Based on Region Growing with Co-constraint -- Proposal of an Annotation Method for Integrating Musical Technique Knowledge using a GTTM Time-Span Tree -- A hierarchical level set approach to for RGBD image matting -- A Genetic Programming Approach to Integrate Multilayer CNN Features for Image Classification -- Improving Micro-Expression Recognition Accuracy using Twofold Feature Extraction -- An effective dual-fisheye lens stitching method based on feature points -- 3D Skeletal Gesture Recognition via Sparse Coding of Time-Warping Invariant Riemannian Trajectories -- Efficient Graph based Multi-View Leaning -- DANTE speaker recognition module. An efficient and robust automatic speaker searching solution for terrorism-related scenarios. Tipo de medio : Computadora Summary : The two-volume set LNCS 11295 and 11296 constitutes the thoroughly refereed proceedings of the 25th International Conference on MultiMedia Modeling, MMM 2019, held in Thessaloniki, Greece, in January 2019. Of the 172 submitted full papers, 49 were selected for oral presentation and 47 for poster presentation; in addition, 6 demonstration papers, 5 industry papers, 6 workshop papers, and 6 papers for the Video Browser Showdown 2019 were accepted. All papers presented were carefully reviewed and selected from 204 submissions. Enlace de acceso : https://link-springer-com.biblioproxy.umanizales.edu.co/referencework/10.1007/97 [...] 25th International Conference, MMM 2019, Thessaloniki, Greece, January 8–11, 2019, Proceedings, Part I [documento electrónico] / Kompatsiaris, Ioannis, ; Huet, Benoit, ; Mezaris, Vasileios, ; Gurrin, Cathal, ; Cheng, Wen-Huang, ; Vrochidis, Stefanos, . - 1 ed. . - [s.l.] : Springer, 2019 . - XXVI, 721 p. 260 ilustraciones, 233 ilustraciones en color.
ISBN : 978-3-030-05710-7
Libro disponible en la plataforma SpringerLink. Descarga y lectura en formatos PDF, HTML y ePub. Descarga completa o por capítulos.
Idioma : Inglés (eng)
Palabras clave: Sistemas multimedia Visión por computador Inteligencia artificial Sistemas de reconocimiento de patrones Sistemas de almacenamiento y recuperación de información. Software de la aplicacion Sistemas de información multimedia Reconocimiento de patrones automatizado Almacenamiento y recuperación de información Aplicaciones informáticas y de sistemas de información Clasificación: 006.7 Resumen: El conjunto de dos volúmenes LNCS 11295 y 11296 constituye las actas minuciosamente arbitradas de la 25.ª Conferencia Internacional sobre Modelado Multimedia, MMM 2019, celebrada en Salónica, Grecia, en enero de 2019. De los 172 artÃculos completos presentados, 49 fueron seleccionados para presentación oral y 47 para presentación de carteles; Además, se aceptaron 6 artÃculos de demostración, 5 artÃculos de la industria, 6 artÃculos de talleres y 6 artÃculos para Video Browser Showdown 2019. Todos los artÃculos presentados fueron cuidadosamente revisados ​​y seleccionados entre 204 presentaciones. Nota de contenido: Sentiment-aware Multi-modal Recommendation on Tourist Attractions -- SCOD: Dynamical Spatial Constraints for Object Detection -- STMP: Spatial Temporal Multi-level Proposal Network for Activity Detection -- Hierarchical Vision-Language Alignment for Video Captioning -- Task-Driven Biometric Authentication of Users in Virtual Reality (VR) Environments -- Deep Neural Network Based 3D Articulatory Movement Prediction Using Both Text and Audio Inputs -- Subjective Visual Quality Assessment of Immersive 3D Media Compressed by Open-Source Static 3D Mesh Codecs -- Joint EPC and RAN Caching of Tiled VR Videos for Mobile Networks -- Foveated Ray Tracing for VR Headsets -- Preferred Model of Adaptation to Dark for Virtual Reality Headsets -- From Movement to Events: Improving Soccer Match Annotations -- Multimodal Video Annotation for Retrieval and Discovery of Newsworthy Video in a News Verification Scenario -- Integration of Exploration and Search: A Case Study of the M^3 Model -- Face Swapping for SolvingCollateral Privacy Issues in Multimedia Analytics -- Exploring the Impact of Training Data Bias on Automatic Generation of Video Captions -- Fashion Police: Towards Semantic Indexing of Clothing Information In Surveillance Data -- CNN-Based Non-Contact Detection of Food Level in Bottles from RGB Images -- Personalized Recommendation of Photography Based on Deep Learning -- Two-level Attention with Multi-task Learning for Facial Emotion Estimation -- User Interaction for Visual Lifelog Retrieval in a Virtual Environment -- Query-by-Dancing: A Dance Music Retrieval System Based on Body-Motion Similarity -- Joint Visual-Textual Sentiment Analysis Based on Cross-modality Attention Mechanism -- Deep Hashing with Triplet Labels and Unification Binary Code Selection for Fast Image Retrieval -- Incremental Training for Face Recognition -- Character Prediction in TV Series via a Semantic Projection Network -- A Test Collection for Interactive Lifelog Retrieval -- SEPHLA: Challenges and Opportunities withinEnvironment – Personal Health Archives -- Athens Urban Soundscape (ATHUS): A dataset for urban soundscape quality recognition -- V3C - a Research Video Collection -- Image Aesthetics Assessment using Fully Convolutional Neural Networks -- Detecting tampered videos with multimedia forensics and deep learning -- Improving Robustness of Image Tampering Detection for Compression -- Audiovisual annotation procedure for multi-view field recordings -- A Robust Multi-Athlete Tracking Algorithm by Exploiting Discriminant Features and Long-Term Dependencies -- Early Identification of Oil Spills in Satellite Images Using Deep CNNs -- Point Cloud Colorization Based on Densely Annotated 3D Shape Dataset -- evolve2vec: Learning Network Representations Using Temporal Unfolding -- The Impact of Packet Loss and Google Congestion Control on QoE for WebRTC-based Mobile Multiparty Audiovisual Telemeetings -- Hierarchical Temporal Pooling for Efficient Online Action Recognition -- Generative Adversarial Networks withEnhanced Symmetric Residual Units for Single Image Super-Resolution -- 3D ResNets for 3D object classification -- Four Models for Automatic Recognition of Left and Right Eye in Fundus Images -- On the unsolved problem of Shot Boundary Detection for Music Videos -- Enhancing Scene Text Detection via Fused Semantic Segmentation Network with Attention -- Exploiting Incidence Relation Between Subgroups for Improving Clustering-Based Recommendation Model -- Hierarchical Bayesian Network based Incremental Model for Flood Prediction -- A New Female Body Segmentation and Feature Localisation Method for Image-based Anthropometry -- Greedy Salient Dictionary Learning For Activity Video Summarization -- Accelerating Topic Detection on Web for a Large-Scale Data Set via Stochastic Poisson Deconvolution -- Automatic Segmentation of Brain Tumor Images Based on Region Growing with Co-constraint -- Proposal of an Annotation Method for Integrating Musical Technique Knowledge using a GTTM Time-Span Tree -- A hierarchical level set approach to for RGBD image matting -- A Genetic Programming Approach to Integrate Multilayer CNN Features for Image Classification -- Improving Micro-Expression Recognition Accuracy using Twofold Feature Extraction -- An effective dual-fisheye lens stitching method based on feature points -- 3D Skeletal Gesture Recognition via Sparse Coding of Time-Warping Invariant Riemannian Trajectories -- Efficient Graph based Multi-View Leaning -- DANTE speaker recognition module. An efficient and robust automatic speaker searching solution for terrorism-related scenarios. Tipo de medio : Computadora Summary : The two-volume set LNCS 11295 and 11296 constitutes the thoroughly refereed proceedings of the 25th International Conference on MultiMedia Modeling, MMM 2019, held in Thessaloniki, Greece, in January 2019. Of the 172 submitted full papers, 49 were selected for oral presentation and 47 for poster presentation; in addition, 6 demonstration papers, 5 industry papers, 6 workshop papers, and 6 papers for the Video Browser Showdown 2019 were accepted. All papers presented were carefully reviewed and selected from 204 submissions. Enlace de acceso : https://link-springer-com.biblioproxy.umanizales.edu.co/referencework/10.1007/97 [...] 25th International Conference, MMM 2019, Thessaloniki, Greece, January 8–11, 2019, Proceedings, Part II / Kompatsiaris, Ioannis ; Huet, Benoit ; Mezaris, Vasileios ; Gurrin, Cathal ; Cheng, Wen-Huang ; Vrochidis, Stefanos
![]()
TÃtulo : 25th International Conference, MMM 2019, Thessaloniki, Greece, January 8–11, 2019, Proceedings, Part II Tipo de documento: documento electrónico Autores: Kompatsiaris, Ioannis, ; Huet, Benoit, ; Mezaris, Vasileios, ; Gurrin, Cathal, ; Cheng, Wen-Huang, ; Vrochidis, Stefanos, Mención de edición: 1 ed. Editorial: [s.l.] : Springer Fecha de publicación: 2019 Número de páginas: XXVI, 701 p. 287 ilustraciones, 212 ilustraciones en color. ISBN/ISSN/DL: 978-3-030-05716-9 Nota general: Libro disponible en la plataforma SpringerLink. Descarga y lectura en formatos PDF, HTML y ePub. Descarga completa o por capítulos. Idioma : Inglés (eng) Palabras clave: Sistemas multimedia Visión por computador Inteligencia artificial Sistemas de reconocimiento de patrones Sistemas de almacenamiento y recuperación de información. Software de la aplicacion Sistemas de información multimedia Reconocimiento de patrones automatizado Almacenamiento y recuperación de información Aplicaciones informáticas y de sistemas de información Clasificación: 006.7 Resumen: El conjunto de dos volúmenes LNCS 11295 y 11296 constituye las actas minuciosamente arbitradas de la 25.ª Conferencia Internacional sobre Modelado Multimedia, MMM 2019, celebrada en Salónica, Grecia, en enero de 2019. De los 172 artÃculos completos presentados, 49 fueron seleccionados para presentación oral y 47 para presentación de carteles; Además, se aceptaron 6 artÃculos de demostración, 5 artÃculos de la industria, 6 artÃculos de talleres y 6 artÃculos para Video Browser Showdown 2019. Todos los artÃculos presentados fueron cuidadosamente revisados ​​y seleccionados entre 204 presentaciones. Nota de contenido: Regular and Special Session Papers -- Industry Papers -- Demonstrations -- Video Browser Showdown -- MANPU 2019 Workshop Papers. Tipo de medio : Computadora Summary : The two-volume set LNCS 11295 and 11296 constitutes the thoroughly refereed proceedings of the 25th International Conference on MultiMedia Modeling, MMM 2019, held in Thessaloniki, Greece, in January 2019. Of the 172 submitted full papers, 49 were selected for oral presentation and 47 for poster presentation; in addition, 6 demonstration papers, 5 industry papers, 6 workshop papers, and 6 papers for the Video Browser Showdown 2019 were accepted. All papers presented were carefully reviewed and selected from 204 submissions. Enlace de acceso : https://link-springer-com.biblioproxy.umanizales.edu.co/referencework/10.1007/97 [...] 25th International Conference, MMM 2019, Thessaloniki, Greece, January 8–11, 2019, Proceedings, Part II [documento electrónico] / Kompatsiaris, Ioannis, ; Huet, Benoit, ; Mezaris, Vasileios, ; Gurrin, Cathal, ; Cheng, Wen-Huang, ; Vrochidis, Stefanos, . - 1 ed. . - [s.l.] : Springer, 2019 . - XXVI, 701 p. 287 ilustraciones, 212 ilustraciones en color.
ISBN : 978-3-030-05716-9
Libro disponible en la plataforma SpringerLink. Descarga y lectura en formatos PDF, HTML y ePub. Descarga completa o por capítulos.
Idioma : Inglés (eng)
Palabras clave: Sistemas multimedia Visión por computador Inteligencia artificial Sistemas de reconocimiento de patrones Sistemas de almacenamiento y recuperación de información. Software de la aplicacion Sistemas de información multimedia Reconocimiento de patrones automatizado Almacenamiento y recuperación de información Aplicaciones informáticas y de sistemas de información Clasificación: 006.7 Resumen: El conjunto de dos volúmenes LNCS 11295 y 11296 constituye las actas minuciosamente arbitradas de la 25.ª Conferencia Internacional sobre Modelado Multimedia, MMM 2019, celebrada en Salónica, Grecia, en enero de 2019. De los 172 artÃculos completos presentados, 49 fueron seleccionados para presentación oral y 47 para presentación de carteles; Además, se aceptaron 6 artÃculos de demostración, 5 artÃculos de la industria, 6 artÃculos de talleres y 6 artÃculos para Video Browser Showdown 2019. Todos los artÃculos presentados fueron cuidadosamente revisados ​​y seleccionados entre 204 presentaciones. Nota de contenido: Regular and Special Session Papers -- Industry Papers -- Demonstrations -- Video Browser Showdown -- MANPU 2019 Workshop Papers. Tipo de medio : Computadora Summary : The two-volume set LNCS 11295 and 11296 constitutes the thoroughly refereed proceedings of the 25th International Conference on MultiMedia Modeling, MMM 2019, held in Thessaloniki, Greece, in January 2019. Of the 172 submitted full papers, 49 were selected for oral presentation and 47 for poster presentation; in addition, 6 demonstration papers, 5 industry papers, 6 workshop papers, and 6 papers for the Video Browser Showdown 2019 were accepted. All papers presented were carefully reviewed and selected from 204 submissions. Enlace de acceso : https://link-springer-com.biblioproxy.umanizales.edu.co/referencework/10.1007/97 [...] 26th International Conference, MMM 2020, Daejeon, South Korea, January 5–8, 2020, Proceedings, Part I / Ro, Yong Man ; Cheng, Wen-Huang ; Kim, Junmo ; Chu, Wei-Ta ; Cui, Peng ; Choi, Jung-Woo ; Hu, Min-Chun ; De Neve, Wesley
![]()
TÃtulo : 26th International Conference, MMM 2020, Daejeon, South Korea, January 5–8, 2020, Proceedings, Part I Tipo de documento: documento electrónico Autores: Ro, Yong Man, ; Cheng, Wen-Huang, ; Kim, Junmo, ; Chu, Wei-Ta, ; Cui, Peng, ; Choi, Jung-Woo, ; Hu, Min-Chun, ; De Neve, Wesley, Mención de edición: 1 ed. Editorial: [s.l.] : Springer Fecha de publicación: 2020 Número de páginas: XXIX, 844 p. 461 ilustraciones, 324 ilustraciones en color. ISBN/ISSN/DL: 978-3-030-37731-1 Nota general: Libro disponible en la plataforma SpringerLink. Descarga y lectura en formatos PDF, HTML y ePub. Descarga completa o por capítulos. Idioma : Inglés (eng) Palabras clave: Sistemas multimedia Visión por computador Inteligencia artificial Software de la aplicacion Interfaces de usuario (sistemas informáticos) La interacción persona-ordenador Sistemas de información multimedia Aplicaciones informáticas y de sistemas de información Interfaces de usuario e interacción persona-computadora Clasificación: 006.7 Resumen: El conjunto de dos volúmenes LNCS 11961 y 11962 constituye las actas minuciosamente arbitradas de la 25.ª Conferencia Internacional sobre Modelado Multimedia, MMM 2020, celebrada en Daejeon, Corea del Sur, en enero de 2020. De los 171 artÃculos de investigación completos presentados, 40 fueron seleccionados para presentación oral y 46 para presentación en póster; Se seleccionaron 28 trabajos de sesiones especiales para presentación oral y 8 para presentación en póster; Además, se aceptaron 9 artÃculos de demostración y 6 artÃculos para Video Browser Showdown 2020. Los artÃculos de LNCS 11961 están organizados en las siguientes secciones temáticas: procesamiento de señales y audio; codificación y HVS; procesamiento de color y arte; detección y clasificación; rostro; procesamiento de imágenes; aprendizaje y representación del conocimiento; procesamiento de vÃdeo; papeles para carteles; Los artÃculos de LNCS 11962 están organizados en las siguientes secciones temáticas: artÃculos de carteles; Visión 3D impulsada por IA; análisis multimedia: perspectivas, herramientas y aplicaciones; conjuntos de datos multimedia para experimentación repetible; computación afectiva multimodal de datos multimedia a gran escala; análisis multimedia y multimodal en el ámbito médico y entornos generalizados; seguridad multimedia inteligente; documentos de demostración; y artÃculos de VBS. Nota de contenido: Audio and Signal Processing -- Light Field Reconstruction using Dynamically Generated Filters -- Speaker-Aware Speech Emotion Recognition by Fusing Amplitude and Phase Information -- Gen-Res-Net: a Novel Generative Model for Singing Voice Separation -- A Distinct Synthesizer Convolutional TasNet for Singing Voice Separation -- Exploiting the Importance of Personalization When Selecting Music for Relaxation -- Coding and HVS -- An Efficient Encoding Method for Video Compositing in HEVC -- VHS to HDTV Video Translation using Multi-task Adversarial Learning -- Improving Just Noticeable Difference Model by Leveraging Temporal HVS Perception Characteristics -- Down-Sampling Based Video Coding with Degradation-aware Restoration-Reconstruction Deep Neural Network -- Beyond Literal Visual Modeling: Understanding Image Metaphor based on Literal-Implied Concept Mapping -- Color Processing and Art -- Deep Palette-based Color Decomposition for Image Recoloring with Aesthetic Suggestion -- On Creating Multimedia Interfaces for Hybrid Biological-Digital Art Installations -- Image Captioning based on Visual and Semantic Attention -- An Illumination Insensitive and Structure-aware Image Color Layer Decomposition Method -- CartoonRenderer: An Instance-based Multi-Style Cartoon Image Translator -- Detection and Classification -- Multi-Condition Place Generator for Robust Place Recognition -- Guided Refine-Head for Object Detection -- Towards Accurate Panel Detection in Manga: A Combined Effort of CNN and Heuristics -- Subclass Deep Neural Networks: Re-enabling Neglected Classes in Deep Network Training for Multimedia Classification -- Automatic Material Classification using Thermal Finger Impression -- Face -- Face Attributes Recognition Based on One-way Inferential Correlation between Attributes -- Eulerian Motion Based 3DCNN Architecture for Facial Micro-expression Recognition -- Emotion Recognition with Facial Landmark Heatmaps -- One-shot Face Recognition with Feature Rectification via Adversarial Learning -- Visual Sentiment Analysis by Leveraging Local Regions and Human Faces -- Image Processing -- Prediction-error Value Ordering for High-fidelity Reversible Data Hiding -- Classroom Attention Analysis Based on Multiple Euler Angles Constraint and Head Pose Estimation -- Multi-branch Body Region Alignment Network for Person Re-Identification -- DeepStroke: Understanding Glyph Structure with Semantic Segmentation and Tabu Search -- 3D Spatial Coverage Measurement of Aerial Images -- Learning and Knowledge Representation -- Instance Image Retrieval with Generative Adversarial Training -- An Effective Way to Boost Black-box Adversarial Attack -- Crowd Knowledge Enhanced Multimodal Conversational Assistant in Travel Domain -- Improved Model Structure with Cosine Margin OIM Loss For End-to-End Person Search -- Effective Barcode Hunter via Semantic Segmentation in the Wild -- Video Processing -- Wonderful Clips of Playing Basketball: A Database forLocalizing Wonderful Actions -- Structural Pyramid Network for Cascaded Optical Flow Estimation -- Real-time Multiple Pedestrians Tracking in Multi-camera System -- Learning Multi-feature based Spatially Regularized and Scale Adaptive Correlation Filters for Visual Tracking -- Unsupervised Video Summarization via Attention-Driven Adversarial Learning -- Poster Papers -- Efficient HEVC Downscale Transcoding Based on Coding Unit Information Mapping -- Fine-grain level sports video search engine -- The Korean Sign Language Dataset for Action Recognition -- SEE-LPR: A Semantic Segmentation based End-to-End System for Unconstrained License Plate Detection and Recognition -- Action Co-Localization in an Untrimmed Video by Graph Neural Networks -- A Novel Attention Enhanced Dense Network For Image Super-Resolution -- Marine Biometric Recognition Algorithm Based on YOLOv3-GAN Network -- Multi-scale Spatial Location Preference for Semantic Segmentation -- HRTF Representation with Convolutional Auto-Encoder -- Unsupervised Feature Propagation for Video Object Detection using Generative Adversarial Networks -- OmniEyes: Analysis and Synthesis of Artistically Painted Eyes -- LDSNE: Learning Structural Network Embeddings by Encoding Local Distances -- FurcaNeXt: End-to-End Monaural Speech Separation with Dynamic Gated Dilated Temporal Convolutional Networks -- Multi-step Coding Structure of Spatial Audio Object Coding -- Thermal Face Recognition based on Transformation by Residual U-Net and Pixel Shuffle Upsampling -- K-SVD Based Point Cloud Coding for RGB-D Video Compression Using 3D Super-point Clustering -- Resolution Booster: Global Structure Preserving Stitching Method For Ultra-High Resolution Image Translation -- Cross Fusion for Egocentric Interactive Action Recognition -- Improving Brain Tumor Segmentation with Dilated Pseudo-3D Convolution and Multi-direction Fusion -- Texture-based Fast CU Size Decision and Intra Mode Decision Algorithm for VVC -- An Efficient Hierarchical Near-Duplicate Video Detection Algorithm Based on Deep Semantic Features -- Meta Transfer Learning for Adaptive Vehicle Tracking in UAV Videos -- Adversarial Query-by-Image Video Retrieval Based on Attention Mechanism -- Joint Sketch-Attribute Learning for Fine-Grained Face Synthesis -- High Accuracy Perceptual video hashing via Low-Rank decomposition and DWT -- HMM-Based Person Re-Identification in Large-scale Open Scenario -- No Reference Image Quality Assessment by Information Decomposition. Tipo de medio : Computadora Summary : The two-volume set LNCS 11961 and 11962 constitutes the thoroughly refereed proceedings of the 25th International Conference on MultiMedia Modeling, MMM 2020, held in Daejeon, South Korea, in January 2020. Of the 171 submitted full research papers, 40 papers were selected for oral presentation and 46 for poster presentation; 28 special session papers were selected for oral presentation and 8 for poster presentation; in addition, 9 demonstration papers and 6 papers for the Video Browser Showdown 2020 were accepted. The papers of LNCS 11961 are organized in the following topical sections: audio and signal processing; coding and HVS; color processing and art; detection and classification; face; image processing; learning and knowledge representation; video processing; poster papers; the papers of LNCS 11962 are organized in the following topical sections: poster papers; AI-powered 3D vision; multimedia analytics: perspectives, tools and applications; multimedia datasets for repeatable experimentation; multi-modal affective computing of large-scale multimedia data; multimedia and multimodal analytics in the medical domain and pervasive environments; intelligent multimedia security; demo papers; and VBS papers. Enlace de acceso : https://link-springer-com.biblioproxy.umanizales.edu.co/referencework/10.1007/97 [...] 26th International Conference, MMM 2020, Daejeon, South Korea, January 5–8, 2020, Proceedings, Part I [documento electrónico] / Ro, Yong Man, ; Cheng, Wen-Huang, ; Kim, Junmo, ; Chu, Wei-Ta, ; Cui, Peng, ; Choi, Jung-Woo, ; Hu, Min-Chun, ; De Neve, Wesley, . - 1 ed. . - [s.l.] : Springer, 2020 . - XXIX, 844 p. 461 ilustraciones, 324 ilustraciones en color.
ISBN : 978-3-030-37731-1
Libro disponible en la plataforma SpringerLink. Descarga y lectura en formatos PDF, HTML y ePub. Descarga completa o por capítulos.
Idioma : Inglés (eng)
Palabras clave: Sistemas multimedia Visión por computador Inteligencia artificial Software de la aplicacion Interfaces de usuario (sistemas informáticos) La interacción persona-ordenador Sistemas de información multimedia Aplicaciones informáticas y de sistemas de información Interfaces de usuario e interacción persona-computadora Clasificación: 006.7 Resumen: El conjunto de dos volúmenes LNCS 11961 y 11962 constituye las actas minuciosamente arbitradas de la 25.ª Conferencia Internacional sobre Modelado Multimedia, MMM 2020, celebrada en Daejeon, Corea del Sur, en enero de 2020. De los 171 artÃculos de investigación completos presentados, 40 fueron seleccionados para presentación oral y 46 para presentación en póster; Se seleccionaron 28 trabajos de sesiones especiales para presentación oral y 8 para presentación en póster; Además, se aceptaron 9 artÃculos de demostración y 6 artÃculos para Video Browser Showdown 2020. Los artÃculos de LNCS 11961 están organizados en las siguientes secciones temáticas: procesamiento de señales y audio; codificación y HVS; procesamiento de color y arte; detección y clasificación; rostro; procesamiento de imágenes; aprendizaje y representación del conocimiento; procesamiento de vÃdeo; papeles para carteles; Los artÃculos de LNCS 11962 están organizados en las siguientes secciones temáticas: artÃculos de carteles; Visión 3D impulsada por IA; análisis multimedia: perspectivas, herramientas y aplicaciones; conjuntos de datos multimedia para experimentación repetible; computación afectiva multimodal de datos multimedia a gran escala; análisis multimedia y multimodal en el ámbito médico y entornos generalizados; seguridad multimedia inteligente; documentos de demostración; y artÃculos de VBS. Nota de contenido: Audio and Signal Processing -- Light Field Reconstruction using Dynamically Generated Filters -- Speaker-Aware Speech Emotion Recognition by Fusing Amplitude and Phase Information -- Gen-Res-Net: a Novel Generative Model for Singing Voice Separation -- A Distinct Synthesizer Convolutional TasNet for Singing Voice Separation -- Exploiting the Importance of Personalization When Selecting Music for Relaxation -- Coding and HVS -- An Efficient Encoding Method for Video Compositing in HEVC -- VHS to HDTV Video Translation using Multi-task Adversarial Learning -- Improving Just Noticeable Difference Model by Leveraging Temporal HVS Perception Characteristics -- Down-Sampling Based Video Coding with Degradation-aware Restoration-Reconstruction Deep Neural Network -- Beyond Literal Visual Modeling: Understanding Image Metaphor based on Literal-Implied Concept Mapping -- Color Processing and Art -- Deep Palette-based Color Decomposition for Image Recoloring with Aesthetic Suggestion -- On Creating Multimedia Interfaces for Hybrid Biological-Digital Art Installations -- Image Captioning based on Visual and Semantic Attention -- An Illumination Insensitive and Structure-aware Image Color Layer Decomposition Method -- CartoonRenderer: An Instance-based Multi-Style Cartoon Image Translator -- Detection and Classification -- Multi-Condition Place Generator for Robust Place Recognition -- Guided Refine-Head for Object Detection -- Towards Accurate Panel Detection in Manga: A Combined Effort of CNN and Heuristics -- Subclass Deep Neural Networks: Re-enabling Neglected Classes in Deep Network Training for Multimedia Classification -- Automatic Material Classification using Thermal Finger Impression -- Face -- Face Attributes Recognition Based on One-way Inferential Correlation between Attributes -- Eulerian Motion Based 3DCNN Architecture for Facial Micro-expression Recognition -- Emotion Recognition with Facial Landmark Heatmaps -- One-shot Face Recognition with Feature Rectification via Adversarial Learning -- Visual Sentiment Analysis by Leveraging Local Regions and Human Faces -- Image Processing -- Prediction-error Value Ordering for High-fidelity Reversible Data Hiding -- Classroom Attention Analysis Based on Multiple Euler Angles Constraint and Head Pose Estimation -- Multi-branch Body Region Alignment Network for Person Re-Identification -- DeepStroke: Understanding Glyph Structure with Semantic Segmentation and Tabu Search -- 3D Spatial Coverage Measurement of Aerial Images -- Learning and Knowledge Representation -- Instance Image Retrieval with Generative Adversarial Training -- An Effective Way to Boost Black-box Adversarial Attack -- Crowd Knowledge Enhanced Multimodal Conversational Assistant in Travel Domain -- Improved Model Structure with Cosine Margin OIM Loss For End-to-End Person Search -- Effective Barcode Hunter via Semantic Segmentation in the Wild -- Video Processing -- Wonderful Clips of Playing Basketball: A Database forLocalizing Wonderful Actions -- Structural Pyramid Network for Cascaded Optical Flow Estimation -- Real-time Multiple Pedestrians Tracking in Multi-camera System -- Learning Multi-feature based Spatially Regularized and Scale Adaptive Correlation Filters for Visual Tracking -- Unsupervised Video Summarization via Attention-Driven Adversarial Learning -- Poster Papers -- Efficient HEVC Downscale Transcoding Based on Coding Unit Information Mapping -- Fine-grain level sports video search engine -- The Korean Sign Language Dataset for Action Recognition -- SEE-LPR: A Semantic Segmentation based End-to-End System for Unconstrained License Plate Detection and Recognition -- Action Co-Localization in an Untrimmed Video by Graph Neural Networks -- A Novel Attention Enhanced Dense Network For Image Super-Resolution -- Marine Biometric Recognition Algorithm Based on YOLOv3-GAN Network -- Multi-scale Spatial Location Preference for Semantic Segmentation -- HRTF Representation with Convolutional Auto-Encoder -- Unsupervised Feature Propagation for Video Object Detection using Generative Adversarial Networks -- OmniEyes: Analysis and Synthesis of Artistically Painted Eyes -- LDSNE: Learning Structural Network Embeddings by Encoding Local Distances -- FurcaNeXt: End-to-End Monaural Speech Separation with Dynamic Gated Dilated Temporal Convolutional Networks -- Multi-step Coding Structure of Spatial Audio Object Coding -- Thermal Face Recognition based on Transformation by Residual U-Net and Pixel Shuffle Upsampling -- K-SVD Based Point Cloud Coding for RGB-D Video Compression Using 3D Super-point Clustering -- Resolution Booster: Global Structure Preserving Stitching Method For Ultra-High Resolution Image Translation -- Cross Fusion for Egocentric Interactive Action Recognition -- Improving Brain Tumor Segmentation with Dilated Pseudo-3D Convolution and Multi-direction Fusion -- Texture-based Fast CU Size Decision and Intra Mode Decision Algorithm for VVC -- An Efficient Hierarchical Near-Duplicate Video Detection Algorithm Based on Deep Semantic Features -- Meta Transfer Learning for Adaptive Vehicle Tracking in UAV Videos -- Adversarial Query-by-Image Video Retrieval Based on Attention Mechanism -- Joint Sketch-Attribute Learning for Fine-Grained Face Synthesis -- High Accuracy Perceptual video hashing via Low-Rank decomposition and DWT -- HMM-Based Person Re-Identification in Large-scale Open Scenario -- No Reference Image Quality Assessment by Information Decomposition. Tipo de medio : Computadora Summary : The two-volume set LNCS 11961 and 11962 constitutes the thoroughly refereed proceedings of the 25th International Conference on MultiMedia Modeling, MMM 2020, held in Daejeon, South Korea, in January 2020. Of the 171 submitted full research papers, 40 papers were selected for oral presentation and 46 for poster presentation; 28 special session papers were selected for oral presentation and 8 for poster presentation; in addition, 9 demonstration papers and 6 papers for the Video Browser Showdown 2020 were accepted. The papers of LNCS 11961 are organized in the following topical sections: audio and signal processing; coding and HVS; color processing and art; detection and classification; face; image processing; learning and knowledge representation; video processing; poster papers; the papers of LNCS 11962 are organized in the following topical sections: poster papers; AI-powered 3D vision; multimedia analytics: perspectives, tools and applications; multimedia datasets for repeatable experimentation; multi-modal affective computing of large-scale multimedia data; multimedia and multimodal analytics in the medical domain and pervasive environments; intelligent multimedia security; demo papers; and VBS papers. Enlace de acceso : https://link-springer-com.biblioproxy.umanizales.edu.co/referencework/10.1007/97 [...] 26th International Conference, MMM 2020, Daejeon, South Korea, January 5–8, 2020, Proceedings, Part II / Ro, Yong Man ; Cheng, Wen-Huang ; Kim, Junmo ; Chu, Wei-Ta ; Cui, Peng ; Choi, Jung-Woo ; Hu, Min-Chun ; De Neve, Wesley
![]()
TÃtulo : 26th International Conference, MMM 2020, Daejeon, South Korea, January 5–8, 2020, Proceedings, Part II Tipo de documento: documento electrónico Autores: Ro, Yong Man, ; Cheng, Wen-Huang, ; Kim, Junmo, ; Chu, Wei-Ta, ; Cui, Peng, ; Choi, Jung-Woo, ; Hu, Min-Chun, ; De Neve, Wesley, Mención de edición: 1 ed. Editorial: [s.l.] : Springer Fecha de publicación: 2020 Número de páginas: XXX, 820 p. 385 ilustraciones, 271 ilustraciones en color. ISBN/ISSN/DL: 978-3-030-37734-2 Nota general: Libro disponible en la plataforma SpringerLink. Descarga y lectura en formatos PDF, HTML y ePub. Descarga completa o por capítulos. Idioma : Inglés (eng) Palabras clave: Sistemas multimedia Visión por computador Inteligencia artificial Software de la aplicacion Interfaces de usuario (sistemas informáticos) La interacción persona-ordenador Sistemas de información multimedia Aplicaciones informáticas y de sistemas de información Interfaces de usuario e interacción persona-computadora Clasificación: 006.7 Resumen: El conjunto de dos volúmenes LNCS 11961 y 11962 constituye las actas minuciosamente arbitradas de la 25.ª Conferencia Internacional sobre Modelado Multimedia, MMM 2020, celebrada en Daejeon, Corea del Sur, en enero de 2020. De los 171 artÃculos de investigación completos presentados, 40 fueron seleccionados para presentación oral y 46 para presentación en póster; Se seleccionaron 28 trabajos de sesiones especiales para presentación oral y 8 para presentación en póster; Además, se aceptaron 9 artÃculos de demostración y 6 artÃculos para Video Browser Showdown 2020. Los artÃculos de LNCS 11961 están organizados en las siguientes secciones temáticas: procesamiento de señales y audio; codificación y HVS; procesamiento de color y arte; detección y clasificación; rostro; procesamiento de imágenes; aprendizaje y representación del conocimiento; procesamiento de vÃdeo; papeles para carteles; Los artÃculos de LNCS 11962 están organizados en las siguientes secciones temáticas: artÃculos de carteles; Visión 3D impulsada por IA; análisis multimedia: perspectivas, herramientas y aplicaciones; conjuntos de datos multimedia para experimentación repetible; computación afectiva multimodal de datos multimedia a gran escala; análisis multimedia y multimodal en el ámbito médico y entornos generalizados; seguridad multimedia inteligente; documentos de demostración; y artÃculos de VBS. Nota de contenido: Poster Papers -- Multi-Scale Comparison Network for Few-Shot Learning -- Semantic and Morphological Information guided Chinese Text Classification -- A Delay-aware Adaptation Framework for Cloud Gaming under the Computation Constraint of User Devices -- Efficient Edge Caching for High-Quality 360-Degree Video Delivery -- Inferring Emphasis for Real Voice Data: an Attentive Multimodal Neural Network Approach -- PRIME: Block-wise Missingness Handling for Multi-modalities in Intelligent Tutoring Systems -- A New Local Transformation Module for Few-shot Segmentation -- Background Segmentation for Vehicle Re-Identification -- Face Tells Detailed Expression: Generating Comprehensive Facial Expression Sentence through Facial Action Units -- A Deep Convolutional Deblurring and Detection Neural Network for Localizing Text in Videos -- Generate images with obfuscated attributes for private image classifcation -- Context-Aware Residual Network with Promotion Gates for Single Image Super-Resolution -- A Compact Deep Neural Network for Single Image Super-Resolution -- An Efficient Algorithm of Facial Expression Recognition by TSG-RNN Network -- Structured Neural Motifs: Scene Graph Parsing via Enhanced Context -- Perceptual Localization of Virtual Sound Source Based on Loudspeaker Triplet -- TK-Text: Multi-shaped Scene Text Detection via Instance Segmentation -- More-Natural Mimetic Words Generation for Fine-grained Gait Description -- Lite Hourglass Network for Multi-person Pose Estimation -- SS1: AI-Powered 3D Vision -- Single View Depth Estimation via Dense Convolution Network with Self-supervision -- Multi-Data UAV Images for Large Scale Reconstruction of Buildings -- Deformed Phase Prediction Using SVM for Structured Light Depth Generation -- Extraction of Multi-class Multi-instance Geometric Primitives from Point Clouds Using Energy Minimization -- Similarity Graph Convolutional Construction Network for Interactive Action Recognition -- Content-Aware Cubemap Projection for Panoramic Image via Deep Q-Learning -- Robust RGB-D Data Registration Based on Correntropy and Bi-directional Distance -- InSphereNet: a Concise Representation and Classification Method for 3D Object -- 3-D Oral Shape Retrieval Using Registration Algorithm -- Face Super-Resolution by Learning Multi-view Texture Compensation -- Light Field Salient Object Detection via Hybrid Priors -- SS2: Multimedia Analytics: Perspectives, Tools and Applications -- Multimedia Analytics Challenges and Opportunities for Creating Interactive Radio Content -- Interactive Search and Exploration in Discussion Forums Using Multimodal Embeddings -- An inverse mapping with manifold alignment for zero-shot learning -- Baseline Analysis of a Conventional and Virtual Reality Lifelog Retrieval System -- An Extensible Framework for Interactive Real-time Visualizations of Large-scale Heterogeneous Multimedia Information from Online Sources -- SS3: MDRE: Multimedia Datasets for Repeatable Experimentation -- GLENDA: Gynecologic Laparoscopy Endometriosis Dataset -- Kvasir-SEG: A Segmented Polyp Dataset -- Rethinking the Test Collection Methodology for Personal Self-Tracking Data -- Experiences and Insights from the Collection of a Novel Multimedia EEG Dataset -- SS4: MMAC: Multi-Modal Affective Computing of Large-Scale Multimedia Data -- Relation Modeling with Graph Convolutional Networks for Facial Action Unit Detection -- Enhanced Gaze Following via Object Detection and Human Pose Estimation -- Region Based Adversarial Synthesis of Facial Action Units -- Facial Expression Restoration Based on Improved Graph Convolutional Networks -- Global Affective Video Content Regression Based on Complementary Audio-Visual Features -- SS5: MULTIMED: Multimedia and Multimodal Analytics in the Medical Domain and Pervasive Environments -- Using Publicly Available Medical Images from the Open Access Literature and Social Networks for Model Training and Knowledge Extraction -- AttenNet: Deep Attention based Retinal Disease Classification in OCT Images -- NOVA: A Tool for Explanatory Multimodal Behavior Analysis and its Application to Psychotherapy -- Instrument Recognition in Laparoscopy for Technical Skill Assessment -- Real-time Recognition of Daily Actions Based on 3D Joint Movements and Fisher Encoding -- Model-based and Class-based Fusion of Multisensor Data -- Evaluating the Generalization Performance of Instrument Classification in Cataract Surgery Videos -- SS6: Intelligent Multimedia Security -- Compact Position-aware Attention Network for Image Semantic Segmentation -- Law is Order: Protecting Multimedia Network Transmission by Game Theory and Mechanism Design -- Rational Delegation Computing Using Information Theory and Game Theory Approach -- Multi-hop Interactive Cross-modal Retrieval -- Demo Papers -- Browsing Visual Sentiment Datasets using Psycholinguistic Groundings -- Framework Design for Multiplayer Motion Sensing Game in Mixture Reality -- Lyrics-Conditioned Neural Melody Generation -- A Web-based Visualization Tool for 3D Spatial Coverage Measurement of Aerial Images -- An Attention Based Speaker-Independent Audio-Visual Deep Learning Model for Speech Enhancement -- DIME: An Online Tool for the Visual Comparison of Cross-Modal Retrieval Models -- Real-time Demonstration of Personal Audio and 3D Audio Rendering Using Line Array Systems -- CNN-based Multi-Scale Super-Resolution Architecture on FPGA for 4K/8K UHD Applications -- Effective Utilization of Hybrid Residual Modules in Deep Neural Networks for Super Resolution -- VBS Papers -- diveXplore 4.0: The ITEC Deep Interactive Video Exploration System at VBS2020 -- Combining Boolean and Multimedia Retrieval in vitrivr for Large-Scale Video Search -- An Interactive Video Search Platform for Multi-modal Retrieval with Advanced Concepts -- VIREO @ Video Browser Showdown 2020 -- VERGE in VBS 2020 -- VIRET at Video Browser Showdown 2020 -- SOM-Hunter: Video Browsing with Relevance-to-SOM Feedback Loop -- Exquisitor at the Video Browser Showdown 2020 -- Deep Learning-Based Video Retrieval using Object Relationships and Associated Audio Classes -- IVIST: Interactive Video Search Tool in VBS 2020. Tipo de medio : Computadora Summary : The two-volume set LNCS 11961 and 11962 constitutes the thoroughly refereed proceedings of the 25th International Conference on MultiMedia Modeling, MMM 2020, held in Daejeon, South Korea, in January 2020. Of the 171 submitted full research papers, 40 papers were selected for oral presentation and 46 for poster presentation; 28 special session papers were selected for oral presentation and 8 for poster presentation; in addition, 9 demonstration papers and 6 papers for the Video Browser Showdown 2020 were accepted. The papers of LNCS 11961 are organized in the following topical sections: audio and signal processing; coding and HVS; color processing and art; detection and classification; face; image processing; learning and knowledge representation; video processing; poster papers; the papers of LNCS 11962 are organized in the following topical sections: poster papers; AI-powered 3D vision; multimedia analytics: perspectives, tools and applications; multimedia datasets for repeatable experimentation; multi-modal affective computing of large-scale multimedia data; multimedia and multimodal analytics in the medical domain and pervasive environments; intelligent multimedia security; demo papers; and VBS papers. Enlace de acceso : https://link-springer-com.biblioproxy.umanizales.edu.co/referencework/10.1007/97 [...] 26th International Conference, MMM 2020, Daejeon, South Korea, January 5–8, 2020, Proceedings, Part II [documento electrónico] / Ro, Yong Man, ; Cheng, Wen-Huang, ; Kim, Junmo, ; Chu, Wei-Ta, ; Cui, Peng, ; Choi, Jung-Woo, ; Hu, Min-Chun, ; De Neve, Wesley, . - 1 ed. . - [s.l.] : Springer, 2020 . - XXX, 820 p. 385 ilustraciones, 271 ilustraciones en color.
ISBN : 978-3-030-37734-2
Libro disponible en la plataforma SpringerLink. Descarga y lectura en formatos PDF, HTML y ePub. Descarga completa o por capítulos.
Idioma : Inglés (eng)
Palabras clave: Sistemas multimedia Visión por computador Inteligencia artificial Software de la aplicacion Interfaces de usuario (sistemas informáticos) La interacción persona-ordenador Sistemas de información multimedia Aplicaciones informáticas y de sistemas de información Interfaces de usuario e interacción persona-computadora Clasificación: 006.7 Resumen: El conjunto de dos volúmenes LNCS 11961 y 11962 constituye las actas minuciosamente arbitradas de la 25.ª Conferencia Internacional sobre Modelado Multimedia, MMM 2020, celebrada en Daejeon, Corea del Sur, en enero de 2020. De los 171 artÃculos de investigación completos presentados, 40 fueron seleccionados para presentación oral y 46 para presentación en póster; Se seleccionaron 28 trabajos de sesiones especiales para presentación oral y 8 para presentación en póster; Además, se aceptaron 9 artÃculos de demostración y 6 artÃculos para Video Browser Showdown 2020. Los artÃculos de LNCS 11961 están organizados en las siguientes secciones temáticas: procesamiento de señales y audio; codificación y HVS; procesamiento de color y arte; detección y clasificación; rostro; procesamiento de imágenes; aprendizaje y representación del conocimiento; procesamiento de vÃdeo; papeles para carteles; Los artÃculos de LNCS 11962 están organizados en las siguientes secciones temáticas: artÃculos de carteles; Visión 3D impulsada por IA; análisis multimedia: perspectivas, herramientas y aplicaciones; conjuntos de datos multimedia para experimentación repetible; computación afectiva multimodal de datos multimedia a gran escala; análisis multimedia y multimodal en el ámbito médico y entornos generalizados; seguridad multimedia inteligente; documentos de demostración; y artÃculos de VBS. Nota de contenido: Poster Papers -- Multi-Scale Comparison Network for Few-Shot Learning -- Semantic and Morphological Information guided Chinese Text Classification -- A Delay-aware Adaptation Framework for Cloud Gaming under the Computation Constraint of User Devices -- Efficient Edge Caching for High-Quality 360-Degree Video Delivery -- Inferring Emphasis for Real Voice Data: an Attentive Multimodal Neural Network Approach -- PRIME: Block-wise Missingness Handling for Multi-modalities in Intelligent Tutoring Systems -- A New Local Transformation Module for Few-shot Segmentation -- Background Segmentation for Vehicle Re-Identification -- Face Tells Detailed Expression: Generating Comprehensive Facial Expression Sentence through Facial Action Units -- A Deep Convolutional Deblurring and Detection Neural Network for Localizing Text in Videos -- Generate images with obfuscated attributes for private image classifcation -- Context-Aware Residual Network with Promotion Gates for Single Image Super-Resolution -- A Compact Deep Neural Network for Single Image Super-Resolution -- An Efficient Algorithm of Facial Expression Recognition by TSG-RNN Network -- Structured Neural Motifs: Scene Graph Parsing via Enhanced Context -- Perceptual Localization of Virtual Sound Source Based on Loudspeaker Triplet -- TK-Text: Multi-shaped Scene Text Detection via Instance Segmentation -- More-Natural Mimetic Words Generation for Fine-grained Gait Description -- Lite Hourglass Network for Multi-person Pose Estimation -- SS1: AI-Powered 3D Vision -- Single View Depth Estimation via Dense Convolution Network with Self-supervision -- Multi-Data UAV Images for Large Scale Reconstruction of Buildings -- Deformed Phase Prediction Using SVM for Structured Light Depth Generation -- Extraction of Multi-class Multi-instance Geometric Primitives from Point Clouds Using Energy Minimization -- Similarity Graph Convolutional Construction Network for Interactive Action Recognition -- Content-Aware Cubemap Projection for Panoramic Image via Deep Q-Learning -- Robust RGB-D Data Registration Based on Correntropy and Bi-directional Distance -- InSphereNet: a Concise Representation and Classification Method for 3D Object -- 3-D Oral Shape Retrieval Using Registration Algorithm -- Face Super-Resolution by Learning Multi-view Texture Compensation -- Light Field Salient Object Detection via Hybrid Priors -- SS2: Multimedia Analytics: Perspectives, Tools and Applications -- Multimedia Analytics Challenges and Opportunities for Creating Interactive Radio Content -- Interactive Search and Exploration in Discussion Forums Using Multimodal Embeddings -- An inverse mapping with manifold alignment for zero-shot learning -- Baseline Analysis of a Conventional and Virtual Reality Lifelog Retrieval System -- An Extensible Framework for Interactive Real-time Visualizations of Large-scale Heterogeneous Multimedia Information from Online Sources -- SS3: MDRE: Multimedia Datasets for Repeatable Experimentation -- GLENDA: Gynecologic Laparoscopy Endometriosis Dataset -- Kvasir-SEG: A Segmented Polyp Dataset -- Rethinking the Test Collection Methodology for Personal Self-Tracking Data -- Experiences and Insights from the Collection of a Novel Multimedia EEG Dataset -- SS4: MMAC: Multi-Modal Affective Computing of Large-Scale Multimedia Data -- Relation Modeling with Graph Convolutional Networks for Facial Action Unit Detection -- Enhanced Gaze Following via Object Detection and Human Pose Estimation -- Region Based Adversarial Synthesis of Facial Action Units -- Facial Expression Restoration Based on Improved Graph Convolutional Networks -- Global Affective Video Content Regression Based on Complementary Audio-Visual Features -- SS5: MULTIMED: Multimedia and Multimodal Analytics in the Medical Domain and Pervasive Environments -- Using Publicly Available Medical Images from the Open Access Literature and Social Networks for Model Training and Knowledge Extraction -- AttenNet: Deep Attention based Retinal Disease Classification in OCT Images -- NOVA: A Tool for Explanatory Multimodal Behavior Analysis and its Application to Psychotherapy -- Instrument Recognition in Laparoscopy for Technical Skill Assessment -- Real-time Recognition of Daily Actions Based on 3D Joint Movements and Fisher Encoding -- Model-based and Class-based Fusion of Multisensor Data -- Evaluating the Generalization Performance of Instrument Classification in Cataract Surgery Videos -- SS6: Intelligent Multimedia Security -- Compact Position-aware Attention Network for Image Semantic Segmentation -- Law is Order: Protecting Multimedia Network Transmission by Game Theory and Mechanism Design -- Rational Delegation Computing Using Information Theory and Game Theory Approach -- Multi-hop Interactive Cross-modal Retrieval -- Demo Papers -- Browsing Visual Sentiment Datasets using Psycholinguistic Groundings -- Framework Design for Multiplayer Motion Sensing Game in Mixture Reality -- Lyrics-Conditioned Neural Melody Generation -- A Web-based Visualization Tool for 3D Spatial Coverage Measurement of Aerial Images -- An Attention Based Speaker-Independent Audio-Visual Deep Learning Model for Speech Enhancement -- DIME: An Online Tool for the Visual Comparison of Cross-Modal Retrieval Models -- Real-time Demonstration of Personal Audio and 3D Audio Rendering Using Line Array Systems -- CNN-based Multi-Scale Super-Resolution Architecture on FPGA for 4K/8K UHD Applications -- Effective Utilization of Hybrid Residual Modules in Deep Neural Networks for Super Resolution -- VBS Papers -- diveXplore 4.0: The ITEC Deep Interactive Video Exploration System at VBS2020 -- Combining Boolean and Multimedia Retrieval in vitrivr for Large-Scale Video Search -- An Interactive Video Search Platform for Multi-modal Retrieval with Advanced Concepts -- VIREO @ Video Browser Showdown 2020 -- VERGE in VBS 2020 -- VIRET at Video Browser Showdown 2020 -- SOM-Hunter: Video Browsing with Relevance-to-SOM Feedback Loop -- Exquisitor at the Video Browser Showdown 2020 -- Deep Learning-Based Video Retrieval using Object Relationships and Associated Audio Classes -- IVIST: Interactive Video Search Tool in VBS 2020. Tipo de medio : Computadora Summary : The two-volume set LNCS 11961 and 11962 constitutes the thoroughly refereed proceedings of the 25th International Conference on MultiMedia Modeling, MMM 2020, held in Daejeon, South Korea, in January 2020. Of the 171 submitted full research papers, 40 papers were selected for oral presentation and 46 for poster presentation; 28 special session papers were selected for oral presentation and 8 for poster presentation; in addition, 9 demonstration papers and 6 papers for the Video Browser Showdown 2020 were accepted. The papers of LNCS 11961 are organized in the following topical sections: audio and signal processing; coding and HVS; color processing and art; detection and classification; face; image processing; learning and knowledge representation; video processing; poster papers; the papers of LNCS 11962 are organized in the following topical sections: poster papers; AI-powered 3D vision; multimedia analytics: perspectives, tools and applications; multimedia datasets for repeatable experimentation; multi-modal affective computing of large-scale multimedia data; multimedia and multimodal analytics in the medical domain and pervasive environments; intelligent multimedia security; demo papers; and VBS papers. Enlace de acceso : https://link-springer-com.biblioproxy.umanizales.edu.co/referencework/10.1007/97 [...] Advances in Multimedia Information Processing – PCM 2018 / Hong, Richang ; Cheng, Wen-Huang ; Yamasaki, Toshihiko ; Wang, Meng ; Ngo, Chong-Wah
![]()
TÃtulo : Advances in Multimedia Information Processing – PCM 2018 : 19th Pacific-Rim Conference on Multimedia, Hefei, China, September 21-22, 2018, Proceedings, Part I Tipo de documento: documento electrónico Autores: Hong, Richang, ; Cheng, Wen-Huang, ; Yamasaki, Toshihiko, ; Wang, Meng, ; Ngo, Chong-Wah, Mención de edición: 1 ed. Editorial: [s.l.] : Springer Fecha de publicación: 2018 Número de páginas: XXX, 897 p. 372 ilustraciones ISBN/ISSN/DL: 978-3-030-00776-8 Nota general: Libro disponible en la plataforma SpringerLink. Descarga y lectura en formatos PDF, HTML y ePub. Descarga completa o por capítulos. Idioma : Inglés (eng) Palabras clave: Sistemas multimedia Visión por computador Inteligencia artificial Procesamiento de datos Protección de datos Sistemas de información multimedia MinerÃa de datos y descubrimiento de conocimientos Seguridad de datos e información Clasificación: 006.7 Resumen: El conjunto de tres volúmenes LNCS 101164, 11165 y 11166 constituye las actas arbitradas de la 19.ª Conferencia sobre Multimedia de la Cuenca del PacÃfico, PCM 2018, celebrada en Hefei, China, en septiembre de 2018. Los 209 artÃculos regulares presentados junto con 20 artÃculos de sesiones especiales fueron cuidadosamente revisados ​​y seleccionados entre 452 presentaciones. Los artÃculos cubren temas tales como: análisis de contenido multimedia; procesamiento de señales multimedia y comunicaciones; y aplicaciones y servicios multimedia. Nota de contenido: Multimedia content analysis -- Multimedia signal processing and communications -- Multimedia applications and services. Tipo de medio : Computadora Summary : The three-volume set LNCS 101164, 11165, and 11166 constitutes the refereed proceedings of the 19th Pacific-Rim Conference on Multimedia, PCM 2018, held in Hefei, China, in September 2018. The 209 regular papers presented together with 20 special session papers were carefully reviewed and selected from 452 submissions. The papers cover topics such as: multimedia content analysis; multimedia signal processing and communications; and multimedia applications and services. Enlace de acceso : https://link-springer-com.biblioproxy.umanizales.edu.co/referencework/10.1007/97 [...] Advances in Multimedia Information Processing – PCM 2018 : 19th Pacific-Rim Conference on Multimedia, Hefei, China, September 21-22, 2018, Proceedings, Part I [documento electrónico] / Hong, Richang, ; Cheng, Wen-Huang, ; Yamasaki, Toshihiko, ; Wang, Meng, ; Ngo, Chong-Wah, . - 1 ed. . - [s.l.] : Springer, 2018 . - XXX, 897 p. 372 ilustraciones.
ISBN : 978-3-030-00776-8
Libro disponible en la plataforma SpringerLink. Descarga y lectura en formatos PDF, HTML y ePub. Descarga completa o por capítulos.
Idioma : Inglés (eng)
Palabras clave: Sistemas multimedia Visión por computador Inteligencia artificial Procesamiento de datos Protección de datos Sistemas de información multimedia MinerÃa de datos y descubrimiento de conocimientos Seguridad de datos e información Clasificación: 006.7 Resumen: El conjunto de tres volúmenes LNCS 101164, 11165 y 11166 constituye las actas arbitradas de la 19.ª Conferencia sobre Multimedia de la Cuenca del PacÃfico, PCM 2018, celebrada en Hefei, China, en septiembre de 2018. Los 209 artÃculos regulares presentados junto con 20 artÃculos de sesiones especiales fueron cuidadosamente revisados ​​y seleccionados entre 452 presentaciones. Los artÃculos cubren temas tales como: análisis de contenido multimedia; procesamiento de señales multimedia y comunicaciones; y aplicaciones y servicios multimedia. Nota de contenido: Multimedia content analysis -- Multimedia signal processing and communications -- Multimedia applications and services. Tipo de medio : Computadora Summary : The three-volume set LNCS 101164, 11165, and 11166 constitutes the refereed proceedings of the 19th Pacific-Rim Conference on Multimedia, PCM 2018, held in Hefei, China, in September 2018. The 209 regular papers presented together with 20 special session papers were carefully reviewed and selected from 452 submissions. The papers cover topics such as: multimedia content analysis; multimedia signal processing and communications; and multimedia applications and services. Enlace de acceso : https://link-springer-com.biblioproxy.umanizales.edu.co/referencework/10.1007/97 [...] Advances in Multimedia Information Processing – PCM 2018 / Hong, Richang ; Cheng, Wen-Huang ; Yamasaki, Toshihiko ; Wang, Meng ; Ngo, Chong-Wah
![]()
PermalinkAdvances in Multimedia Information Processing – PCM 2018 / Hong, Richang ; Cheng, Wen-Huang ; Yamasaki, Toshihiko ; Wang, Meng ; Ngo, Chong-Wah
![]()
Permalink