Información del autor
Autor Kim, Junmo |
Documentos disponibles escritos por este autor (3)



14th International Conference, MIWAI 2021, Virtual Event, July 2–3, 2021, Proceedings / Chomphuwiset, Phatthanaphong ; Kim, Junmo ; Pawara, Pornntiwa
![]()
TÃtulo : 14th International Conference, MIWAI 2021, Virtual Event, July 2–3, 2021, Proceedings Tipo de documento: documento electrónico Autores: Chomphuwiset, Phatthanaphong, ; Kim, Junmo, ; Pawara, Pornntiwa, Mención de edición: 1 ed. Editorial: [s.l.] : Springer Fecha de publicación: 2021 Número de páginas: XIV, 189 p. 80 ilustraciones, 52 ilustraciones en color. ISBN/ISSN/DL: 978-3-030-80253-0 Nota general: Libro disponible en la plataforma SpringerLink. Descarga y lectura en formatos PDF, HTML y ePub. Descarga completa o por capítulos. Idioma : Inglés (eng) Palabras clave: Inteligencia artificial IngenierÃa Informática Red de computadoras Visión por computador IngenierÃa Informática y Redes Clasificación: 006.3 Resumen: Este libro constituye las actas arbitradas de la 14.ª Conferencia Internacional sobre Tendencias Multidisciplinarias en Inteligencia Artificial, MIWAI 2021, celebrada en lÃnea en julio de 2021. Los 13 artÃculos completos y 3 artÃculos breves presentados fueron cuidadosamente revisados ​​y seleccionados entre 33 presentaciones. Cubren una amplia gama de temas en teorÃa, métodos y herramientas en subáreas de IA, como ciencia cognitiva, filosofÃa computacional, inteligencia computacional, teorÃa de juegos, aprendizaje automático, sistemas multiagente, lenguaje natural, representación y razonamiento, minerÃa de datos. , voz, visión por computadora e Internet, asà como sus aplicaciones en big data, bioinformática, biometrÃa, soporte de decisiones, gestión del conocimiento, privacidad, sistemas de recomendación, seguridad, ingenierÃa de software, filtrado de spam, vigilancia, telecomunicaciones, servicios web e IoT. Nota de contenido: 3D Point Cloud Upsampling and Colorization using GAN -- Learning Behavioral Rules from Multi-Agent Simulations for Optimizing Hospital Processes -- An Open-World Novelty Generator for Authoring Reinforcement Learning Environment of Standardized Toolkits -- Book Cover and Content Similarity Retrieval using Computer Vision and NLP Techniques -- Fast Classification Learning with Neural Networks and Conceptors for Speech Recognition and Car Driving Maneuvers -- Feature Group Importance for Automated Essay Scoring -- Feature Extraction Efficient for Face Verification Based on Residual Network Architecture -- Acquiring Input Features from Stock Market Summaries: A NLG Perspective -- A Comparative of A New Hybrid based on Neural Networks and SARIMA Models for Time Series Forecasting -- Cartpole Problem with PDL and GP using Multi-Objective Fitness Functions Differing in A Priori Knowledge -- Learning Robot Arm Controls using Augmented Random Search in Simulated Environments -- An Analytical Evaluation of a Deep Learning Model to Detect Network Intrusion -- Application of Machine Learning Techniques to Predict Breast Cancer Survival -- Thai Handwritten Recognition on BEST2019 Datasets using Deep Learning -- Comparing of Multi-class Text Classification Methods for Automatic Ratings of Consumer Reviews -- Designing An Algorithm for Scheduling Tasks for Multiagent Systems. Tipo de medio : Computadora Summary : This book constitutes the refereed proceedings of the 14th International Conference on Multi-disciplinary Trends in Artificial Intelligence, MIWAI 2021, held online in July 2021. The 13 full papers and 3 short papers presented were carefully reviewed and selected from 33 submissions. They cover a wide range of topics in theory, methods, and tools in AI sub-areas such as cognitive science, computational philosophy, computational intelligence, game theory, machine learning, multi-agent systems, natural language, representation and reasoning, data mining, speech, computer vision and the Web as well as their applications in big data, bioinformatics, biometrics, decision support, knowledge management, privacy, recommender systems, security, software engineering, spam filtering, surveillance, telecommunications, Web services, and IoT. Enlace de acceso : https://link-springer-com.biblioproxy.umanizales.edu.co/referencework/10.1007/97 [...] 14th International Conference, MIWAI 2021, Virtual Event, July 2–3, 2021, Proceedings [documento electrónico] / Chomphuwiset, Phatthanaphong, ; Kim, Junmo, ; Pawara, Pornntiwa, . - 1 ed. . - [s.l.] : Springer, 2021 . - XIV, 189 p. 80 ilustraciones, 52 ilustraciones en color.
ISBN : 978-3-030-80253-0
Libro disponible en la plataforma SpringerLink. Descarga y lectura en formatos PDF, HTML y ePub. Descarga completa o por capítulos.
Idioma : Inglés (eng)
Palabras clave: Inteligencia artificial IngenierÃa Informática Red de computadoras Visión por computador IngenierÃa Informática y Redes Clasificación: 006.3 Resumen: Este libro constituye las actas arbitradas de la 14.ª Conferencia Internacional sobre Tendencias Multidisciplinarias en Inteligencia Artificial, MIWAI 2021, celebrada en lÃnea en julio de 2021. Los 13 artÃculos completos y 3 artÃculos breves presentados fueron cuidadosamente revisados ​​y seleccionados entre 33 presentaciones. Cubren una amplia gama de temas en teorÃa, métodos y herramientas en subáreas de IA, como ciencia cognitiva, filosofÃa computacional, inteligencia computacional, teorÃa de juegos, aprendizaje automático, sistemas multiagente, lenguaje natural, representación y razonamiento, minerÃa de datos. , voz, visión por computadora e Internet, asà como sus aplicaciones en big data, bioinformática, biometrÃa, soporte de decisiones, gestión del conocimiento, privacidad, sistemas de recomendación, seguridad, ingenierÃa de software, filtrado de spam, vigilancia, telecomunicaciones, servicios web e IoT. Nota de contenido: 3D Point Cloud Upsampling and Colorization using GAN -- Learning Behavioral Rules from Multi-Agent Simulations for Optimizing Hospital Processes -- An Open-World Novelty Generator for Authoring Reinforcement Learning Environment of Standardized Toolkits -- Book Cover and Content Similarity Retrieval using Computer Vision and NLP Techniques -- Fast Classification Learning with Neural Networks and Conceptors for Speech Recognition and Car Driving Maneuvers -- Feature Group Importance for Automated Essay Scoring -- Feature Extraction Efficient for Face Verification Based on Residual Network Architecture -- Acquiring Input Features from Stock Market Summaries: A NLG Perspective -- A Comparative of A New Hybrid based on Neural Networks and SARIMA Models for Time Series Forecasting -- Cartpole Problem with PDL and GP using Multi-Objective Fitness Functions Differing in A Priori Knowledge -- Learning Robot Arm Controls using Augmented Random Search in Simulated Environments -- An Analytical Evaluation of a Deep Learning Model to Detect Network Intrusion -- Application of Machine Learning Techniques to Predict Breast Cancer Survival -- Thai Handwritten Recognition on BEST2019 Datasets using Deep Learning -- Comparing of Multi-class Text Classification Methods for Automatic Ratings of Consumer Reviews -- Designing An Algorithm for Scheduling Tasks for Multiagent Systems. Tipo de medio : Computadora Summary : This book constitutes the refereed proceedings of the 14th International Conference on Multi-disciplinary Trends in Artificial Intelligence, MIWAI 2021, held online in July 2021. The 13 full papers and 3 short papers presented were carefully reviewed and selected from 33 submissions. They cover a wide range of topics in theory, methods, and tools in AI sub-areas such as cognitive science, computational philosophy, computational intelligence, game theory, machine learning, multi-agent systems, natural language, representation and reasoning, data mining, speech, computer vision and the Web as well as their applications in big data, bioinformatics, biometrics, decision support, knowledge management, privacy, recommender systems, security, software engineering, spam filtering, surveillance, telecommunications, Web services, and IoT. Enlace de acceso : https://link-springer-com.biblioproxy.umanizales.edu.co/referencework/10.1007/97 [...] 26th International Conference, MMM 2020, Daejeon, South Korea, January 5–8, 2020, Proceedings, Part I / Ro, Yong Man ; Cheng, Wen-Huang ; Kim, Junmo ; Chu, Wei-Ta ; Cui, Peng ; Choi, Jung-Woo ; Hu, Min-Chun ; De Neve, Wesley
![]()
TÃtulo : 26th International Conference, MMM 2020, Daejeon, South Korea, January 5–8, 2020, Proceedings, Part I Tipo de documento: documento electrónico Autores: Ro, Yong Man, ; Cheng, Wen-Huang, ; Kim, Junmo, ; Chu, Wei-Ta, ; Cui, Peng, ; Choi, Jung-Woo, ; Hu, Min-Chun, ; De Neve, Wesley, Mención de edición: 1 ed. Editorial: [s.l.] : Springer Fecha de publicación: 2020 Número de páginas: XXIX, 844 p. 461 ilustraciones, 324 ilustraciones en color. ISBN/ISSN/DL: 978-3-030-37731-1 Nota general: Libro disponible en la plataforma SpringerLink. Descarga y lectura en formatos PDF, HTML y ePub. Descarga completa o por capítulos. Idioma : Inglés (eng) Palabras clave: Sistemas multimedia Visión por computador Inteligencia artificial Software de la aplicacion Interfaces de usuario (sistemas informáticos) La interacción persona-ordenador Sistemas de información multimedia Aplicaciones informáticas y de sistemas de información Interfaces de usuario e interacción persona-computadora Clasificación: 006.7 Resumen: El conjunto de dos volúmenes LNCS 11961 y 11962 constituye las actas minuciosamente arbitradas de la 25.ª Conferencia Internacional sobre Modelado Multimedia, MMM 2020, celebrada en Daejeon, Corea del Sur, en enero de 2020. De los 171 artÃculos de investigación completos presentados, 40 fueron seleccionados para presentación oral y 46 para presentación en póster; Se seleccionaron 28 trabajos de sesiones especiales para presentación oral y 8 para presentación en póster; Además, se aceptaron 9 artÃculos de demostración y 6 artÃculos para Video Browser Showdown 2020. Los artÃculos de LNCS 11961 están organizados en las siguientes secciones temáticas: procesamiento de señales y audio; codificación y HVS; procesamiento de color y arte; detección y clasificación; rostro; procesamiento de imágenes; aprendizaje y representación del conocimiento; procesamiento de vÃdeo; papeles para carteles; Los artÃculos de LNCS 11962 están organizados en las siguientes secciones temáticas: artÃculos de carteles; Visión 3D impulsada por IA; análisis multimedia: perspectivas, herramientas y aplicaciones; conjuntos de datos multimedia para experimentación repetible; computación afectiva multimodal de datos multimedia a gran escala; análisis multimedia y multimodal en el ámbito médico y entornos generalizados; seguridad multimedia inteligente; documentos de demostración; y artÃculos de VBS. Nota de contenido: Audio and Signal Processing -- Light Field Reconstruction using Dynamically Generated Filters -- Speaker-Aware Speech Emotion Recognition by Fusing Amplitude and Phase Information -- Gen-Res-Net: a Novel Generative Model for Singing Voice Separation -- A Distinct Synthesizer Convolutional TasNet for Singing Voice Separation -- Exploiting the Importance of Personalization When Selecting Music for Relaxation -- Coding and HVS -- An Efficient Encoding Method for Video Compositing in HEVC -- VHS to HDTV Video Translation using Multi-task Adversarial Learning -- Improving Just Noticeable Difference Model by Leveraging Temporal HVS Perception Characteristics -- Down-Sampling Based Video Coding with Degradation-aware Restoration-Reconstruction Deep Neural Network -- Beyond Literal Visual Modeling: Understanding Image Metaphor based on Literal-Implied Concept Mapping -- Color Processing and Art -- Deep Palette-based Color Decomposition for Image Recoloring with Aesthetic Suggestion -- On Creating Multimedia Interfaces for Hybrid Biological-Digital Art Installations -- Image Captioning based on Visual and Semantic Attention -- An Illumination Insensitive and Structure-aware Image Color Layer Decomposition Method -- CartoonRenderer: An Instance-based Multi-Style Cartoon Image Translator -- Detection and Classification -- Multi-Condition Place Generator for Robust Place Recognition -- Guided Refine-Head for Object Detection -- Towards Accurate Panel Detection in Manga: A Combined Effort of CNN and Heuristics -- Subclass Deep Neural Networks: Re-enabling Neglected Classes in Deep Network Training for Multimedia Classification -- Automatic Material Classification using Thermal Finger Impression -- Face -- Face Attributes Recognition Based on One-way Inferential Correlation between Attributes -- Eulerian Motion Based 3DCNN Architecture for Facial Micro-expression Recognition -- Emotion Recognition with Facial Landmark Heatmaps -- One-shot Face Recognition with Feature Rectification via Adversarial Learning -- Visual Sentiment Analysis by Leveraging Local Regions and Human Faces -- Image Processing -- Prediction-error Value Ordering for High-fidelity Reversible Data Hiding -- Classroom Attention Analysis Based on Multiple Euler Angles Constraint and Head Pose Estimation -- Multi-branch Body Region Alignment Network for Person Re-Identification -- DeepStroke: Understanding Glyph Structure with Semantic Segmentation and Tabu Search -- 3D Spatial Coverage Measurement of Aerial Images -- Learning and Knowledge Representation -- Instance Image Retrieval with Generative Adversarial Training -- An Effective Way to Boost Black-box Adversarial Attack -- Crowd Knowledge Enhanced Multimodal Conversational Assistant in Travel Domain -- Improved Model Structure with Cosine Margin OIM Loss For End-to-End Person Search -- Effective Barcode Hunter via Semantic Segmentation in the Wild -- Video Processing -- Wonderful Clips of Playing Basketball: A Database forLocalizing Wonderful Actions -- Structural Pyramid Network for Cascaded Optical Flow Estimation -- Real-time Multiple Pedestrians Tracking in Multi-camera System -- Learning Multi-feature based Spatially Regularized and Scale Adaptive Correlation Filters for Visual Tracking -- Unsupervised Video Summarization via Attention-Driven Adversarial Learning -- Poster Papers -- Efficient HEVC Downscale Transcoding Based on Coding Unit Information Mapping -- Fine-grain level sports video search engine -- The Korean Sign Language Dataset for Action Recognition -- SEE-LPR: A Semantic Segmentation based End-to-End System for Unconstrained License Plate Detection and Recognition -- Action Co-Localization in an Untrimmed Video by Graph Neural Networks -- A Novel Attention Enhanced Dense Network For Image Super-Resolution -- Marine Biometric Recognition Algorithm Based on YOLOv3-GAN Network -- Multi-scale Spatial Location Preference for Semantic Segmentation -- HRTF Representation with Convolutional Auto-Encoder -- Unsupervised Feature Propagation for Video Object Detection using Generative Adversarial Networks -- OmniEyes: Analysis and Synthesis of Artistically Painted Eyes -- LDSNE: Learning Structural Network Embeddings by Encoding Local Distances -- FurcaNeXt: End-to-End Monaural Speech Separation with Dynamic Gated Dilated Temporal Convolutional Networks -- Multi-step Coding Structure of Spatial Audio Object Coding -- Thermal Face Recognition based on Transformation by Residual U-Net and Pixel Shuffle Upsampling -- K-SVD Based Point Cloud Coding for RGB-D Video Compression Using 3D Super-point Clustering -- Resolution Booster: Global Structure Preserving Stitching Method For Ultra-High Resolution Image Translation -- Cross Fusion for Egocentric Interactive Action Recognition -- Improving Brain Tumor Segmentation with Dilated Pseudo-3D Convolution and Multi-direction Fusion -- Texture-based Fast CU Size Decision and Intra Mode Decision Algorithm for VVC -- An Efficient Hierarchical Near-Duplicate Video Detection Algorithm Based on Deep Semantic Features -- Meta Transfer Learning for Adaptive Vehicle Tracking in UAV Videos -- Adversarial Query-by-Image Video Retrieval Based on Attention Mechanism -- Joint Sketch-Attribute Learning for Fine-Grained Face Synthesis -- High Accuracy Perceptual video hashing via Low-Rank decomposition and DWT -- HMM-Based Person Re-Identification in Large-scale Open Scenario -- No Reference Image Quality Assessment by Information Decomposition. Tipo de medio : Computadora Summary : The two-volume set LNCS 11961 and 11962 constitutes the thoroughly refereed proceedings of the 25th International Conference on MultiMedia Modeling, MMM 2020, held in Daejeon, South Korea, in January 2020. Of the 171 submitted full research papers, 40 papers were selected for oral presentation and 46 for poster presentation; 28 special session papers were selected for oral presentation and 8 for poster presentation; in addition, 9 demonstration papers and 6 papers for the Video Browser Showdown 2020 were accepted. The papers of LNCS 11961 are organized in the following topical sections: audio and signal processing; coding and HVS; color processing and art; detection and classification; face; image processing; learning and knowledge representation; video processing; poster papers; the papers of LNCS 11962 are organized in the following topical sections: poster papers; AI-powered 3D vision; multimedia analytics: perspectives, tools and applications; multimedia datasets for repeatable experimentation; multi-modal affective computing of large-scale multimedia data; multimedia and multimodal analytics in the medical domain and pervasive environments; intelligent multimedia security; demo papers; and VBS papers. Enlace de acceso : https://link-springer-com.biblioproxy.umanizales.edu.co/referencework/10.1007/97 [...] 26th International Conference, MMM 2020, Daejeon, South Korea, January 5–8, 2020, Proceedings, Part I [documento electrónico] / Ro, Yong Man, ; Cheng, Wen-Huang, ; Kim, Junmo, ; Chu, Wei-Ta, ; Cui, Peng, ; Choi, Jung-Woo, ; Hu, Min-Chun, ; De Neve, Wesley, . - 1 ed. . - [s.l.] : Springer, 2020 . - XXIX, 844 p. 461 ilustraciones, 324 ilustraciones en color.
ISBN : 978-3-030-37731-1
Libro disponible en la plataforma SpringerLink. Descarga y lectura en formatos PDF, HTML y ePub. Descarga completa o por capítulos.
Idioma : Inglés (eng)
Palabras clave: Sistemas multimedia Visión por computador Inteligencia artificial Software de la aplicacion Interfaces de usuario (sistemas informáticos) La interacción persona-ordenador Sistemas de información multimedia Aplicaciones informáticas y de sistemas de información Interfaces de usuario e interacción persona-computadora Clasificación: 006.7 Resumen: El conjunto de dos volúmenes LNCS 11961 y 11962 constituye las actas minuciosamente arbitradas de la 25.ª Conferencia Internacional sobre Modelado Multimedia, MMM 2020, celebrada en Daejeon, Corea del Sur, en enero de 2020. De los 171 artÃculos de investigación completos presentados, 40 fueron seleccionados para presentación oral y 46 para presentación en póster; Se seleccionaron 28 trabajos de sesiones especiales para presentación oral y 8 para presentación en póster; Además, se aceptaron 9 artÃculos de demostración y 6 artÃculos para Video Browser Showdown 2020. Los artÃculos de LNCS 11961 están organizados en las siguientes secciones temáticas: procesamiento de señales y audio; codificación y HVS; procesamiento de color y arte; detección y clasificación; rostro; procesamiento de imágenes; aprendizaje y representación del conocimiento; procesamiento de vÃdeo; papeles para carteles; Los artÃculos de LNCS 11962 están organizados en las siguientes secciones temáticas: artÃculos de carteles; Visión 3D impulsada por IA; análisis multimedia: perspectivas, herramientas y aplicaciones; conjuntos de datos multimedia para experimentación repetible; computación afectiva multimodal de datos multimedia a gran escala; análisis multimedia y multimodal en el ámbito médico y entornos generalizados; seguridad multimedia inteligente; documentos de demostración; y artÃculos de VBS. Nota de contenido: Audio and Signal Processing -- Light Field Reconstruction using Dynamically Generated Filters -- Speaker-Aware Speech Emotion Recognition by Fusing Amplitude and Phase Information -- Gen-Res-Net: a Novel Generative Model for Singing Voice Separation -- A Distinct Synthesizer Convolutional TasNet for Singing Voice Separation -- Exploiting the Importance of Personalization When Selecting Music for Relaxation -- Coding and HVS -- An Efficient Encoding Method for Video Compositing in HEVC -- VHS to HDTV Video Translation using Multi-task Adversarial Learning -- Improving Just Noticeable Difference Model by Leveraging Temporal HVS Perception Characteristics -- Down-Sampling Based Video Coding with Degradation-aware Restoration-Reconstruction Deep Neural Network -- Beyond Literal Visual Modeling: Understanding Image Metaphor based on Literal-Implied Concept Mapping -- Color Processing and Art -- Deep Palette-based Color Decomposition for Image Recoloring with Aesthetic Suggestion -- On Creating Multimedia Interfaces for Hybrid Biological-Digital Art Installations -- Image Captioning based on Visual and Semantic Attention -- An Illumination Insensitive and Structure-aware Image Color Layer Decomposition Method -- CartoonRenderer: An Instance-based Multi-Style Cartoon Image Translator -- Detection and Classification -- Multi-Condition Place Generator for Robust Place Recognition -- Guided Refine-Head for Object Detection -- Towards Accurate Panel Detection in Manga: A Combined Effort of CNN and Heuristics -- Subclass Deep Neural Networks: Re-enabling Neglected Classes in Deep Network Training for Multimedia Classification -- Automatic Material Classification using Thermal Finger Impression -- Face -- Face Attributes Recognition Based on One-way Inferential Correlation between Attributes -- Eulerian Motion Based 3DCNN Architecture for Facial Micro-expression Recognition -- Emotion Recognition with Facial Landmark Heatmaps -- One-shot Face Recognition with Feature Rectification via Adversarial Learning -- Visual Sentiment Analysis by Leveraging Local Regions and Human Faces -- Image Processing -- Prediction-error Value Ordering for High-fidelity Reversible Data Hiding -- Classroom Attention Analysis Based on Multiple Euler Angles Constraint and Head Pose Estimation -- Multi-branch Body Region Alignment Network for Person Re-Identification -- DeepStroke: Understanding Glyph Structure with Semantic Segmentation and Tabu Search -- 3D Spatial Coverage Measurement of Aerial Images -- Learning and Knowledge Representation -- Instance Image Retrieval with Generative Adversarial Training -- An Effective Way to Boost Black-box Adversarial Attack -- Crowd Knowledge Enhanced Multimodal Conversational Assistant in Travel Domain -- Improved Model Structure with Cosine Margin OIM Loss For End-to-End Person Search -- Effective Barcode Hunter via Semantic Segmentation in the Wild -- Video Processing -- Wonderful Clips of Playing Basketball: A Database forLocalizing Wonderful Actions -- Structural Pyramid Network for Cascaded Optical Flow Estimation -- Real-time Multiple Pedestrians Tracking in Multi-camera System -- Learning Multi-feature based Spatially Regularized and Scale Adaptive Correlation Filters for Visual Tracking -- Unsupervised Video Summarization via Attention-Driven Adversarial Learning -- Poster Papers -- Efficient HEVC Downscale Transcoding Based on Coding Unit Information Mapping -- Fine-grain level sports video search engine -- The Korean Sign Language Dataset for Action Recognition -- SEE-LPR: A Semantic Segmentation based End-to-End System for Unconstrained License Plate Detection and Recognition -- Action Co-Localization in an Untrimmed Video by Graph Neural Networks -- A Novel Attention Enhanced Dense Network For Image Super-Resolution -- Marine Biometric Recognition Algorithm Based on YOLOv3-GAN Network -- Multi-scale Spatial Location Preference for Semantic Segmentation -- HRTF Representation with Convolutional Auto-Encoder -- Unsupervised Feature Propagation for Video Object Detection using Generative Adversarial Networks -- OmniEyes: Analysis and Synthesis of Artistically Painted Eyes -- LDSNE: Learning Structural Network Embeddings by Encoding Local Distances -- FurcaNeXt: End-to-End Monaural Speech Separation with Dynamic Gated Dilated Temporal Convolutional Networks -- Multi-step Coding Structure of Spatial Audio Object Coding -- Thermal Face Recognition based on Transformation by Residual U-Net and Pixel Shuffle Upsampling -- K-SVD Based Point Cloud Coding for RGB-D Video Compression Using 3D Super-point Clustering -- Resolution Booster: Global Structure Preserving Stitching Method For Ultra-High Resolution Image Translation -- Cross Fusion for Egocentric Interactive Action Recognition -- Improving Brain Tumor Segmentation with Dilated Pseudo-3D Convolution and Multi-direction Fusion -- Texture-based Fast CU Size Decision and Intra Mode Decision Algorithm for VVC -- An Efficient Hierarchical Near-Duplicate Video Detection Algorithm Based on Deep Semantic Features -- Meta Transfer Learning for Adaptive Vehicle Tracking in UAV Videos -- Adversarial Query-by-Image Video Retrieval Based on Attention Mechanism -- Joint Sketch-Attribute Learning for Fine-Grained Face Synthesis -- High Accuracy Perceptual video hashing via Low-Rank decomposition and DWT -- HMM-Based Person Re-Identification in Large-scale Open Scenario -- No Reference Image Quality Assessment by Information Decomposition. Tipo de medio : Computadora Summary : The two-volume set LNCS 11961 and 11962 constitutes the thoroughly refereed proceedings of the 25th International Conference on MultiMedia Modeling, MMM 2020, held in Daejeon, South Korea, in January 2020. Of the 171 submitted full research papers, 40 papers were selected for oral presentation and 46 for poster presentation; 28 special session papers were selected for oral presentation and 8 for poster presentation; in addition, 9 demonstration papers and 6 papers for the Video Browser Showdown 2020 were accepted. The papers of LNCS 11961 are organized in the following topical sections: audio and signal processing; coding and HVS; color processing and art; detection and classification; face; image processing; learning and knowledge representation; video processing; poster papers; the papers of LNCS 11962 are organized in the following topical sections: poster papers; AI-powered 3D vision; multimedia analytics: perspectives, tools and applications; multimedia datasets for repeatable experimentation; multi-modal affective computing of large-scale multimedia data; multimedia and multimodal analytics in the medical domain and pervasive environments; intelligent multimedia security; demo papers; and VBS papers. Enlace de acceso : https://link-springer-com.biblioproxy.umanizales.edu.co/referencework/10.1007/97 [...] 26th International Conference, MMM 2020, Daejeon, South Korea, January 5–8, 2020, Proceedings, Part II / Ro, Yong Man ; Cheng, Wen-Huang ; Kim, Junmo ; Chu, Wei-Ta ; Cui, Peng ; Choi, Jung-Woo ; Hu, Min-Chun ; De Neve, Wesley
![]()
TÃtulo : 26th International Conference, MMM 2020, Daejeon, South Korea, January 5–8, 2020, Proceedings, Part II Tipo de documento: documento electrónico Autores: Ro, Yong Man, ; Cheng, Wen-Huang, ; Kim, Junmo, ; Chu, Wei-Ta, ; Cui, Peng, ; Choi, Jung-Woo, ; Hu, Min-Chun, ; De Neve, Wesley, Mención de edición: 1 ed. Editorial: [s.l.] : Springer Fecha de publicación: 2020 Número de páginas: XXX, 820 p. 385 ilustraciones, 271 ilustraciones en color. ISBN/ISSN/DL: 978-3-030-37734-2 Nota general: Libro disponible en la plataforma SpringerLink. Descarga y lectura en formatos PDF, HTML y ePub. Descarga completa o por capítulos. Idioma : Inglés (eng) Palabras clave: Sistemas multimedia Visión por computador Inteligencia artificial Software de la aplicacion Interfaces de usuario (sistemas informáticos) La interacción persona-ordenador Sistemas de información multimedia Aplicaciones informáticas y de sistemas de información Interfaces de usuario e interacción persona-computadora Clasificación: 006.7 Resumen: El conjunto de dos volúmenes LNCS 11961 y 11962 constituye las actas minuciosamente arbitradas de la 25.ª Conferencia Internacional sobre Modelado Multimedia, MMM 2020, celebrada en Daejeon, Corea del Sur, en enero de 2020. De los 171 artÃculos de investigación completos presentados, 40 fueron seleccionados para presentación oral y 46 para presentación en póster; Se seleccionaron 28 trabajos de sesiones especiales para presentación oral y 8 para presentación en póster; Además, se aceptaron 9 artÃculos de demostración y 6 artÃculos para Video Browser Showdown 2020. Los artÃculos de LNCS 11961 están organizados en las siguientes secciones temáticas: procesamiento de señales y audio; codificación y HVS; procesamiento de color y arte; detección y clasificación; rostro; procesamiento de imágenes; aprendizaje y representación del conocimiento; procesamiento de vÃdeo; papeles para carteles; Los artÃculos de LNCS 11962 están organizados en las siguientes secciones temáticas: artÃculos de carteles; Visión 3D impulsada por IA; análisis multimedia: perspectivas, herramientas y aplicaciones; conjuntos de datos multimedia para experimentación repetible; computación afectiva multimodal de datos multimedia a gran escala; análisis multimedia y multimodal en el ámbito médico y entornos generalizados; seguridad multimedia inteligente; documentos de demostración; y artÃculos de VBS. Nota de contenido: Poster Papers -- Multi-Scale Comparison Network for Few-Shot Learning -- Semantic and Morphological Information guided Chinese Text Classification -- A Delay-aware Adaptation Framework for Cloud Gaming under the Computation Constraint of User Devices -- Efficient Edge Caching for High-Quality 360-Degree Video Delivery -- Inferring Emphasis for Real Voice Data: an Attentive Multimodal Neural Network Approach -- PRIME: Block-wise Missingness Handling for Multi-modalities in Intelligent Tutoring Systems -- A New Local Transformation Module for Few-shot Segmentation -- Background Segmentation for Vehicle Re-Identification -- Face Tells Detailed Expression: Generating Comprehensive Facial Expression Sentence through Facial Action Units -- A Deep Convolutional Deblurring and Detection Neural Network for Localizing Text in Videos -- Generate images with obfuscated attributes for private image classifcation -- Context-Aware Residual Network with Promotion Gates for Single Image Super-Resolution -- A Compact Deep Neural Network for Single Image Super-Resolution -- An Efficient Algorithm of Facial Expression Recognition by TSG-RNN Network -- Structured Neural Motifs: Scene Graph Parsing via Enhanced Context -- Perceptual Localization of Virtual Sound Source Based on Loudspeaker Triplet -- TK-Text: Multi-shaped Scene Text Detection via Instance Segmentation -- More-Natural Mimetic Words Generation for Fine-grained Gait Description -- Lite Hourglass Network for Multi-person Pose Estimation -- SS1: AI-Powered 3D Vision -- Single View Depth Estimation via Dense Convolution Network with Self-supervision -- Multi-Data UAV Images for Large Scale Reconstruction of Buildings -- Deformed Phase Prediction Using SVM for Structured Light Depth Generation -- Extraction of Multi-class Multi-instance Geometric Primitives from Point Clouds Using Energy Minimization -- Similarity Graph Convolutional Construction Network for Interactive Action Recognition -- Content-Aware Cubemap Projection for Panoramic Image via Deep Q-Learning -- Robust RGB-D Data Registration Based on Correntropy and Bi-directional Distance -- InSphereNet: a Concise Representation and Classification Method for 3D Object -- 3-D Oral Shape Retrieval Using Registration Algorithm -- Face Super-Resolution by Learning Multi-view Texture Compensation -- Light Field Salient Object Detection via Hybrid Priors -- SS2: Multimedia Analytics: Perspectives, Tools and Applications -- Multimedia Analytics Challenges and Opportunities for Creating Interactive Radio Content -- Interactive Search and Exploration in Discussion Forums Using Multimodal Embeddings -- An inverse mapping with manifold alignment for zero-shot learning -- Baseline Analysis of a Conventional and Virtual Reality Lifelog Retrieval System -- An Extensible Framework for Interactive Real-time Visualizations of Large-scale Heterogeneous Multimedia Information from Online Sources -- SS3: MDRE: Multimedia Datasets for Repeatable Experimentation -- GLENDA: Gynecologic Laparoscopy Endometriosis Dataset -- Kvasir-SEG: A Segmented Polyp Dataset -- Rethinking the Test Collection Methodology for Personal Self-Tracking Data -- Experiences and Insights from the Collection of a Novel Multimedia EEG Dataset -- SS4: MMAC: Multi-Modal Affective Computing of Large-Scale Multimedia Data -- Relation Modeling with Graph Convolutional Networks for Facial Action Unit Detection -- Enhanced Gaze Following via Object Detection and Human Pose Estimation -- Region Based Adversarial Synthesis of Facial Action Units -- Facial Expression Restoration Based on Improved Graph Convolutional Networks -- Global Affective Video Content Regression Based on Complementary Audio-Visual Features -- SS5: MULTIMED: Multimedia and Multimodal Analytics in the Medical Domain and Pervasive Environments -- Using Publicly Available Medical Images from the Open Access Literature and Social Networks for Model Training and Knowledge Extraction -- AttenNet: Deep Attention based Retinal Disease Classification in OCT Images -- NOVA: A Tool for Explanatory Multimodal Behavior Analysis and its Application to Psychotherapy -- Instrument Recognition in Laparoscopy for Technical Skill Assessment -- Real-time Recognition of Daily Actions Based on 3D Joint Movements and Fisher Encoding -- Model-based and Class-based Fusion of Multisensor Data -- Evaluating the Generalization Performance of Instrument Classification in Cataract Surgery Videos -- SS6: Intelligent Multimedia Security -- Compact Position-aware Attention Network for Image Semantic Segmentation -- Law is Order: Protecting Multimedia Network Transmission by Game Theory and Mechanism Design -- Rational Delegation Computing Using Information Theory and Game Theory Approach -- Multi-hop Interactive Cross-modal Retrieval -- Demo Papers -- Browsing Visual Sentiment Datasets using Psycholinguistic Groundings -- Framework Design for Multiplayer Motion Sensing Game in Mixture Reality -- Lyrics-Conditioned Neural Melody Generation -- A Web-based Visualization Tool for 3D Spatial Coverage Measurement of Aerial Images -- An Attention Based Speaker-Independent Audio-Visual Deep Learning Model for Speech Enhancement -- DIME: An Online Tool for the Visual Comparison of Cross-Modal Retrieval Models -- Real-time Demonstration of Personal Audio and 3D Audio Rendering Using Line Array Systems -- CNN-based Multi-Scale Super-Resolution Architecture on FPGA for 4K/8K UHD Applications -- Effective Utilization of Hybrid Residual Modules in Deep Neural Networks for Super Resolution -- VBS Papers -- diveXplore 4.0: The ITEC Deep Interactive Video Exploration System at VBS2020 -- Combining Boolean and Multimedia Retrieval in vitrivr for Large-Scale Video Search -- An Interactive Video Search Platform for Multi-modal Retrieval with Advanced Concepts -- VIREO @ Video Browser Showdown 2020 -- VERGE in VBS 2020 -- VIRET at Video Browser Showdown 2020 -- SOM-Hunter: Video Browsing with Relevance-to-SOM Feedback Loop -- Exquisitor at the Video Browser Showdown 2020 -- Deep Learning-Based Video Retrieval using Object Relationships and Associated Audio Classes -- IVIST: Interactive Video Search Tool in VBS 2020. Tipo de medio : Computadora Summary : The two-volume set LNCS 11961 and 11962 constitutes the thoroughly refereed proceedings of the 25th International Conference on MultiMedia Modeling, MMM 2020, held in Daejeon, South Korea, in January 2020. Of the 171 submitted full research papers, 40 papers were selected for oral presentation and 46 for poster presentation; 28 special session papers were selected for oral presentation and 8 for poster presentation; in addition, 9 demonstration papers and 6 papers for the Video Browser Showdown 2020 were accepted. The papers of LNCS 11961 are organized in the following topical sections: audio and signal processing; coding and HVS; color processing and art; detection and classification; face; image processing; learning and knowledge representation; video processing; poster papers; the papers of LNCS 11962 are organized in the following topical sections: poster papers; AI-powered 3D vision; multimedia analytics: perspectives, tools and applications; multimedia datasets for repeatable experimentation; multi-modal affective computing of large-scale multimedia data; multimedia and multimodal analytics in the medical domain and pervasive environments; intelligent multimedia security; demo papers; and VBS papers. Enlace de acceso : https://link-springer-com.biblioproxy.umanizales.edu.co/referencework/10.1007/97 [...] 26th International Conference, MMM 2020, Daejeon, South Korea, January 5–8, 2020, Proceedings, Part II [documento electrónico] / Ro, Yong Man, ; Cheng, Wen-Huang, ; Kim, Junmo, ; Chu, Wei-Ta, ; Cui, Peng, ; Choi, Jung-Woo, ; Hu, Min-Chun, ; De Neve, Wesley, . - 1 ed. . - [s.l.] : Springer, 2020 . - XXX, 820 p. 385 ilustraciones, 271 ilustraciones en color.
ISBN : 978-3-030-37734-2
Libro disponible en la plataforma SpringerLink. Descarga y lectura en formatos PDF, HTML y ePub. Descarga completa o por capítulos.
Idioma : Inglés (eng)
Palabras clave: Sistemas multimedia Visión por computador Inteligencia artificial Software de la aplicacion Interfaces de usuario (sistemas informáticos) La interacción persona-ordenador Sistemas de información multimedia Aplicaciones informáticas y de sistemas de información Interfaces de usuario e interacción persona-computadora Clasificación: 006.7 Resumen: El conjunto de dos volúmenes LNCS 11961 y 11962 constituye las actas minuciosamente arbitradas de la 25.ª Conferencia Internacional sobre Modelado Multimedia, MMM 2020, celebrada en Daejeon, Corea del Sur, en enero de 2020. De los 171 artÃculos de investigación completos presentados, 40 fueron seleccionados para presentación oral y 46 para presentación en póster; Se seleccionaron 28 trabajos de sesiones especiales para presentación oral y 8 para presentación en póster; Además, se aceptaron 9 artÃculos de demostración y 6 artÃculos para Video Browser Showdown 2020. Los artÃculos de LNCS 11961 están organizados en las siguientes secciones temáticas: procesamiento de señales y audio; codificación y HVS; procesamiento de color y arte; detección y clasificación; rostro; procesamiento de imágenes; aprendizaje y representación del conocimiento; procesamiento de vÃdeo; papeles para carteles; Los artÃculos de LNCS 11962 están organizados en las siguientes secciones temáticas: artÃculos de carteles; Visión 3D impulsada por IA; análisis multimedia: perspectivas, herramientas y aplicaciones; conjuntos de datos multimedia para experimentación repetible; computación afectiva multimodal de datos multimedia a gran escala; análisis multimedia y multimodal en el ámbito médico y entornos generalizados; seguridad multimedia inteligente; documentos de demostración; y artÃculos de VBS. Nota de contenido: Poster Papers -- Multi-Scale Comparison Network for Few-Shot Learning -- Semantic and Morphological Information guided Chinese Text Classification -- A Delay-aware Adaptation Framework for Cloud Gaming under the Computation Constraint of User Devices -- Efficient Edge Caching for High-Quality 360-Degree Video Delivery -- Inferring Emphasis for Real Voice Data: an Attentive Multimodal Neural Network Approach -- PRIME: Block-wise Missingness Handling for Multi-modalities in Intelligent Tutoring Systems -- A New Local Transformation Module for Few-shot Segmentation -- Background Segmentation for Vehicle Re-Identification -- Face Tells Detailed Expression: Generating Comprehensive Facial Expression Sentence through Facial Action Units -- A Deep Convolutional Deblurring and Detection Neural Network for Localizing Text in Videos -- Generate images with obfuscated attributes for private image classifcation -- Context-Aware Residual Network with Promotion Gates for Single Image Super-Resolution -- A Compact Deep Neural Network for Single Image Super-Resolution -- An Efficient Algorithm of Facial Expression Recognition by TSG-RNN Network -- Structured Neural Motifs: Scene Graph Parsing via Enhanced Context -- Perceptual Localization of Virtual Sound Source Based on Loudspeaker Triplet -- TK-Text: Multi-shaped Scene Text Detection via Instance Segmentation -- More-Natural Mimetic Words Generation for Fine-grained Gait Description -- Lite Hourglass Network for Multi-person Pose Estimation -- SS1: AI-Powered 3D Vision -- Single View Depth Estimation via Dense Convolution Network with Self-supervision -- Multi-Data UAV Images for Large Scale Reconstruction of Buildings -- Deformed Phase Prediction Using SVM for Structured Light Depth Generation -- Extraction of Multi-class Multi-instance Geometric Primitives from Point Clouds Using Energy Minimization -- Similarity Graph Convolutional Construction Network for Interactive Action Recognition -- Content-Aware Cubemap Projection for Panoramic Image via Deep Q-Learning -- Robust RGB-D Data Registration Based on Correntropy and Bi-directional Distance -- InSphereNet: a Concise Representation and Classification Method for 3D Object -- 3-D Oral Shape Retrieval Using Registration Algorithm -- Face Super-Resolution by Learning Multi-view Texture Compensation -- Light Field Salient Object Detection via Hybrid Priors -- SS2: Multimedia Analytics: Perspectives, Tools and Applications -- Multimedia Analytics Challenges and Opportunities for Creating Interactive Radio Content -- Interactive Search and Exploration in Discussion Forums Using Multimodal Embeddings -- An inverse mapping with manifold alignment for zero-shot learning -- Baseline Analysis of a Conventional and Virtual Reality Lifelog Retrieval System -- An Extensible Framework for Interactive Real-time Visualizations of Large-scale Heterogeneous Multimedia Information from Online Sources -- SS3: MDRE: Multimedia Datasets for Repeatable Experimentation -- GLENDA: Gynecologic Laparoscopy Endometriosis Dataset -- Kvasir-SEG: A Segmented Polyp Dataset -- Rethinking the Test Collection Methodology for Personal Self-Tracking Data -- Experiences and Insights from the Collection of a Novel Multimedia EEG Dataset -- SS4: MMAC: Multi-Modal Affective Computing of Large-Scale Multimedia Data -- Relation Modeling with Graph Convolutional Networks for Facial Action Unit Detection -- Enhanced Gaze Following via Object Detection and Human Pose Estimation -- Region Based Adversarial Synthesis of Facial Action Units -- Facial Expression Restoration Based on Improved Graph Convolutional Networks -- Global Affective Video Content Regression Based on Complementary Audio-Visual Features -- SS5: MULTIMED: Multimedia and Multimodal Analytics in the Medical Domain and Pervasive Environments -- Using Publicly Available Medical Images from the Open Access Literature and Social Networks for Model Training and Knowledge Extraction -- AttenNet: Deep Attention based Retinal Disease Classification in OCT Images -- NOVA: A Tool for Explanatory Multimodal Behavior Analysis and its Application to Psychotherapy -- Instrument Recognition in Laparoscopy for Technical Skill Assessment -- Real-time Recognition of Daily Actions Based on 3D Joint Movements and Fisher Encoding -- Model-based and Class-based Fusion of Multisensor Data -- Evaluating the Generalization Performance of Instrument Classification in Cataract Surgery Videos -- SS6: Intelligent Multimedia Security -- Compact Position-aware Attention Network for Image Semantic Segmentation -- Law is Order: Protecting Multimedia Network Transmission by Game Theory and Mechanism Design -- Rational Delegation Computing Using Information Theory and Game Theory Approach -- Multi-hop Interactive Cross-modal Retrieval -- Demo Papers -- Browsing Visual Sentiment Datasets using Psycholinguistic Groundings -- Framework Design for Multiplayer Motion Sensing Game in Mixture Reality -- Lyrics-Conditioned Neural Melody Generation -- A Web-based Visualization Tool for 3D Spatial Coverage Measurement of Aerial Images -- An Attention Based Speaker-Independent Audio-Visual Deep Learning Model for Speech Enhancement -- DIME: An Online Tool for the Visual Comparison of Cross-Modal Retrieval Models -- Real-time Demonstration of Personal Audio and 3D Audio Rendering Using Line Array Systems -- CNN-based Multi-Scale Super-Resolution Architecture on FPGA for 4K/8K UHD Applications -- Effective Utilization of Hybrid Residual Modules in Deep Neural Networks for Super Resolution -- VBS Papers -- diveXplore 4.0: The ITEC Deep Interactive Video Exploration System at VBS2020 -- Combining Boolean and Multimedia Retrieval in vitrivr for Large-Scale Video Search -- An Interactive Video Search Platform for Multi-modal Retrieval with Advanced Concepts -- VIREO @ Video Browser Showdown 2020 -- VERGE in VBS 2020 -- VIRET at Video Browser Showdown 2020 -- SOM-Hunter: Video Browsing with Relevance-to-SOM Feedback Loop -- Exquisitor at the Video Browser Showdown 2020 -- Deep Learning-Based Video Retrieval using Object Relationships and Associated Audio Classes -- IVIST: Interactive Video Search Tool in VBS 2020. Tipo de medio : Computadora Summary : The two-volume set LNCS 11961 and 11962 constitutes the thoroughly refereed proceedings of the 25th International Conference on MultiMedia Modeling, MMM 2020, held in Daejeon, South Korea, in January 2020. Of the 171 submitted full research papers, 40 papers were selected for oral presentation and 46 for poster presentation; 28 special session papers were selected for oral presentation and 8 for poster presentation; in addition, 9 demonstration papers and 6 papers for the Video Browser Showdown 2020 were accepted. The papers of LNCS 11961 are organized in the following topical sections: audio and signal processing; coding and HVS; color processing and art; detection and classification; face; image processing; learning and knowledge representation; video processing; poster papers; the papers of LNCS 11962 are organized in the following topical sections: poster papers; AI-powered 3D vision; multimedia analytics: perspectives, tools and applications; multimedia datasets for repeatable experimentation; multi-modal affective computing of large-scale multimedia data; multimedia and multimodal analytics in the medical domain and pervasive environments; intelligent multimedia security; demo papers; and VBS papers. Enlace de acceso : https://link-springer-com.biblioproxy.umanizales.edu.co/referencework/10.1007/97 [...]