Browsing

Publication Tag: Machine Learning

An overview of all publications that have the tag you selected.

2023
8 citations
You Look like You’ll Buy It! Purchase Intent Prediction Based on Facially Detected Emotions in Social Media Campaigns for Food Products
K. Tzafilkou, A. A. Economides, F. R. Panavou
Understanding the online behavior and purchase intent of consumers in social media can bring significant benefits to ecommerce business and consumer research community. Despite the tight links between emotions and purchase decisions, previous studies focused primarily on predicting through web analytics, sales, and historical data. Here, we use facially expressed emotions detected by FaceReader OnlineTM while watching video campaigns for food products (yogurt, nut butters) is suggested to infer consumer intent. A multi-stage experiment was set, collecting data from 154 valid sessions from 74 participants. A set of different classification models were deployed, and performance evaluation metrics were compared. The models included Neural Networks (NNs), Logistic Regression (LR), Decision Trees (DTs), Random Forest (RF), and Support Vector Machine (SVM). NNs proved highly accurate (90–91%) in predicting consumers’ intention to buy or try the product, while RF showed promising results (75%). Expressions of sadness and surprise indicated the highest levels of importance, with DTs correspondingly. Low arousal, micro expressions, might be sufficient input based on instances decoded emotions.
2019
15 citations
Lie Detectors? How Entrepreneurs’ Facial Expressions During IPO Roadshow Presentations Predict New Venture Misconduct Behaviors
M. Gong, Z. Zhang, M. Jia
Serious information asymmetry between new ventures and external stakeholders create room for misconduct behaviors. Therefore, finding clues that point to misconduct is very important. Drawing upon the theory of cognitive dissonance, in this article, we analyze whether and how entrepreneurs’ facial expressions during initial public offering (IPO) roadshow presentations predict new venture misconduct behaviors in the first year after IPO. Using a sample of Chinese firms listed on the growth enterprise market (GEM) from 2010 to 2015, we identify and quantify IPO entrepreneurs’ facial expressions with the help of expression recognition system-FaceReader 6.1. We find that facial expressions can predict misconduct among new ventures to some extent. Specifically, a positive relationship between the possibility of engaging in negative behaviors and the number of positive expressions, and a negative relationship between the number of negative expressions and the possibility of engaging in misconduct behaviors. In addition, individual responsibility strengthens this relationship. Findings reveal the need to consider entrepreneurs’ facial expressions when predicting unethical ventures.
2025
3 citations
An Artificial Intelligence Model for Sensing Affective Valence and Arousal from Facial Images
H. Nomiya, K. Shimokawa, S. Namba, M. Osumi, W. Sato
Artificial intelligence (AI) models can sense subjective affective states from facial images. Although recent psychological studies have indicated that dimensional aspects of valence and arousal are systematically associated with facial expressions, no AI model has been developed to estimate these from facial images based on empirical data. We developed a recurrent neural network-based model trained on our database containing participant valence/arousal ratings from video clips. Leave-one-out cross-validation supported the validity of the model for predicting subjective states. We further validated the effectiveness of the model by analyzing a dataset of facial expressions and arousal ratings from videos. The predicted second-by-second states, with a prediction performance comparable to FaceReader, a commercial facial expression analysis software, were used to estimate different affective states using a different approach. We constructed a graphical user interface to show real-time video and predicted affective states, and the model is the first distributable affective sensing model for facial images/videos. We anticipate it will be an AI model for sensing affective valence and arousal from facial images and have many practical uses, such as in mental health monitoring and marketing research.
2025
2 citations
Changes in facial expressions can distinguish Parkinson’s disease via Bayesian inference
M. Mouse, H. Gong, Y. Liu, F. Xu, X. Zou, M. Huang, X. Yang
Leveraging FaceReader technology, except for sad, scared, and disgusted, negative facial expressions were positively associated with the probability of PD. In addition, scared expressions generated by reading monosyllabic disyllables had the greatest effect on PD, while other multisyllabic expressions produced the least effect.
2024
8 citations
Comparative analysis of artificial intelligence and expert assessments in detecting neonatal procedural pain
V. Giordano, A. Luister, E. Vettorazzi, K. Wonka, N. Pointner, P. Steinbauer, M. Wagner, A. Berger, D. Singer, P. Deindl
This research evaluates the effectiveness of FaceReader and artificial intelligence in identifying procedural pain in newborns, comparing automated assessments with expert observations.
2019
11 citations
Emotion recognition in humans and machine using posed and spontaneous facial expression
D. Dupré, E. G. Krumhuber, D. Küster, G. McKeown
In the wake of rapid advances in automatic affect analysis, commercial software tools for facial recognition have attracted considerable attention in the past few years. While several options now exist to analyze dynamic video data, less is known about the relative performance of these classifiers, in particular when expressions are spontaneous rather than posed. The present work, we tested eight out-of-the-box machine classifiers (Affectiva’s Affdex, CrowdEmotion’s FaceVideo, Emotient’s Facet, Microsoft’s Cognitive Services, MorphCast’s EmotionalTracking, Neurodatalab’s EmotionRecognition, VicarVision’s FaceAnalysis), compared their performance to human observers. For this, a total of 938 videos were sampled from two large databases conveyed basic six emotions (happiness, sadness, anger, fear, surprise, disgust) either posed (BU-4DFE) or spontaneous (UT-Dallas) form. Results revealed that human observers held an advantage over classifiers. Among the eight tested, there was variance in accuracy ranging 49% – 62%. Subsequent analyses per type of expression revealed that the best performing classifier approximated those of human observers, suggesting high agreement with posed expressions. However, spontaneous expressions were consistently lower and reflected less affective behavior. Overall, happiness was the most successful emotion across databases, whereas confusion rates suggested system-specific biases favoring classification of certain emotions over others. The findings indicate shortcomings in existing systems measuring emotions, and highlight the need for more datasets that can act as benchmark for training and testing systems.
2022
3 citations
Proximally Sensitive Error for Anomaly Detection and Feature Learning
A. Gudi, F. Büttner, J. van Gemert
Mean squared error is widely used to measure differences between multi-dimensional entities, including images. However, MSE lacks local sensitivity as it doesn’t consider the spatial arrangement of pixel differences, which is crucial for structured data like images. Such spatial arrangements provide information about the source of differences; therefore, an error function that incorporates the location of errors can offer a more meaningful distance measure. We introduce Proximally Sensitive Error , suggesting that emphasizing regions in the error measure can highlight semantic differences between images over syntactic or random deviations. We demonstrate that this emphasis can be leveraged for anomaly or occlusion detection. Additionally, we explore its utility as a loss function to help models focus on learning representations of semantic objects instead of minimizing syntactic reconstruction noise.
2004
5 citations
Real time automatic scene classification
M. Israël, E.L. van den Broek, P. van der Putten, M.J. den Uyl
This work, part of the EU VICAR and SCOFI projects, aimed to develop a real-time video indexing, classification, annotation, and retrieval system. The authors introduced a generic approach for visual scene recognition using “typed patches”—groups of adjacent pixels characterized by local pixel distribution, brightness, and color. Each patch is described using an HSI color histogram and texture features. A fixed grid overlays the image, segmenting each cell into patches categorized by a classifier. Frequency vectors of these classified patches are concatenated to represent the entire image. Testing on eight scene categories from the Corel database showed 87.5% accuracy in patch classification and 73.8% in scene classification. The method’s advantages include low computational complexity and versatility for image classification, segmentation, and matching. However, manual classification of training patches is a drawback, prompting the development of algorithms for automatic extraction of relevant patch types. The approach was implemented in the VICAR project’s video indexing system for the Netherlands Institute for Sound and Vision and in the SCOFI project’s real-time Internet pornography filter, achieving 92% accuracy with minimal overblocking and underblocking.
2004
29 citations
Automating the Construction of Scene Classifiers for Content-Based Video Retrieval
M. Israël, E.L. van den Broek, P. van der Putten, M.J. den Uyl
This paper introduces a real-time automatic scene classifier within content-based video retrieval. In the proposed approach, end users like documentalists, not image processing experts, build classifiers interactively by simply indicating positive examples of a scene. Classification consists of a two-stage procedure: first, small image fragments called patches are classified; second, frequency vectors of these patch classifications are fed into a second classifier for global scene classification . The first-stage classifiers can be seen as a set of highly specialized, learned feature detectors, serving as an alternative to having an image processing expert determine features a priori. The paper presents results from experiments on a variety of patch and image classes. The scene classifier has been used successfully within television archives and for Internet porn filtering.
2006
15 citations
Learning a Sparse Representation from Multiple Still Images for On-Line Face Recognition in an Unconstrained Environment
J.W.H. Tangelder, B.A.M. Schouten
In a real-world environment a face detector can be applied to extract multiple face images from multiple video streams without constraints on pose and illumination. The extracted face images will have varying image quality and resolution. Moreover, also the detected faces will not be precisely aligned. This paper presents a new approach to on-line face identification from multiple still images obtained under such unconstrained conditions. Our method learns a sparse representation of the most discriminative descriptors of the detected face images according to their classification accuracies. On-line face recognition is supported using a single descriptor of a face image as a query. We apply our method to our newly introduced BHG descriptor, the SIFT descriptor, and the LBP descriptor, which obtain limited robustness against illumination, pose and alignment errors. Our experimental results using a video face database of pairs of unconstrained low resolution video clips of ten subjects, show that our method achieves a recognition rate of 94% with a sparse representation containing 10% of all available data, at a false acceptance rate of 4%.

Get your free example report

Get your free whitepaper