Soft Metrics Research Articles

Large Language Models (LLMs) can solve some undergraduate-level to graduate-level physics textbook problems and are proficient at coding. Combining these two capabilities could one day enable AI systems to simulate and predict the physical world. We present an evaluation of state-of-the-art (SOTA) LLMs on PhD-level to research-level computational physics problems. We condition LLM generation on the use of well-documented and widely used packages to elicit coding capabilities in the physics and astrophysics domains. We contribute ∼50 original and challenging problems in celestial mechanics (with REBOUND), stellar physics (with MESA), 1D fluid dynamics (with Dedalus) and non-linear dynamics (with SciPy). Since our problems do not admit unique solutions, we evaluate LLM performance on several soft metrics: counts of lines that contain different types of errors (coding, physics, necessity and sufficiency) as well as a more "educational" Pass-Fail metric focused on capturing the salient physical ingredients of the problem at hand. As expected, today's SOTA LLM (GPT4) zero-shot fails most of our problems, although about 40% of the solutions could plausibly get a passing grade. About 70%–90% of the code lines produced are necessary, sufficient and correct (coding & physics). Physics and coding errors are the most common, with some unnecessary or insufficient lines. We observe significant variations across problem class and difficulty. We identify several failure modes of GPT4 in the computational physics domain, such as poor physical units handling, poor code versioning, a tendency to hallucinate plausible sub-modules, lack of physical justification for global run parameters (e.g., simulation time, or upper-lower bounds for parametric exploration) and inability to define steady-state or stopping conditions reliably. Our reconnaissance work provides a snapshot of current computational capabilities in classical physics and points to obvious improvement targets if AI systems are ever to reach a basic level of autonomy in physics simulation capabilities.

ABSTRACT

Purpose – The purpose of this study is to examine the antecedents of service quality and customer value in a manufacturer-distributor context by elaborating the basic principles of general systems theory. Based on previous research in the field, we demonstrate how an initially multidimensional and complicated phenomenon can be explained and predicted by a relatively simple research model.

Methodology/approach – After the theoretical discussion, this paper develops a systemic research model for measuring service quality and customer value. The model suggests that input dimensions positively affect process dimensions, which subsequently have a positive effect on output quality. It shows the mechanisms by which factors outside the traditional service management domain impact service outcomes directly and indirectly. Quantitative empirical research was carried out to test the hypotheses inherent in the research model, and the data were analyzed using the partial least squares (PLS) method.

Findings – Study findings support the widespread idea that perceived service quality and customer value are grounded in the quality of the service process and also in critical input factors. Considering both direct and indirect effects, it seems that the most significant driver behind service quality and customer value is employee response, followed by employee assurance. Nevertheless, it is important to note that both tangibles and visuals (visually appealing physical facilities, equipment, and appearance of personnel), as well as information items (quality and accessibility of information and communication quality), are quite strong predictors of the associated process structures. These service attributes should not be rejected when in pursuit of a comprehensive quality policy in practice.

Research Implications – The chief contribution of this study to the research community is that a more definite conceptualization and explanation of service success can be found through general systems theory.

Practical Implications – Our advice to practitioners, and above all to service management, is that they must do everything in their power to increase the level of employee responsiveness. Without ignoring other dimensions in the quality system, the soft metrics inherent in employee assurance are valued highly by customers.

Originality/value – The research model reveals previously unrecognized interactions between eight constructs. Our data and the empirical tests confirm that the adopted approach explains service quality and customer value exceptionally well. In addition to the explanatory power of the proposed approach, due to the methodology chosen, it also has strong predictive power. Thus, the model can be used to predict observations for cases that are similar to the case used in the sample. Even though the specific focus of the study is on the manufacturer-distributor context, the results are applicable to service management in general.

Articles published on Soft Metrics

Physics simulation capabilities of LLMs

Metrization of soft metric spaces and its application to fixed point theory

Convergence : Partially ordered soft topological space

Efficient Iterative Timing Recovery of Low-Density Parity-Check Decoding Metrics Using the Steepest Descent Algorithm for Satellite Communications at Low SNRs

Performance Metrics for Fluidic Soft Robot Rotational Actuators.

Evaluating Service Quality and Service Value in Manufacturer-Distributor Settings: A Systems Approach

Defining Environments: Understanding Architectural Performance through Modelling, Simulation and Visualisation

Redeployment of clinical support to assist junior doctors: A cost analysis

Demonstrating the Value of Marketing

Leveraging Localized Social Media Insights for Early Warning Systems

Soft Metrics and Their Performance Analysis for Optimal Data Detection in the Presence of Strong Oscillator Phase Noise

Soft Metrics and EXIT Chart Analysis of Noncoherent MFSK with Diversity Reception in Rician Fading Channel

Performance of Soft Decision Decoded Synchronous FHSS Multiple Access Networks Using MFSK Modulation under Rayleigh Fading

ROMI-Driven Sales Promotions: How The Biggest Coca-Cola Bottler Outside Of The U.S. Learned How To Measure The Impact Of Their Sales Promotions

Soft metrics: What are they and what use are they for the Intelligence Community?

Soft-Decision Decoding in Asynchronous FH/SSMA Networks Using MFSK Modulation

Performance of soft metrics for convolutional coded asynchronous fast FHSS-MA networks using BFSK under rayleigh fading

Iterative multiuser detection

Internet Industry Mergers and Acquisitions

Adaptive soft-input soft-output algorithms for iterative detection with parametric uncertainty
