TOWARD RAM FORENSICS SUPPORTEDBY MACHINE-LEARNING METHODS

  • Abstract
  • Literature Map
  • Similar Papers
Abstract
Translate article icon Translate Article Star icon
Take notes icon Take Notes

In this article, we propose an enhancement to the computer forensics technique of using Machine Learning tools to analyse the contents of RAM in order to extract information potentially useful during an investigation. In the specific case presented, the use of the extracted information to generate more optimal dictionaries for dictionary cryptanalysis is considered. Increasing user awareness is making cryptanalysis of passwords increasingly difficult for law enforcement. Long and complex passwords are impossible to crack, even when high-performance computing platforms are available. A sensible method of optimization is to look for hints to use a dictionary that contains text phrases more likely to be used in the specific case under attack. Such a hint could be an analysis of RAM taken from the suspect computer. Machine learning methods can significantly facilitate this task. In this article, we also explore the effectiveness of such an approach and its usefulness in practical applications. We also consider applications of the proposed approach for other purposes, such as OSINT.

Similar Papers
  • PDF Download Icon
  • Research Article
  • Cite Count Icon 18
  • 10.3390/cancers13143611
Assessing Versatile Machine Learning Models for Glioma Radiogenomic Studies across Hospitals
  • Jul 19, 2021
  • Cancers
  • Risa K Kawaguchi + 8 more

Simple SummaryRadiogenomics enables prediction of the status and prognosis of patients using non-invasively obtained imaging data. Current machine learning (ML) methods used in radiogenomics require huge datasets, which involve the handling of large heterogeneous datasets from multiple cohorts/hospitals. In this study, two different glioma datasets were used to test various ML and image pre-processing methods to confirm whether the models trained on one dataset are universally applicable to other datasets. Our result suggested that the ML method that yielded the highest accuracy in a single dataset was likely to be overfitted. We demonstrated that implementation of standardization and dimension reduction procedures prior to classification, enabled the development of ML methods that are less affected by the multiple cohort difference. We advocate using caution in interpreting the results of radiogenomic studies of the training and testing datasets that are small or mixed, with a view to implementing practical ML methods in radiogenomics.Radiogenomics use non-invasively obtained imaging data, such as magnetic resonance imaging (MRI), to predict critical biomarkers of patients. Developing an accurate machine learning (ML) technique for MRI requires data from hundreds of patients, which cannot be gathered from any single local hospital. Hence, a model universally applicable to multiple cohorts/hospitals is required. We applied various ML and image pre-processing procedures on a glioma dataset from The Cancer Image Archive (TCIA, n = 159). The models that showed a high level of accuracy in predicting glioblastoma or WHO Grade II and III glioma using the TCIA dataset, were then tested for the data from the National Cancer Center Hospital, Japan (NCC, n = 166) whether they could maintain similar levels of high accuracy. Results: we confirmed that our ML procedure achieved a level of accuracy (AUROC = 0.904) comparable to that shown previously by the deep-learning methods using TCIA. However, when we directly applied the model to the NCC dataset, its AUROC dropped to 0.383. Introduction of standardization and dimension reduction procedures before classification without re-training improved the prediction accuracy obtained using NCC (0.804) without a loss in prediction accuracy for the TCIA dataset. Furthermore, we confirmed the same tendency in a model for IDH1/2 mutation prediction with standardization and application of dimension reduction that was also applicable to multiple hospitals. Our results demonstrated that overfitting may occur when an ML method providing the highest accuracy in a small training dataset is used for different heterogeneous data sets, and suggested a promising process for developing an ML method applicable to multiple cohorts.

  • Research Article
  • Cite Count Icon 4
  • 10.1016/j.gfj.2024.100945
Unlocking the black box of sentiment and cryptocurrency: What, which, why, when and how?
  • Feb 15, 2024
  • Global Finance Journal
  • Donyetta Bennett + 3 more

Unlocking the black box of sentiment and cryptocurrency: What, which, why, when and how?

  • Research Article
  • Cite Count Icon 161
  • 10.1016/j.engappai.2023.105961
Applications of machine learning in friction stir welding: Prediction of joint properties, real-time control and tool failure diagnosis
  • Feb 14, 2023
  • Engineering Applications of Artificial Intelligence
  • Ammar H Elsheikh

Applications of machine learning in friction stir welding: Prediction of joint properties, real-time control and tool failure diagnosis

  • Research Article
  • Cite Count Icon 19
  • 10.1016/j.petsci.2022.09.002
A systematic machine learning method for reservoir identification and production prediction
  • Feb 1, 2023
  • Petroleum Science
  • Wei Liu + 3 more

Reservoir identification and production prediction are two of the most important tasks in petroleum exploration and development. Machine learning (ML) methods are used for petroleum-related studies, but have not been applied to reservoir identification and production prediction based on reservoir identification. Production forecasting studies are typically based on overall reservoir thickness and lack accuracy when reservoirs contain a water or dry layer without oil production. In this paper, a systematic ML method was developed using classification models for reservoir identification, and regression models for production prediction. The production models are based on the reservoir identification results. To realize the reservoir identification, seven optimized ML methods were used: four typical single ML methods and three ensemble ML methods. These methods classify the reservoir into five types of layers: water, dry and three levels of oil (Ⅰ oil layer, Ⅱ oil layer, Ⅲ oil layer). The validation and test results of these seven optimized ML methods suggest the three ensemble methods perform better than the four single ML methods in reservoir identification. The XGBoost produced the model with the highest accuracy; up to 99%. The effective thickness of Ⅰ and Ⅱ oil layers determined during the reservoir identification was fed into the models for predicting production. Effective thickness considers the distribution of the water and the oil resulting in a more reasonable production prediction compared to predictions based on the overall reservoir thickness. To validate the superiority of the ML methods, reference models using overall reservoir thickness were built for comparison. The models based on effective thickness outperformed the reference models in every evaluation metric. The prediction accuracy of the ML models using effective thickness were 10% higher than that of reference model. Without the personal error or data distortion existing in traditional methods, this novel system realizes rapid analysis of data while reducing the time required to resolve reservoir classification and production prediction challenges. The ML models using the effective thickness obtained from reservoir identification were more accurate when predicting oil production compared to previous studies which use overall reservoir thickness.

  • Research Article
  • Cite Count Icon 1
  • 10.3390/agriculture15010036
UAV-Multispectral Based Maize Lodging Stress Assessment with Machine and Deep Learning Methods
  • Dec 26, 2024
  • Agriculture
  • Minghu Zhao + 4 more

Maize lodging is a prevalent stress that can significantly diminish corn yield and quality. Unmanned aerial vehicles (UAVs) remote sensing is a practical means to quickly obtain lodging information at field scale, such as area, severity, and distribution. However, existing studies primarily use machine learning (ML) methods to qualitatively analyze maize lodging (lodging and non-lodging) or estimate the maize lodging percentage, while there is less research using deep learning (DL) to quantitatively estimate maize lodging parameters (type, severity, and direction). This study aims to introduce advanced DL algorithms into the maize lodging classification task using UAV-multispectral images and investigate the advantages of DL compared with traditional ML methods. This study collected a UAV-multispectral dataset containing non-lodging maize and lodging maize with different lodging types, severities, and directions. Additionally, 22 vegetation indices (VIs) were extracted from multispectral data, followed by spatial aggregation and image cropping. Five ML classifiers and three DL models were trained to classify the maize lodging parameters. Finally, we compared the performance of ML and DL models in evaluating maize lodging parameters. The results indicate that the Random Forest (RF) model outperforms the other four ML algorithms, achieving an overall accuracy (OA) of 89.29% and a Kappa coefficient of 0.8852. However, the maize lodging classification performance of DL models is significantly better than that of ML methods. Specifically, Swin-T performs better than ResNet-50 and ConvNeXt-T, with an OA reaching 96.02% and a Kappa coefficient of 0.9574. This can be attributed to the fact that Swin-T can more effectively extract detailed information that accurately characterizes maize lodging traits from UAV-multispectral data. This study demonstrates that combining DL with UAV-multispectral data enables a more comprehensive understanding of maize lodging type, severity, and direction, which is essential for post-disaster rescue operations and agricultural insurance claims.

  • Dissertation
  • 10.53846/goediss-6872
Context- and Physiology-aware Machine Learning for Upper-Limb Myocontrol
  • Feb 21, 2022
  • Gauravkumar K Patel

Context- and Physiology-aware Machine Learning for Upper-Limb Myocontrol

  • Research Article
  • Cite Count Icon 552
  • 10.1139/er-2020-0019
A review of machine learning applications in wildfire science and management
  • Jul 28, 2020
  • Environmental Reviews
  • Piyush Jain + 5 more

Artificial intelligence has been applied in wildfire science and management since the 1990s, with early applications including neural networks and expert systems. Since then, the field has rapidly progressed congruently with the wide adoption of machine learning (ML) methods in the environmental sciences. Here, we present a scoping review of ML applications in wildfire science and management. Our overall objective is to improve awareness of ML methods among wildfire researchers and managers, as well as illustrate the diverse and challenging range of problems in wildfire science available to ML data scientists. To that end, we first present an overview of popular ML approaches used in wildfire science to date and then review the use of ML in wildfire science as broadly categorized into six problem domains, including (i) fuels characterization, fire detection, and mapping; (ii) fire weather and climate change; (iii) fire occurrence, susceptibility, and risk; (iv) fire behavior prediction; (v) fire effects; and (vi) fire management. Furthermore, we discuss the advantages and limitations of various ML approaches relating to data size, computational requirements, generalizability, and interpretability, as well as identify opportunities for future advances in the science and management of wildfires within a data science context. In total, to the end of 2019, we identified 300 relevant publications in which the most frequently used ML methods across problem domains included random forests, MaxEnt, artificial neural networks, decision trees, support vector machines, and genetic algorithms. As such, there exists opportunities to apply more current ML methods — including deep learning and agent-based learning — in the wildfire sciences, especially in instances involving very large multivariate datasets. We must recognize, however, that despite the ability of ML models to learn on their own, expertise in wildfire science is necessary to ensure realistic modelling of fire processes across multiple scales, while the complexity of some ML methods such as deep learning requires a dedicated and sophisticated knowledge of their application. Finally, we stress that the wildfire research and management communities play an active role in providing relevant, high-quality, and freely available wildfire data for use by practitioners of ML methods.

  • Research Article
  • Cite Count Icon 3
  • 10.1259/bjr.20220373
Prediction of dose deposition matrix using voxel features driven machine learning approach.
  • Mar 6, 2023
  • The British Journal of Radiology
  • Shengxiu Jiao + 2 more

A dose deposition matrix (DDM) prediction method using several voxel features and a machine learning (ML) approach is proposed for plan optimization in radiation therapy. Head and lung cases with the inhomogeneous medium are used as training and testing data. The prediction model is a cascade forward backprop neural network where the input is the features of the voxel, including 1) voxel to body surface distance along the beamlet axis, 2) voxel to beamlet axis distance, 3) voxel density, 4) heterogeneity corrected voxel to body surface distance, 5) heterogeneity corrected voxel to beamlet axis, and (6) the dose of voxel obtained from the pencil beam (PB) algorithm. The output is the predicted voxel dose corresponding to a beamlet. The predicted DDM was used for plan optimization (ML method) and compared with the dose of MC-based plan optimization (MC method) and the dose of pencil beam-based plan optimization (PB method). The mean absolute error (MAE) value was calculated for full volume relative to the dose of the MC method to evaluate the overall dose performance of the final plan. For patient with head tumor, the ML method achieves MAE value 0.49 × 10-4 and PB has MAE 1.86 × 10-4. For patient with lung tumor, the ML method has MAE 1.42 × 10-4 and PB has MAE 3.72 × 10-4. The maximum percentage difference in PTV dose coverage (D98) between ML and MC methods is no more than 1.2% for patient with head tumor, while the difference is larger than 10% using the PB method. For patient with lung tumor, the maximum percentage difference in PTV dose coverage (D98) between ML and MC methods is no more than 2.1%, while the difference is larger than 16% using the PB method. In this work, a reliable DDM prediction method is established for plan optimization by applying several voxel features and the ML approach. The results show that the ML method based on voxel features can obtain plans comparable to the MC method and is better than the PB method in achieving accurate dose to the patient, which is helpful for rapid plan optimization and accurate dose calculation. Establishment of a new machine learning method based on the relationship between the voxel and beamlet features for dose deposition matrix prediction in radiation therapy.

  • Research Article
  • Cite Count Icon 70
  • 10.1007/s12039-021-01995-2
Artificial intelligence: machine learning for chemical sciences.
  • Dec 21, 2021
  • Journal of chemical sciences (Bangalore, India)
  • Akshaya Karthikeyan + 1 more

Research in molecular sciences witnessed the rise and fall of Artificial Intelligence (AI)/ Machine Learning (ML) methods, especially artificial neural networks, few decades ago. However, we see a major resurgence in the use of modern ML methods in scientific research during the last few years. These methods have had phenomenal success in the areas of computer vision, speech recognition, natural language processing (NLP), etc. This has inspired chemists and biologists to apply these algorithms to problems in natural sciences. Availability of high performance Graphics Processing Unit (GPU) accelerators, large datasets, new algorithms, and libraries has enabled this surge. ML algorithms have successfully been applied to various domains in molecular sciences by providing much faster and sometimes more accurate solutions compared to traditional methods like Quantum Mechanical (QM) calculations, Density Functional Theory (DFT) or Molecular Mechanics (MM) based methods, etc. Some of the areas where the potential of ML methods are shown to be effective are in drug design, prediction of high–level quantum mechanical energies, molecular design, molecular dynamics materials, and retrosynthesis of organic compounds, etc. This article intends to conceptually introduce various modern ML methods and their relevance and applications in computational natural sciences.Graphical abstract Synopsis Recent surge in the application of machine learning (ML) methods in fundamental sciences has led to a perspective that these methods may become important tools in chemical science. This perspective provides an overview of the modern ML methods and their successful applications in chemistry during the last few years.

  • Research Article
  • Cite Count Icon 588
  • 10.1109/tcyb.2019.2950779
A Survey of Optimization Methods From a Machine Learning Perspective.
  • Nov 18, 2019
  • IEEE Transactions on Cybernetics
  • Shiliang Sun + 3 more

Machine learning develops rapidly, which has made many theoretical breakthroughs and is widely applied in various fields. Optimization, as an important part of machine learning, has attracted much attention of researchers. With the exponential growth of data amount and the increase of model complexity, optimization methods in machine learning face more and more challenges. A lot of work on solving optimization problems or improving optimization methods in machine learning has been proposed successively. The systematic retrospect and summary of the optimization methods from the perspective of machine learning are of great significance, which can offer guidance for both developments of optimization and machine learning research. In this article, we first describe the optimization problems in machine learning. Then, we introduce the principles and progresses of commonly used optimization methods. Finally, we explore and give some challenges and open problems for the optimization in machine learning.

  • Research Article
  • Cite Count Icon 18
  • 10.1016/j.conbuildmat.2022.129116
Inference of mechanical properties and structural grades of bamboo by machine learning methods
  • Nov 1, 2022
  • Construction and Building Materials
  • Juan F Correal + 3 more

Inference of mechanical properties and structural grades of bamboo by machine learning methods

  • Research Article
  • Cite Count Icon 2
  • 10.1016/j.psj.2024.104489
An investigation of machine learning methods applied to genomic prediction in yellow-feathered broilers
  • Nov 1, 2024
  • Poultry Science
  • Bogong Liu + 6 more

An investigation of machine learning methods applied to genomic prediction in yellow-feathered broilers

  • Research Article
  • Cite Count Icon 9
  • 10.1016/j.imu.2022.100861
The prediction power of machine learning on estimating the sepsis mortality in the intensive care unit
  • Jan 1, 2022
  • Informatics in Medicine Unlocked
  • Mehtap Selcuk + 2 more

The prediction power of machine learning on estimating the sepsis mortality in the intensive care unit

  • Book Chapter
  • 10.1093/acrefore/9780199389414.013.625
Machine Learning Tools for Water Resources Modeling and Management
  • Mar 20, 2024
  • Giorgio Guariso + 1 more

The pervasive diffusion of information and communication technologies that has characterized the end of the 20th and the beginning of the 21st centuries has profoundly impacted the way water management issues are studied. The possibility of collecting and storing large data sets has allowed the development of new classes of models that try to infer the relationships between the variables of interest directly from data rather than fit the classical physical and chemical laws to them. This approach, known as “data-driven,” belongs to the broader area of machine learning (ML) methods and can be applied to many water management problems. In hydrological modeling, ML tools can process diverse data sets, including satellite imagery, meteorological data, and historical records, to enhance predictions of streamflow, groundwater levels, and water availability and thus support water allocation, infrastructure planning, and operational decision-making. In water demand management, ML models can analyze historical water consumption patterns, weather data, and socioeconomic factors to predict future water demands. These models can support water utilities and policymakers in optimizing water allocation, planning infrastructure, and implementing effective conservation strategies. In reservoir management, advanced ML tools may be used to determine the operating rule of water structures by directly searching for the management policy or by mimicking a set of decisions with some desired properties. They may also be used to develop surrogate models that can be rapidly executed to determine the optimal course of action as a component of a decision-support system. ML methods have revolutionized water management studies by showing the power of data-driven insights. Thanks to their ability to make accurate forecasts, enhanced monitoring, and optimized resource allocation, adopting these tools is predicted to expand and consistently modify water management practices. Continued advancements in ML tools, data availability, and interdisciplinary collaborations will further propel the use of ML methods to address global water challenges and pave the way for a more resilient and sustainable water future.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 151
  • 10.3390/rs11212575
Landslide Detection Using Multi-Scale Image Segmentation and Different Machine Learning Models in the Higher Himalayas
  • Nov 2, 2019
  • Remote Sensing
  • Sepideh Tavakkoli Piralilou + 7 more

Landslides represent a severe hazard in many areas of the world. Accurate landslide maps are needed to document the occurrence and extent of landslides and to investigate their distribution, types, and the pattern of slope failures. Landslide maps are also crucial for determining landslide susceptibility and risk. Satellite data have been widely used for such investigations—next to data from airborne or unmanned aerial vehicle (UAV)-borne campaigns and Digital Elevation Models (DEMs). We have developed a methodology that incorporates object-based image analysis (OBIA) with three machine learning (ML) methods, namely, the multilayer perceptron neural network (MLP-NN) and random forest (RF), for landslide detection. We identified the optimal scale parameters (SP) and used them for multi-scale segmentation and further analysis. We evaluated the resulting objects using the object pureness index (OPI), object matching index (OMI), and object fitness index (OFI) measures. We then applied two different methods to optimize the landslide detection task: (a) an ensemble method of stacking that combines the different ML methods for improving the performance, and (b) Dempster–Shafer theory (DST), to combine the multi-scale segmentation and classification results. Through the combination of three ML methods and the multi-scale approach, the framework enhanced landslide detection when it was tested for detecting earthquake-triggered landslides in Rasuwa district, Nepal. PlanetScope optical satellite images and a DEM were used, along with the derived landslide conditioning factors. Different accuracy assessment measures were used to compare the results against a field-based landslide inventory. All ML methods yielded the highest overall accuracies ranging from 83.3% to 87.2% when using objects with the optimal SP compared to other SPs. However, applying DST to combine the multi-scale results of each ML method significantly increased the overall accuracies to almost 90%. Overall, the integration of OBIA with ML methods resulted in appropriate landslide detections, but using the optimal SP and ML method is crucial for success.

Save Icon
Up Arrow
Open/Close
  • Ask R Discovery Star icon
  • Chat PDF Star icon

AI summaries and top papers from 250M+ research sources.