data mining

Research on Weibo marketing advertising push method based on social network data mining

The current advertising push methods have low accuracy and poor advertising conversion effects. Therefore, a Weibo marketing advertising push method based on social network data mining is studied. Firstly, establish a social network graph and use graph clustering algorithm to mine the association relationships of users in the network. Secondly, through sparsisation processing, the association between nodes in the social network graph is excavated. Then, evaluate the tightness between user preferences and other nodes in the social network, and use the TF-IDF algorithm to extract user interest features. Finally, an attention mechanism is introduced to improve the deep learning model, which matches user interests with advertising domain features and outputs push results. The experimental results show that the push accuracy of this method is higher than 95%, with a maximum advertising click through rate of 82.7% and a maximum advertising conversion rate of 60.7%.




data mining

Human resource management and organisation decision optimisation based on data mining

The utilisation of big data presents significant opportunities for businesses to create value and gain a competitive edge. This capability enables firms to anticipate and uncover information quickly and intelligently. The author introduces a human resource scheduling optimisation strategy using a parallel network fusion structure model. The author's approach involves designing a set of network structures based on parallel networks and streaming media, enabling the macro implementation of the enterprise parallel network fusion structure. Furthermore, the author proposes a human resource scheduling optimisation method based on a parallel deep learning network fusion structure. It combines convolutional neural networks and transformer networks to fuse streaming media features, thereby achieving comprehensive identification of the effectiveness of the current human resource scheduling in enterprises. The result shows that the macro and deep learning methods achieve a recognition rate of 87.53%, making it feasible to assess the current state of human resource scheduling in enterprises.




data mining

Design of data mining system for sports training biochemical indicators based on artificial intelligence and association rules

Physiological indicators are an important basis for reflecting the physiological health status of the human body and play an important role in medical practice. Association rules have also been one of the important research hotspots in recent years. This study aims to create a data mining system of association rules and artificial intelligence in biochemical indicators of sports training. This article uses Markov logic for network creation and system training, and tests whether the Markov logic network can be associated with the training system. The results show that the accuracy and recall rate obtained are about 90%, which shows that it is feasible to establish biochemical indicators of sports training based on Markov logic network, and the system has universal, guiding and constructive significance, ensuring that the construction of training system indicators will not go in the wrong direction.




data mining

International Journal of Data Mining and Bioinformatics




data mining

A Tools-Based Approach to Teaching Data Mining Methods




data mining

Using Educational Data Mining to Predict Students’ Academic Performance for Applying Early Interventions

Aim/Purpose: One of the main objectives of higher education institutions is to provide a high-quality education to their students and reduce dropout rates. This can be achieved by predicting students’ academic achievement early using Educational Data Mining (EDM). This study aims to predict students’ final grades and identify honorary students at an early stage. Background: EDM research has emerged as an exciting research area, which can unfold valuable knowledge from educational databases for many purposes, such as identifying the dropouts and students who need special attention and discovering honorary students for allocating scholarships. Methodology: In this work, we have collected 300 undergraduate students’ records from three departments of a Computer and Information Science College at a university located in Saudi Arabia. We compared the performance of six data mining methods in predicting academic achievement. Those methods are C4.5, Simple CART, LADTree, Naïve Bayes, Bayes Net with ADTree, and Random Forest. Contribution: We tested the significance of correlation attribute predictors using four different methods. We found 9 out of 18 proposed features with a significant correlation for predicting students’ academic achievement after their 4th semester. Those features are student GPA during the first four semesters, the number of failed courses during the first four semesters, and the grades of three core courses, i.e., database fundamentals, programming language (1), and computer network fundamentals. Findings: The empirical results show the following: (i) the main features that can predict students’ academic achievement are the student GPA during the first four semesters, the number of failed courses during the first four semesters, and the grades of three core courses; (ii) Naïve Bayes classifier performed better than Tree-based Models in predicting students’ academic achievement in general, however, Random Forest outperformed Naïve Bayes in predicting honorary students; (iii) English language skills do not play an essential role in students’ success at the college of Computer and Information Sciences; and (iv) studying an orientation year does not contribute to students’ success. Recommendations for Practitioners: We would recommend instructors to consider using EDM in predicting students’ academic achievement and benefit from that in customizing students’ learning experience based on their different needs. Recommendation for Researchers: We would highly endorse that researchers apply more EDM studies across various universities and compare between them. For example, future research could investigate the effects of offering tutoring sessions for students who fail core courses in their first semesters, examine the role of language skills in social science programs, and examine the role of the orientation year in other programs. Impact on Society: The prediction of academic performance can help both teachers and students in many ways. It also enables the early discovery of honorary students. Thus, well-deserved opportunities can be offered; for example, scholarships, internships, and workshops. It can also help identify students who require special attention to take an appropriate intervention at the earliest stage possible. Moreover, instructors can be aware of each student’s capability and customize the teaching tasks based on students’ needs. Future Research: For future work, the experiment can be repeated with a larger dataset. It could also be extended with more distinctive attributes to reach more accurate results that are useful for improving the students’ learning outcomes. Moreover, experiments could be done using other data mining algorithms to get a broader approach and more valuable and accurate outputs.




data mining

High quality management of higher education based on data mining

In order to improve the quality of higher education, student satisfaction, and employment rate, a data mining based high-quality management method for higher education is proposed. Firstly, construct a high-quality evaluation system for higher education based on the principles of education quality evaluation. Secondly, the association rule mining method is used to construct a university education quality management model and determine the weight of the impact indicators for high-quality management of university education. Finally, the fuzzy evaluation method is used to determine the high-quality evaluation function of higher education, and the results of high-quality evaluation of higher education are obtained. High-quality management strategies are developed based on the evaluation results to improve the quality of education. The experimental results show that the student satisfaction rate of this method can reach 99.3%, and the student employment rate can reach 99.9%.




data mining

Reflections on strategies for psychological health education for college students based on data mining

In order to improve the mental health level of college students, a data mining based mental health education strategy for college students is proposed. Firstly, analyse the characteristics of data mining and its potential value in mental health education. Secondly, after denoising the mental health data of college students using wavelet transform, data mining methods are used to identify the psychological crisis status of college students. Finally, based on the psychological crisis status of college students, measures for mental health education are proposed from the following aspects: building a psychological counselling platform, launching psychological health promotion activities, establishing a psychological support network, strengthening academic guidance and stress management. The example analysis results show that after the application of the strategy in this article, the psychological health scores of college students have been effectively improved, with an average score of 93.5 points.




data mining

A method for evaluating the quality of college curriculum teaching reform based on data mining

In order to improve the evaluation effect of current university teaching reform, a new method for evaluating the quality of university course teaching reform is proposed based on data mining algorithms. Firstly, the optimal data clustering criterion was used to select evaluation indicators and a quality evaluation system for university curriculum teaching reform was established. Next, a reform quality evaluation model is constructed using BP neural network, and the training process is improved through genetic algorithm to obtain the model weight and threshold of the optimal solution. Finally, the calculated parameters are substituted into the model to achieve accurate evaluation of the quality of university curriculum teaching reform. Selecting evaluation accuracy and evaluation efficiency as evaluation indicators, the practicality of the proposed method was verified through experiments. The experimental results showed that the proposed method can mine teaching reform data and evaluate the quality of teaching reform. Its evaluation accuracy is higher than 96.3%, and the evaluation time is less than 10ms, which is much better than the comparison method, fully demonstrating the practicality of the method.




data mining

A personalised recommendation method for English teaching resources on MOOC platform based on data mining

In order to enhance the accuracy of teaching resource recommendation results and optimise user experience, a personalised recommendation method for English teaching resources on the MOOC platform based on data mining is proposed. First, the learner's evaluation of resources and resource attributes are abstracted into the same space, and resource tags are established using the Knowledge graph. Then, interest preference constraints are introduced to mine sequential patterns of user historical learning behaviour in the MOOC platform. Finally, a graph neural network is used to construct a recommendation model, which adjusts users' short-term and short-term interest parameters to achieve dynamic personalised teaching recommendation resources. The experimental results show that the accuracy and recall of the resource recommendation results of the research method are always higher than 0.9, the normalised sorting gain is always higher than 0.5.




data mining

Prediction method of college students' achievements based on learning behaviour data mining

This paper proposes a method for predicting college students' performance based on learning behaviour data mining. The method addresses the issue of limited sample size affecting prediction accuracy. It utilises the K-means clustering algorithm to mine learning behaviour data and employs a density-based approach to determine optimal clustering centres, which are then output as the results of the clustering process. These clustering results are used as input for an attention encoder-decoder model to extract features from the learning behaviour sequence, incorporating an attention mechanism, sequence feature generator, and decoder. The characteristics derived from the learning behaviour sequence are then used to establish a prediction model for college students' performance, employing support vector regression. Experimental results demonstrate that this method accurately predicts students' performance with a relative error of less than 4% by leveraging the results obtained from learning behaviour data mining.




data mining

International Journal of Business Intelligence and Data Mining




data mining

A data mining method based on label mapping for long-term and short-term browsing behaviour of network users

In order to improve the speedup and recognition accuracy of the recognition process, this paper designs a data mining method based on label mapping for long-term and short-term browsing behaviour of network users. First, after removing the noise information in the behaviour sequence, calculate the similarity of behaviour characteristics. Then, multi-source behaviour data is mapped to the same dimension, and a behaviour label mapping layer and a behaviour data mining layer are established. Finally, the similarity of the tag matrix is calculated based on the similarity calculation results, and the mining results are output using SVM binary classification process. Experimental results show that the acceleration ratio of this method exceeds 0.9; area under curve receiver operating characteristic curve (AUC-ROC) value increases rapidly in a short time, and the maximum value can reach 0.95, indicating that the mining precision of this method is high.




data mining

Finding Diamonds in Data: Reflections on Teaching Data Mining from the Coal Face




data mining

Analyzing Computer Programming Job Trend Using Web Data Mining




data mining

A Data Mining Approach to Improve Re-Accessibility and Delivery of Learning Knowledge Objects




data mining

A data mining model to predict the debts with risk of non-payment in tax administration

One of the main tasks in tax administration is debt management. The main goal of this function is tax due collection. Statements are processed in order to select strategies to use in the debt management process to optimise the debt collection process. This work proposes to carry out a data mining process to predict debts of taxpayers with high probability of non-payment. The data mining process identifies high-risk debts using a survival analysis on a dataset from a tax administration. Three groups of tax debtors with similar payment behaviour were identified and a success rate of up to 90% was reached in estimating the payment time of taxpayers. The concordance index (C-index) was used to determine the performance of the constructed model. The highest prediction rate reached was 90.37% corresponding to the third group.




data mining

Data Mining: Making the Right Connections

While data mining can unearth a wealth of information, it takes discriminating analysis to make sure we are not just making connections, but the right connections.




data mining

Data Mining Techniques for the Life Sciences

Location: Electronic Resource- 




data mining

Data mining and machine learning in building energy analysis

Location: Engineering Library- QA76.9.D343M34 2016




data mining

Big data mining predicts toxicity better than animal tests




data mining

Data mining and model generation using an in-database analytic flow generator

Embodiments are described for a system and method of providing a data miner that decouples the analytic flow solution components from the data source. An analytic-flow solution then couples with the target data source through a simple set of data source connector, table and transformation objects, to perform the requisite analytic flow function. As a result, the analytic-flow solution needs to be designed only once and can be re-used across multiple target data sources. The analytic flow can be modified and updated at one place and then deployed for use on various different target data sources.




data mining

Data mining in a digital map database to identify blind intersections along roads and enabling precautionary actions in a vehicle

Disclosed is a feature for a vehicle that enables taking precautionary actions in response to conditions on the road network around or ahead of the vehicle, in particular, a blind intersection along a section of road. A database that represents the road network is used to determine locations where a blind intersection is located along a section of road. Then, precautionary action data is added to the database to indicate a location at which a precautionary action is to be taken about the blind intersection located along the section of road. A precautionary action system installed in a vehicle uses this database, or a database derived therefrom, in combination with a positioning system to determine when the vehicle is at a location that corresponds to the location of a precautionary action. When the vehicle is at such a location, a precautionary action is taken by a vehicle system as the vehicle is approaching a blind intersection.




data mining

Privacy-preserving aggregated data mining

An apparatus, system and method are introduced for preserving privacy of data in a dataset in a database with a number n of entries. In one embodiment, the apparatus includes memory including computer program code configured to, with a processor, cause the apparatus to form a random matrix of dimension m by n, wherein m is less than n, operate on the dataset with the random matrix to produce a compressed dataset, form a pseudoinverse of the random matrix, and operate on the dataset with the pseudoinverse of the random matrix to produce a decompressed dataset.




data mining

Predictive analytics, data mining and big data : myths, misconceptions and methods / Steven Finlay

Finlay, Steven, 1969-




data mining

Data mining and market intelligence: implications for decision making / Mustapha Akinkunmi

Online Resource




data mining

[ASAP] lipidr: A Software Tool for Data Mining and Analysis of Lipidomics Datasets

Journal of Proteome Research
DOI: 10.1021/acs.jproteome.0c00082




data mining

A scalable framework for integrated social data mining / James Meneghello

Meneghello, James, author




data mining

Mining your own business : a primer for executives on understanding and employing data mining and predictive analytics / Jeff Deal & Gerhard Pilcher ; foreword by Eric Siegel

Deal, Jeff, author




data mining

Human Capital Systems, Analytics, and Data Mining / by Robert C. Hughes

Online Resource




data mining

2019 International Conference on Data Mining Workshops (ICDMW) [electronic journal].

IEEE / Institute of Electrical and Electronics Engineers Incorporated




data mining

2019 International Conference on Data Mining Workshops (ICDMW) [electronic journal].

IEEE / Institute of Electrical and Electronics Engineers Incorporated




data mining

2019 IEEE International Conference on Data Mining (ICDM) [electronic journal].

IEEE / Institute of Electrical and Electronics Engineers Incorporated




data mining

2019 IEEE International Conference on Data Mining (ICDM) [electronic journal].

IEEE / Institute of Electrical and Electronics Engineers Incorporated




data mining

Data mining and data warehousing : principles and practical techniques / Parteek Bhatia

Bhatia, Parteek, author




data mining

Advances in knowledge discovery and data mining : 22nd Pacific-Asia Conference, PAKDD 2018, Melbourne, VIC, Australia, June 3-6, 2018, Proceedings. Parts I-III / Dinh Phung, Vincent S. Tseng, Geoffrey I. Webb, Bao Ho, Mohadeseh Ganji, Lida Rashidi (eds.)

Pacific-Asia Conference on Knowledge Discovery and Data Mining (22nd : 2018 : Melbourne, Vic.)




data mining

Data mining and big data : 4th International Conference, DMBD 2019, Chiang Mai, Thailand, July 26-30, 2019 : proceedings / Ying Tan, Yuhui Shi (eds.)

DMBD (Conference) (4th : 2019 : Chiang Mai, Thailand),




data mining

Mathematica Showcases Innovative Analysis, Data Mining, and Visualizations at Health Datapalooza

At this year’s Health Datapalooza in Washington, DC, Mathematica staff will showcase their expertise in data visualizations, machine learning, and data mining to help progress together on critical issues in today’s health policy environment.




data mining

Data mining in biomedicine [electronic resource] / edited by Panos M. Pardalos, Vladimir L. Boginski, Alkis Vazacopoulos

Boston, MA : Springer US, 2007




data mining

Data mining for biomedical applications [electronic resource] : PAKDD 2006 workshop, BioDM 2006, Singapore, April 9, 2006 : proceedings / Jinyan Li, Qiang Yang, Ah-Hwee Tan (eds.)

Berlin ; New York : Springer, ©2006




data mining

Flood forecasting using time series data mining




data mining

Statistical and data mining methodologies for behavioral analysis in transgenic mouse models of Alzheimer's disease




data mining

Big data mining predicts toxicity better than animal tests