A Survey on Ensemble Learning for Data Stream Classification

Ensemble-based methods are among the most widely used techniques for data stream classification. Their popularity is attributable to their good...

Data-Driven Techniques in Disaster Information Management

Improving disaster management and recovery techniques is one of the national priorities given the huge...

Business Process Variability Modeling

It is common for organizations to maintain multiple variants of a given business process, such as multiple sales processes for different products or multiple bookkeeping processes for different countries. Conventional business process modeling languages do not explicitly support the representation of such families of process variants. This gap...

Wireless Body Area Network (WBAN)

Wireless Body Area Network (WBAN) has been a key element in e-health to monitor bodies. This technology enables new applications under the umbrella of different domains, including the medical field, the entertainment and ambient intelligence areas. This survey paper places substantial emphasis on the concept and key features of the WBAN technology....

Evaluation of Knowledge Gaps in Mathematical Applications of Thermal Image Processing Techniques for Fire Prevention

In this article, we present literature reviews on fire prevention methods, especially in mining...

A Survey on Data-Flow Testing

Data-flow testing (DFT) is a family of testing strategies designed to verify the interactions between each program variable’s definition and its uses. Such a test objective of interest is referred to as a def-use pair. DFT selects test data with respect to various test adequacy criteria (i.e., data-flow coverage criteria) to exercise each...

A Survey of Timing Channels and Countermeasures

A timing channel is a communication channel that can transfer information to a receiver/decoder by modulating the timing behavior of an entity. Examples of this entity include the interpacket delays of a packet stream, the reordering of packets in a packet stream, or the resource access time of a cryptographic module. Advances in the information and...

Presentation Attack Detection Methods for Face Recognition Systems

The vulnerability of face recognition systems to presentation attacks (also known as direct attacks or spoof attacks) has received a great deal of...

Hypervideos and Interactive Multimedia Presentations

Hypervideos and interactive multimedia presentations allow the creation of fully interactive and enriched video. It is possible to organize video...

Surviving the Web

In this article, we survey the most common attacks against web sessions, that is, attacks that target honest web browser users establishing an authenticated session with a trusted web application. We then review existing security solutions that prevent or mitigate the different attacks by evaluating them along four different axes: protection,...

CASE Tool Support for Variability Management in Software Product Lines

Software product lines (SPL) aim at reducing time-to-market and increasing software quality through extensive, planned reuse of artifacts. An...


About CSUR

ACM Computing Surveys (CSUR) publishes comprehensive, readable tutorials and survey papers that give guided tours through the literature and explain topics to those who seek to learn the basics of areas outside their specialties. These carefully planned and presented introductions are also an excellent way for professionals to develop perspectives on, and identify trends in, complex technologies. Recent issues have covered image understanding, software reusability, and object and relational database topics.

Forthcoming Articles
A Survey on Malware Detection Using Data Mining Techniques

In the Internet age, malware has posed a serious and evolving security threat to Internet users. To protect legitimate users from these threats, anti-malware software products from different companies provide the most significant defense against malware. Unfortunately, driven by the economic benefits, the number of new malware samples has explosively increased: anti-malware vendors are now confronted with millions of potential malware samples per year. To keep pace with the increase in malware samples, there is an urgent need to develop intelligent methods for effective and efficient malware detection from the real and large daily sample collection. In this survey paper, we first provide a brief overview of malware as well as the anti-malware industry, and present the industrial needs for malware detection. We then survey intelligent malware detection methods. In these methods, the process of detection is usually divided into two stages: feature extraction and classification/clustering. The performance of such intelligent malware detection approaches critically depends on the extracted features and the methods for classification/clustering. We provide a comprehensive investigation of both the feature extraction and the classification/clustering techniques. We also discuss the additional issues and the challenges of malware detection using data mining techniques and finally forecast the trends of malware development.

Arabic Online Handwriting Recognition (AOHR): A Survey

This article comprehensively surveys Arabic Online Handwriting Recognition (AOHR). We address the challenges posed by online handwriting recognition, including ligatures, dots and diacritic problems, online/offline touching of text, and geometric variations. Then, we present a general model of an AOHR system that incorporates the different phases of such a system. We summarize the main AOHR databases and identify their uses and limitations. Preprocessing techniques that are used in AOHR, viz. normalization, smoothing, de-hooking, baseline identification, and delayed stroke processing, are presented with illustrative examples. We discuss different techniques for Arabic online handwriting segmentation at the character and morpheme levels and identify their limitations. Feature extraction techniques that are used in AOHR are discussed and their challenges identified. We address the classification techniques of non-cursive (characters and digits) and cursive Arabic online handwriting and analyze their applications. We discuss different classification techniques, viz. structural approaches, SVM, Fuzzy SVM, Neural Networks, HMM, Genetic algorithms, decision trees, and rule-based systems, and analyze their performance. Post-processing techniques are also discussed. Several tables that summarize the surveyed publications are provided for ease of reference and comparison. In the conclusions, we summarize the current limitations and difficulties of AOHR, and future directions of research.

Game Theory for Cyber Security and Privacy

In this survey, we review the existing game-theoretic approaches for cyber security and privacy issues, categorizing their application into two classes, security and privacy. To show how game theory is utilized in cyberspace security and privacy, we select research regarding three main applications: cyber-physical security, communication security, and privacy. We present game models, features and solutions of the selected works and describe their advantages and limitations from design to implementation of the defense mechanisms. We also identify some emerging trends and topics for future research. This survey not only demonstrates how to employ game-theoretic approaches to security and privacy, but also encourages researchers to employ game theory to establish a comprehensive understanding of emerging security and privacy problems in cyberspace and potential solutions.

A Survey on Reliability Protocols in Wireless Sensor Networks

Wireless Sensor Network (WSN) applications are becoming more and more attractive with the miniaturisation of circuits and the large variety of sensors. The different application domains, especially critical fields of WSN use, make the reliability of data acquisition and communication a hot research field that must be tackled efficiently. Indeed, the quality of widely used, low-cost wireless sensors and their scarce energy supply give rise to reliability challenges that lead to data loss or corruption. To solve this problem, a reliability mechanism that detects these shortcomings and recovers from them becomes necessary. In this paper, we present a survey of existing reliability protocols designed especially for WSNs in view of their special features. The in-depth classification and discussion throughout this study help in understanding the pros and cons of state-of-the-art works in order to enhance the existing schemes and fill the gaps. We have classified the works according to the required level of reliability, the way the origins of reliability loss are identified, and the control used to recover from this loss. From the discussion in this study, we deduce that cross-layer design between the MAC, routing and transport layers is a good concept for efficiently overcoming the different reliability holes.

Hashing Techniques: A Survey and Taxonomy

With the rapid development of information storage and networking technologies, quintillions of bytes of data are generated every day from social networks, business transactions, sensors, and many other domains. The increasing data volumes impose significant challenges on traditional data analysis tools in storing, processing, and analysing these extremely large-scale data. For decades, hashing has been one of the most effective tools commonly used to compress data for fast access and analysis. Hashing techniques have also evolved from simple randomization approaches to advanced adaptive methods that consider locality, structure, or label information of the data for effective hashing. This survey reviews and categorizes existing hashing techniques as a taxonomy, in order to provide a comprehensive view of mainstream hashing techniques for different types of data and domain applications. The taxonomy also studies the uniqueness of each method and can therefore serve as a technical reference for understanding the niche of different hashing mechanisms for future development.

Online Algorithms with Advice: A Survey

Online algorithms with advice is an area of research where one attempts to measure how much knowledge of the future is necessary to achieve a given competitive ratio. The lower bound results give robust bounds on what is possible using semi-online algorithms. On the other hand, when the advice is of an obtainable form, algorithms using advice can lead to semi-online algorithms. This survey explains the models, motivates the study in general, presents some examples of the work that has been carried out, and includes a fairly complete set of references, organized by problem studied.

Simulation and Experimentation Platforms for Underwater Acoustic Sensor Networks: Advancements and Challenges

Oceans and other bodies of water cover the major part of our planet. To make the best use of the underlying resources in these parts of the earth, researchers have made some advances. Specifically, research on underwater wireless acoustic sensor networks (UWA-SNs) has made great progress. However, wide deployment of UWA-SNs is far from reality for several reasons. One important reason is that offshore deployment and field-level experiments of ocean-centric applications are both expensive and labor-intensive. An alternative is to conduct simulation or experimentation, which can reduce cost and accelerate research activities and their outcomes. However, designing efficient and reliable simulation and experimentation platforms has proved to be more challenging than expected. In this paper, we explore the main techniques (and their pros and cons) and components used to develop simulation and experimentation platforms, and provide a comprehensive survey report in this area. We classify simulation and experimentation platforms based on some typical criteria, and then provide useful guidelines for researchers on choosing suitable platforms in accordance with their requirements. Finally, we address some open and unresolved issues in this context, and provide some suggestions on future research.

Smart Computing and Sensing Technologies for Animal Welfare: A Systematic Review

Animals play a profoundly important and intricate role in our lives today. Dogs have been human companions for thousands of years, but they now work to assist the disabled, and in combat and search and rescue situations. Farm animals are a critical part of sustainable agriculture today, and there is increasing consumer interest in humanely raised livestock, and how it impacts our health and environmental footprint. Wild animals are threatened with extinction by human induced factors, and shrinking and compromised habitat. This review sets the goal to systematically survey the existing literature in smart computing and sensing technologies for domestic, farm and wild animal welfare. We use the notion of "animal welfare" in broad terms, to review the technologies for assessing whether animals are healthy, free of pain and suffering, and also positively stimulated in their environment. Also the notion of "smart computing and sensing" is used in broad terms, to refer to computing and sensing systems that are not isolated but interconnected with communication networks, and capable of remote data collection, processing, exchange and analysis. The findings of this review are expected to motivate future research in computer science and engineering, as well as contribute to data, information and communication management for animal welfare.

Systematic Review of Software Behavioral Model Consistency Checking

In software development, models are often used to represent multiple views of the same system. Such models need to be properly related to each other in order to provide a consistent description of the developed system. Models may contain contradictory system specifications, for instance, when they evolve independently. Therefore, it is crucial to ensure that models conform to each other. In this context, we focus on consistency checking of behavior models. Several techniques and approaches have been proposed in the existing literature to support behavioral model consistency checking. This paper presents a Systematic Literature Review (SLR) that was carried out to obtain an overview of the various consistency concepts, problems, and solutions proposed regarding behavior models. In our study, the identification and selection of the primary studies were based on a well-planned search strategy. The search process identified a total of 1770 primary studies, out of which 96 have been thoroughly analyzed according to our predefined SLR protocol. The SLR aims to highlight the state of the art in software behavior model consistency checking and identify potential gaps for future research.

Changes as First Class Citizens: A Research Perspective on Modern Software Tooling

Software must evolve to keep up with an ever-changing context, the real world. We discuss an emergent trend in software evolution research revolving around the central notion that drives evolution: Change. By reifying change, and by modelling it as a first-class entity, researchers can now analyse the complex phenomenon known as software evolution with an unprecedented degree of accuracy. We present a Systematic Mapping Study of 86 papers to give an overview on the state of the art in this area of research and present a roadmap with open issues and future directions.

Imitation Learning: A Survey of Learning Methods

Imitation learning techniques aim to model human behavior in a given task. An agent is trained to perform a task from demonstrations by learning a mapping between observations and actions. The idea of teaching by imitation has been around for many years; however, the field has recently been gaining attention due to advances in computing and sensing as well as rising demand for intelligent applications. The paradigm of learning by imitation is gaining popularity because it facilitates teaching complex tasks with minimal expert knowledge of the tasks. However, specialized algorithms are needed to effectively and robustly learn models, as learning by imitation poses its own set of challenges. In this paper, we survey imitation learning methods and present design options in different steps of the learning process. We introduce a background and motivation for the field as well as highlight challenges specific to the imitation problem. Methods for designing and evaluating imitation learning tasks are categorized and reviewed. We extensively discuss combining imitation learning approaches using different sources and methods, as well as incorporating other motion learning methods to enhance imitation. We also discuss the potential impact on industry, present major applications and highlight current and future research directions.

Deep Learning Advances in Computer Vision with 3D Data: A Survey

Deep learning has recently gained popularity, achieving state-of-the-art performance in tasks involving text, sound or image processing. Due to its outstanding performance, there have been efforts to apply it in more challenging scenarios, e.g. 3D data processing. This article surveys methods applying deep learning on 3D data and provides a classification based on how they exploit it. From the results of the examined works, we conclude that systems employing 2D views of 3D data typically surpass voxel-based (3D) deep models, which, however, can perform better with more layers and severe data augmentation. Therefore, larger-scale datasets and increased resolutions are required.

A Survey of Techniques for Cache Partitioning in Multicore Processors

As the number of on-chip cores and memory demands of applications increase, judicious management of cache resources has become not merely attractive but even imperative. Cache partitioning, i.e. dividing cache space between applications based on their memory demands, is a promising approach to provide capacity benefits of shared cache with performance isolation of private caches. However, naively partitioning the cache may lead to performance loss, unfairness and lack of QoS (quality-of-service) guarantees. It is clear that intelligent techniques are required for realizing the full potential of cache partitioning. In this paper, we present a survey of techniques for partitioning shared caches in multicore processors. We categorize the techniques based on important characteristics and provide a bird's eye view of the field of cache partitioning.

A Taxonomy and Survey of Cloud Resource Orchestration Techniques

Cloud services and applications prove indispensable amidst today's modern utility-based computing. The cloud has displayed disruptive and growing impact on everyday computing tasks. However, facilitating the orchestration of cloud resources in order to build such cloud services and applications is yet to unleash its entire magnitude of power. Accordingly, it is paramount to devise a unified and comprehensive analysis framework in order to accelerate fundamental understanding of cloud resource orchestration in terms of concepts, paradigms, languages, models and tools. This framework is essential to empower effective research, comprehension, comparison and selection of cloud resource orchestration models, languages, platforms and tools. This article provides such a comprehensive framework whilst analyzing the relevant state-of-the-art in cloud resource orchestration from a novel and holistic viewpoint.

A Survey on Reinforcement Learning Models and Algorithms for Traffic Signal Control

Traffic congestion has been an inevitable and complex issue in most urban areas. Of particular interest are the intersections where traffic bottlenecks are known to occur despite being traditionally signalized. Reinforcement learning (RL), which is an artificial intelligence approach, has been adopted in traffic signal control for monitoring and ameliorating traffic congestion. RL enables autonomous decision makers, such as a traffic signal controller, to observe, learn and select the optimal action, such as the appropriate traffic phase and its timing, to manage traffic in order to improve system performance. This article reviews various RL models and algorithms applied to traffic signal control with respect to the representations of the RL model (i.e., state, action and reward), performance enhancement, and complexity, to establish a foundation for further investigation in this research field. Open issues are presented towards the end of this article to identify new research areas, with the objective of sparking new interest in this research field.

Fog Computing for Sustainable Smart Cities: A Survey

Internet of Things (IoT) objects are expected to generate large amounts of data and send the data to the cloud for further processing, especially for knowledge discovery, so that appropriate actions can be taken. However, in reality, sensing all possible data items captured by a smart object and then sending them in full to the cloud is less useful. Further, such an approach would also lead to resource wastage (e.g. network, storage, etc.). The Fog (Edge) computing paradigm has been proposed to counter this weakness by pushing processes of knowledge discovery using data analytics to the edges. However, edge devices have limited computational capabilities. Due to their inherent strengths and weaknesses, neither the Cloud computing nor the Fog computing paradigm addresses these challenges alone. In this paper, we review existing approaches that have been proposed to tackle the challenges in the Fog computing domain. Specifically, we describe several inspiring use case scenarios of Fog computing, identify ten key characteristics and common features of Fog computing, and compare more than 30 existing research efforts in this domain. Based on our review, we further identify several major functionalities that ideal Fog computing platforms should support and a number of open challenges towards impleme

A Survey on Financial Applications of Metaheuristics

Modern heuristics or metaheuristics are optimization algorithms that have been increasingly used during the last decades to support complex decision making in a number of fields, such as logistics and transportation, telecommunication networks, bioinformatics, finance, etc. The continuous increase in computing power, together with advancements in metaheuristics frameworks and parallelization strategies, is establishing these types of algorithms as one of the best alternatives for solving rich and real-life combinatorial optimization problems that arise in a number of financial and banking activities. This paper reviews some of the works related to the use of metaheuristics in solving both classical and emergent problems in the finance arena. A non-exhaustive list of examples includes rich portfolio optimization, index tracking, enhanced indexation, credit risk, stock investments, financial project scheduling, option pricing, feature selection, bankruptcy and financial distress prediction, and credit risk assessment. The paper also discusses some open opportunities for researchers in the field, and forecasts the evolution of metaheuristics to include real-life uncertainty conditions in the optimization problems being considered.

GPU Virtualization and Scheduling Methods: A Comprehensive Survey

The integration of Graphics Processing Units (GPUs) on high-end compute nodes has established a new accelerator-based heterogeneous computing model which now permeates High Performance Computing. The same paradigm, though, has seen limited adoption in cloud computing and other large-scale distributed computing paradigms. Heterogeneous computing with GPUs can benefit the Cloud by reducing operational costs and improving resource and energy efficiency. However, such a paradigm shift would require effective methods for virtualizing GPUs, as well as other accelerators. In this survey paper, we present an extensive and in-depth survey of GPU virtualization techniques and their scheduling methods. We review a wide range of virtualization techniques implemented at the GPU library, driver, and hardware level. Furthermore, we review GPU scheduling methods that address performance and fairness issues between multiple virtual machines sharing GPUs. We believe that our survey delivers a perspective on the challenges and opportunities for virtualization of heterogeneous computing environments.


Publication Years 1969-2017
Publication Count 1590
Citation Count 48587
Available for Download 1590
Downloads (6 weeks) 25861
Downloads (12 Months) 202214
Downloads (cumulative) 3094388
Average downloads per article 1946
Average citations per article 31
