Metalearning
This open access book offers a comprehensive and thorough introduction to almost all aspects of metalearning and automated machine learning (AutoML), covering the basic concepts and architecture, evaluation, datasets, hyperparameter optimization, ensembles and workflows, and also how this knowledge can be used to select, combine, compose, adapt and configure both algorithms and models to yield faster and better solutions to data mining and data science problems. It can thus help developers to develop systems that can improve themselves through experience.As one of the fastest-growing areas of research in machine learning, metalearning studies principled methods to obtain efficient models and solutions by adapting machine learning and data mining processes. This adaptation usually exploits information from past experience on other tasks and the adaptive processes can involve machine learning approaches. As a related area to metalearning and a hot topic currently, AutoML is concerned with automating the machine learning processes. Metalearning and AutoML can help AI learn to control the application of different learning methods and acquire new solutions faster without unnecessary interventions from the user.This book is a substantial update of the first edition published in 2009. It includes 18 chapters, more than twice as much as the previous version. This enabled the authors to cover the most relevant topics in more depth and incorporate the overview of recent research in the respective area. The book will be of interest to researchers and graduate students in the areas of machine learning, data mining, data science and artificial intelligence.
Community Detection and Mining in Social Media
The past decade has witnessed the emergence of participatory Web and social media, bringing people together in many creative ways. Millions of users are playing, tagging, working, and socializing online, demonstrating new forms of collaboration, communication, and intelligence that were hardly imaginable just a short time ago. Social media also helps reshape business models, sway opinions and emotions, and opens up numerous possibilities to study human interaction and collective behavior in an unparalleled scale. This lecture, from a data mining perspective, introduces characteristics of social media, reviews representative tasks of computing with social media, and illustrates associated challenges. It introduces basic concepts, presents state-of-the-art algorithms with easy-to-understand examples, and recommends effective evaluation methods. In particular, we discuss graph-based community detection techniques and many important extensions that handle dynamic, heterogeneous networks in social media. We also demonstrate how discovered patterns of communities can be used for social media mining. The concepts, algorithms, and methods presented in this lecture can help harness the power of social media and support building socially-intelligent systems. This book is an accessible introduction to the study of \emph{community detection and mining in social media}. It is an essential reading for students, researchers, and practitioners in disciplines and applications where social media is a key source of data that piques our curiosity to understand, manage, innovate, and excel. This book is supported by additional materials, including lecture slides, the complete set of figures, key references, some toy data sets used in the book, and the source code of representative algorithms. The readers are encouraged to visit the book website for the latest information. Table of Contents: Social Media and Social Computing / Nodes, Ties, and Influence / Community Detection and Evaluation / Communities in Heterogeneous Networks / Social Media Mining
Correlation Clustering
Given a set of objects and a pairwise similarity measure between them, the goal of correlation clustering is to partition the objects in a set of clusters to maximize the similarity of the objects within the same cluster and minimize the similarity of the objects in different clusters. In most of the variants of correlation clustering, the number of clusters is not a given parameter; instead, the optimal number of clusters is automatically determined. Correlation clustering is perhaps the most natural formulation of clustering: as it just needs a definition of similarity, its broad generality makes it applicable to a wide range of problems in different contexts, and, particularly, makes it naturally suitable to clustering structured objects for which feature vectors can be difficult to obtain. Despite its simplicity, generality, and wide applicability, correlation clustering has so far received much more attention from an algorithmic-theory perspective than from the data-mining community. The goal of this lecture is to show how correlation clustering can be a powerful addition to the toolkit of a data-mining researcher and practitioner, and to encourage further research in the area.
Web Information Systems Engineering - Wise 2022
This book constitutes the proceedings of the 23nd International Conference on Web Information Systems Engineering, WISE 2021, held in Biarritz, France, in November 2022. The 31 full, 13 short and 3 demo papers were carefully reviewed and selected from 94 submissions. The papers are organized in the following topical sections: Social Media, Spatial & Temporal Issues, Query Processing & Information Extraction, Architecture and Performance, Graph Data Management, Security & Privacy, Information Retrieval & Text Processing, Reinforcement Learning, Learning & Optimization, Spatial Data Processing, Recommendation, Neural Networks, and Demo Papers.
IoT Edge Computing with MicroK8s
A step-by-step, comprehensive guide that includes real-world use cases to help you successfully develop and run applications and mission-critical workloads using MicroK8sKey Features: - An easy-to-follow guide that helps you get started with MicroK8s and other Kubernetes components- Understand the key concepts and constraints for building IoT and edge architectures- Get guidance on how to develop and deploy use cases and examples on IoT and edge computing platformsBook Description: Are you facing challenges with developing, deploying, monitoring, clustering, storing, securing, and managing Kubernetes in production environments as you're not familiar with infrastructure technologies? MicroK8s - a zero-ops, lightweight, and CNCF-compliant Kubernetes with a small footprint is the apt solution for you.This book gets you up and running with production-grade, highly available (HA) Kubernetes clusters on MicroK8s using best practices and examples based on IoT and edge computing.Beginning with an introduction to Kubernetes, MicroK8s, and IoT and edge computing architectures, this book shows you how to install, deploy sample apps, and enable add-ons (like DNS and dashboard) on the MicroK8s platform. You'll work with multi-node Kubernetes clusters on Raspberry Pi and networking plugins (such as Calico and Cilium) and implement service mesh, load balancing with MetalLB and Ingress, and AI/ML workloads on MicroK8s. You'll also understand how to secure containers, monitor infrastructure and apps with Prometheus, Grafana, and the ELK stack, manage storage replication with OpenEBS, resist component failure using a HA cluster, and more, as well as take a sneak peek into future trends.By the end of this book, you'll be able to use MicroK8 to build and implement scenarios for IoT and edge computing workloads in a production environment.What You Will Learn: - Get a holistic view of MicroK8s features using a sample application- Understand IoT and edge computing and their architecture constraints- Create, scale, and update HA Raspberry Pi multi-node clusters- Implement AI/ML use cases with the Kubeflow platform- Work with various networking plugins, and monitoring and logging tools- Perform service mesh integrations using Istio and Linkerd- Run serverless applications using Knative and OpenFaaS frameworks- Secure your containers using Kata and strict confinement optionsWho this book is for: This book is for DevOps and cloud engineers, SREs, and application developers who want to implement efficient techniques for deploying their software solutions. It will also be useful for technical architects and technology leaders who are looking to adopt cloud-native technologies. A basic understanding of container-based application design and development, virtual machines, networking, databases, and programming will be helpful for using this book.Table of Contents- Getting Started with Kubernetes- Introducing MicroK8s- Essentials of IoT and Edge Computing- Handling the Kubernetes Platform for IoT and Edge Computing- Creating and Implementing Updates on Multi-Node Raspberry Pi Kubernetes Clusters- Configuring Connectivity for Containers- Setting Up MetalLB and Ingress for Load Balancing- Monitoring the Health of Infrastructure and Applications- Using Kubeflow to Run AI/MLOps Workloads- Going Serverless with Knative and OpenFaaS Frameworks- Managing Storage Replication with OpenEBS- Implementing Service Mesh for Cross-Cutting Concerns- Resisting Component Failure Using HA Clusters- Hardware Virtualization for Securing Containers(N.B. Please use the Read Sample option to see further chapters)
Frontiers of Algorithmic Wisdom
This book constitutes the proceedings of the International Joint Conference on Theoretical Computer Science-Frontier of Algorithmic Wisdom (IJTCS-FAW 2022), for the 16th International Conference on Frontier of Algorithmic Wisdom (FAW) and the third International Joint Conference on Theoretical Computer Science (IJTCS), held in Hong Kong, China, in August 15-19 2022.FAW started as the Frontiers of Algorithmic Workshop in 2007 at Lanzhou, China, and was held annually from 2007 to 2021 and published archival proceedings. IJTCS, the International joint theoretical Computer Science Conference, started in 2020, aimed to bring in presentations covering active topics in selected tracks in theoretical computer science.To accommodate the diversified new research directions in theoretical computer science, FAW and IJTCS joined their forces together to organize an event for information exchange of new findings and work of enduring value in the field. In addition to four keynote speakers, 26 invited speakers and 19 contributed speakers, IJTCS-FAW2022 organized Forums for undergraduate research, young PhD graduates, young TCS faculty members, female researchers, as well as a forum in Conscious AI and a CSIAM Forum in blockchain.The 19 full papers presented in this book were carefully reviewed and selected from 25 submissions. They were organized in topical sections as follows: Algorithmic Game Theory; Game Theory in Block Chain; Frontiers of Algorithmic Wisdom; Computational and Network Economics.
Oracle on Docker
Discover the benefits of running Oracle databases in Linux containers. This book approaches containers from the perspective of database administrators, developers, and systems administrators. It explains the differences between containers and virtual machines and describes why containers deliver greater speed, flexibility, and portability, with lower resource requirements. You'll learn how running Oracle databases in containers complements existing database infrastructure and accelerates development, and you'll understand the advantages they offer for test and validation environments. This book teaches you how to begin working with Oracle databases in Docker, covering the steps for preparing and installing software on Windows, Mac, and Linux systems. It describes the steps for deploying Oracle databases, separating data and configurations from database software, and networking and communicating with your containers. It introduces the Docker commands you'll use for managingcontainers, including tips and shortcuts to make everyday tasks easier. Databases have unique demands for performance and reliability, and this book addresses those qualities with discussions on protecting, persisting, and distributing data. Other books may overlook these topics and approach containers as disposable commodities in serverless environments or convenient coding platforms. You'll gain battle-tested insights for customizing and extending your containers to meet different needs. The opening chapters concentrate on the practical steps of running Oracle databases in Docker. Once you're comfortable with container terminology and methods, you'll look deeper at the real power behind containers--preparing and building images, and the templates that form the foundation beneath every container. You'll begin by modifying publicly available image manifests, or Dockerfiles, following multiple examples that add functionality and capabilities to your databases. You'lldiscover methods for using run-time options to create flexible and extensible images that adapt to real-world requirements. Within the pages, you'll see how Oracle and Docker empower you to confidently build and deploy systems. It's written with databases and database users in mind and delivers practical advice based on the author's real-world, battle-tested experiences deploying and running Oracle databases in containers since 2014. With Oracle databases in containers, database administrators have the ideal platform for evaluating performance, practicing database upgrades and migrations, validating backup and recovery processes, and hardening environments. Developers will find that the marriage of Oracle and Docker simplifies code and application tests. Docker's unique ability to isolate data artifacts improves reliability and confidence in test and QA processes. If you're a database administrator, this book will help you join the container revolution sweeping the industry and making IT professionals more productive than ever! What You Will LearnRecognize when and why to use containers for an Oracle databaseUnderstand container terminology and architectureCreate and customize Oracle databases in containersBuild and extend images and containers for multiple usesStore and persist data beyond the container ecosystemUse popular database tools with databases in containersExplore container networking and connect multiple container databasesManage, monitor, and secure containersWrite Dockerfiles to support custom requirementsPackage and deploy data artifacts that accelerate development, test, and QA activitiesWho This Book Is ForDatabase administrators, developers, and systems administrators who want to be more productive by running Oracle databases in Linux containers
Guide to Industrial Analytics
This textbook describes the hands-on application of data science techniques to solve problems in manufacturing and the Industrial Internet of Things (IIoT). Monitoring and managing operational performance is a crucial activity for industrial and business organisations. The emergence of low-cost, accessible computing and storage, through Industrial Digital Technologies (IDT) and Industry 4.0, has generated considerable interest in innovative approaches to doing more with data. Data science, predictive analytics, machine learning, artificial intelligence and general approaches to modelling, simulating and visualising industrial systems have often been considered topics only for research labs and academic departments.This textbook debunks the mystique around applied data science and shows readers, using tutorial-style explanations and real-life case studies, how practitioners can develop their own understanding of performance to achieve tangible business improvements. All exercises can be completed with commonly available tools, many of which are free to install and use.Readers will learn how to use tools to investigate, diagnose, propose and implement analytics solutions that will provide explainable results to deliver digital transformation.
Learning Microsoft Power Bi
Microsoft Power BI is a data analytics and visualization tool powerful enough for the most demanding data scientists, but accessible enough for everyday use for anyone who needs to get more from data. The market has many books designed to train and equip professional data analysts to use Power BI, but few of them make this tool accessible to anyone who wants to get up to speed on their own. This streamlined intro to Power BI covers all the foundational aspects and features you need to go from "zero to hero" with data and visualizations. Whether you work with large, complex datasets or work in Microsoft Excel, author Jeremey Arnold shows you how to teach yourself Power BI and use it confidently as a regular data analysis and reporting tool. You'll learn how to: Import, manipulate, visualize, and investigate data in Power BI Approach solutions for both self-service and enterprise BI Use Power BI in your organization's business intelligence strategy Produce effective reports and dashboards Create environments for sharing reports and managing data access with your team Determine the right solution for using Power BI offerings based on size, security, and computational needs
Transactions on Large-Scale Data- And Knowledge-Centered Systems Li
The LNCS journal Transactions on Large-Scale Data and Knowledge-Centered Systems focuses on data management, knowledge discovery, and knowledge processing, which are core and hot topics in computer science. Since the 1990s, the Internet has become the main driving force behind application development in all domains. An increase in the demand for resource sharing (e.g., computing resources, services, metadata, data sources) across different sites connected through networks has led to an evolution of data- and knowledge-management systems from centralized systems to decentralized systems enabling large-scale distributed applications providing high scalability.This, the 51st issue of Transactions on Large-Scale Data and Knowledge-Centered Systems, contains five fully revised selected regular papers. Topics covered include data anonyomaly detection, schema generation, optimizing data coverage, and digital preservationwith synthetic DNA.
Algorithms on Trees and Graphs
This book introduces graph algorithms on an intuitive basis followed by a detailed exposition in a literate programming style, with correctness proofs as well as worst-case analyses. Full C++ implementations of all algorithms presented are given using the LEDA library of efficient data structures and algorithms.
SAP S/4hana Financial Accounting Configuration
Upgrade your knowledge to learn S/4HANA, the latest version of the SAP ERP system, with its built-in intelligent technologies, including AI, machine learning, and advanced analytics.Since the first edition of this book published as SAP ERP Financial and Controlling: Configuration and Use Management, the perspective has changed significantly as S/4HANA now comes with new features, such as FIORI (new GUI), which focuses on flexible app style development and interactivity with mobile phones. It also has a universal journal, which helps in data integration in a single location, such as centralized processing, and is faster than ECC S/3. It merges FI & CO efficiently, which enables document posting in the Controlling area setup. General Ledger Accounts (FI) and Cost Element (CO) are mapped together in a way that cost elements (both primary and secondary) are part of G/L accounts. And a mandatory setup of customer-vendor integration with business partners is included vs the earlier ECC creation with separate vendor master and customer master.This updated edition presents new features in SAP S/4HANA, with in-depth coverage of the FI syllabus in SAP S/4HANA. A practical and hands-on approach includes scenarios with real-life examples and practical illustrations. There is no unnecessary jargon in this configuration and end-user manual.What You Will LearnConfigure SAP FI as a pro in S/4Master core aspects of Financial Accounting and ControllingIntegrate SAP Financial with other SAP modulesGain a thorough hands-on experience with IMG (Implementation Guide)Understand and explain the functionalities of SAP FIWho This Book Is ForFI consultants, trainers, developers, accountants, and SAP FI support organizations will find the book an excellent reference guide. Beginners without prior FI configuration experience will find the step-by-step illustrations to be practical and great hands-on experience.
Applied Informatics
This book constitutes the proceedings of the 5th International Conference on Applied Informatics, ICAI 2022, which took place in Arequipa, Peru, in October 2022. The 32 papers presented in this volume were carefully reviewed and selected from 90 submissions. The contributions are divided into the following thematic blocks: Artificial Intelligence; Data Analysis; Decision Systems; Health Care Information Systems; ICT-Enabled Social Innovation; Image Processing; Robotic Autonomy; Software Architectures; Software Design Engineering.
Social, Cultural, and Behavioral Modeling
This book constitutes the proceedings of the 15th International Conference on Social, Cultural, and Behavioral Modeling, SBP-BRiMS 2022, which was in Pittsburgh, PA, USA in September 2022.The 25 full papers presented in this volume were carefully reviewed and selected from 50 submissions. The papers were organized in topical sections as follows: computer science, psychology, sociology, communication science, public health, bioinformatics, political science, and organizational science. Numerous types of computational methods are used include, but not limited to, machine learning, language technology, social network analysis and visualization, agent-based simulation, and statistics.
Oracle Pl/SQL by Example
Using PL/SQL for Oracle Database 21c, you can build solutions that deliver unprecedented performance and efficiency in any environment, including the cloud. Oracle PL/SQL by Example, Sixth Edition, teaches all the PL/SQL skills you'll need, through real-world labs and extensive examples. Now fully updated for the newest version of PL/SQL 21c, it covers everything from basic syntax and program control through the latest optimization and tuning enhancements. Step by step, you'll walk through every key task, mastering today's most valuable Oracle 21c PL/SQL programming techniques on your own. Start by downloading the supporting schema and exercises from informit.com/title/9780138062835. Once you've done an exercise, the author doesn't just present the answer: She offers an in-depth discussion introducing deeper insights and modern best practices. This book's approach fully reflects the author's award-winning experience teaching PL/SQL to professionals at Columbia University in New York City. New database developers and DBAs can use it to get productive fast; experienced PL/SQL programmers will find it to be a superb Oracle Database 21c solutions reference. New in This Edition Updated code examples throughout New iteration controls for the FOR LOOP statement, such as stepped range, multiple iterations, collection, and cursor iterations Enhancements for PL/SQL qualified expressions Performance enhancements for PL/SQL functions, such as SQL macro, and better control of the result cache Other Topics Covered Mastering basic PL/SQL concepts and language fundamentals, and understanding SQL's role in PL/SQL Using conditional and iterative program controls Efficiently handling errors and exceptions Working with cursors and triggers, including compound triggers Using stored procedures, functions, and packages to write modular code that other programs can run Working with collections, object-relational features, native dynamic SQL, bulk SQL, and other advanced features
Research Challenges in Information Science: Ethics and Trustworthiness in Information Science
This book constitutes the proceedings of the 16th International Conference on Research Challenges in Information Sciences, RCIS 2022, which took place in Barcelona, Spain, during May 17-20, 2022. It focused on the special theme "Ethics and Trustworthiness in Information Science". The scope of RCIS is summarized by the thematic areas of information systems and their engineering; user-oriented approaches; data and information management; business process management; domain-specific information systems engineering; data science; information infrastructures, and reflective research and practice. The 35 full papers presented in this volume were carefully reviewed and selected from a total 100 submissions. The 18 Forum papers are based on 11 Forum submissions, from which 5 were selected, and the remaining 13 were transferred from the regular submissions. The 6 Doctoral Consortium papers were selected from 10 submissions to the consortium. The contributions were organized in topical sections named: Data Science and Data Management; Information Search and Analysis; Business Process Management; Business Process Mining; Digital Transformation and Smart Life; Conceptual Modelling and Ontologies; Requirements Engineering; Model-Driven Engineering; Machine Learning Applications. In addition, two-page summaries of the tutorials can be found in the back matter.
Perspectives in Business Informatics Research
This book constitutes the proceedings of the 21st International Conference on Perspectives in Business Informatics Research, BIR 2022, which took place in Rostock, Germany, in September 2022. The central theme of BIR 2022 was "Business Informatics for Sustainable Innovation". Achieving sustainability requires a multi-perspective approach taking organizational, economic, and technical aspects into account. In a world of cloud computing, social networks and big data, additional challenges for business informatics and the design of information systems architectures are introduced. To deal with these challenges, a close cooperation of researchers from different disciplines such as information systems, business informatics, and computer science is required.The 14 papers presented in this volume were carefully reviewed and selected from 41 submissions. They were organized in topical sections as follows: Information system development; modeling methods and assistance; applications and technologies; and digital business.
Ocaml Scientific Computing
This book is about the harmonious synthesis of functional programming and numerical computation. It shows how the expressiveness of OCaml allows for fast and safe development of data science applications. Step by step, the authors build up to use cases drawn from many areas of Data Science, Machine Learning, and AI, and then delve into how to deploy at scale, using parallel, distributed, and accelerated frameworks to gain all the advantages of cloud computing environments.To this end, the book is divided into three parts, each focusing on a different area. Part I begins by introducing how basic numerical techniques are performed in OCaml, including classical mathematical topics (interpolation and quadrature), statistics, and linear algebra. It moves on from using only scalar values to multi-dimensional arrays, introducing the tensor and Ndarray, core data types in any numerical computing system. It concludes with two more classical numerical computing topics, the solution ofOrdinary Differential Equations (ODEs) and Signal Processing, as well as introducing the visualization module we use throughout this book. Part II is dedicated to advanced optimization techniques that are core to most current popular data science fields. We do not focus only on applications but also on the basic building blocks, starting with Algorithmic Differentiation, the most crucial building block that in turn enables Deep Neural Networks. We follow this with chapters on Optimization and Regression, also used in building Deep Neural Networks. We then introduce Deep Neural Networks as well as topic modelling in Natural Language Processing (NLP), two advanced and currently very active fields in both industry and academia. Part III collects a range of case studies demonstrating how you can build a complete numerical application quickly from scratch using Owl. The cases presented include computer vision and recommender systems. This book aims at anyone with a basic knowledge of functional programming and a desire to explore the world of scientific computing, whether to generally explore the field in the round, to build applications for particular topics, or to deep-dive into how numerical systems are constructed. It does not assume strict ordering in reading - readers can simply jump to the topic that interests them most.
SAP S/4hana Systems in Hyperscaler Clouds
This book helps SAP architects and SAP Basis administrators deploy and operate SAP S/4HANA systems on the most common public cloud platforms. Market-leading cloud offerings are covered, including Amazon Web Services, Microsoft Azure, and Google Cloud. You will gain an end-to-end understanding of the initial implementation of SAP S/4HANA systems on those platforms. You will learn how to move away from the big monolithic SAP ERP systems and arrive at an environment with a central SAP S/4HANA system as the digital core surrounded by cloud-native services.The book begins by introducing the core concepts of Hyperscaler cloud platforms that are relevant to SAP. You will learn about the architecture of SAP S/4HANA systems on public cloud platforms, with specific content provided for each of the major platforms. The book simplifies the deployment of SAP S/4HANA systems in public clouds by providing step-by-step instructions and helping you deal with thecomplexity of such a deployment. Content in the book is based on best practices, industry lessons learned, and architectural blueprints, helping you develop deep insights into the operations of SAP S/4HANA systems on public cloud platforms. Reading this book enables you to build and operate your own SAP S/4HANA system in the public cloud with a minimum of effort.What You Will LearnChoose the right Hyperscaler platform for your future SAP S/4HANA workloadsStart deploying your first SAP S/4HANA system in the public cloudAvoid typical pitfalls during your implementationApply and leverage cloud-native services for your SAP S/4HANA systemSave costs by choosing the right architecture and build a robust architecture for your most critical SAP systemsMeet your business' criteria for availability and performance by having the right sizing in placeIdentify further use cases whenoperating SAP S/4HANA in the public cloudWho This Book Is ForSAP architects looking for an answer on how to move SAP S/4HANA systems from on-premises into the cloud; those planning to deploy to one of the three major platforms from Amazon Web Services, Microsoft Azure, and Google Cloud Platform; and SAP Basis administrators seeking a detailed and realistic description of how to get started on a migration to the cloud and how to drive that cloud implementation to completion
Land Use Cover Datasets and Validation Tools
Chapter 1. About this book.- Part I. Concepts, data and validation.- Chapter 2. Land Use Cover mapping, modelling and validation. A background.- Chapter 3. Validation of Land Use Cover maps: a guideline.- Chapter 4. Land Use Cover Datasets: a review.- Part II. Data access and visualization.- Chapter 5. Visualization and communication of LUC data.- Chapter 6. Sample data for thematic accuracy assessment in QGIS.- Part III. Tools to validate Land Use Cover maps: a review.- Chapter 7. Basic and Multiple-Resolution Cross Tabulation to validate Land Use Cover maps.- Chapter 8. Metrics based on a Cross-Tabulation matrix to validate Land Use Cover maps.- Chapter 9. Pontius Jr. methods based on a Cross Tabulation matrix to validate Land Use Cover maps.- Chapter 10. Validation of soft maps produced by a Land Use Cover Change model.- Chapter 11. Spatial metrics to validate Land Use Cover maps.- Chapter 12. Advanced pattern analysis to validate Land Use Cover maps.- Chapter 13. Geographically Weighted methods to validate Land Use Cover maps.- Part IV. Land Use Cover datasets: a review.- Chapter 14. Global general Land Use Cover datasets with a single date, - Chapter 15. Global general Land Use Cover datasets with a time series of maps.- Chapter 16. General Land Use Cover datasets for Europe.- Chapter 17. General Land Use Cover datasets for Africa.- Chapter 18. General Land Use Cover datasets for America and Asia.- Chapter 19. Global thematic Land Use Cover datasets characterizing vegetation covers.- Chapter 20. Global thematic Land Use Cover datasets characterizing agricultural covers.- Chapter 21. Global thematic Land Use Cover datasets characterizing artificial covers.- Chapter 22. Supra-national thematic Land Use Cover datasets
Machine Learning and Data Mining for Sports Analytics
This book constitutes the refereed post-conference proceedings of the 8th International Workshop on Machine Learning and Data Mining for Sports Analytics, MLSA 2021, held as virtual event in September 2021. The 12 full papers and 4 short papers presented were carefully reviewed and selected from 29 submissions. The papers present a variety of topics within the area of sports analytics, including tactical analysis, outcome predictions, data acquisition, performance optimization, and player evaluation.
Linking Theory and Practice of Digital Libraries
This book constitutes the proceedings of the 26th International Conference on Theory and Practice of Digital Libraries, TPDL 2022, which took place in Padua, Italy, in September 2022. The 18 full papers, 27 short papers and 15 accelerating innovation papers included in these proceedings were carefully reviewed and selected from 107 submissions. They focus on digital libraries and associated technical, practical, and social issues.
Fundamentals of Enterprise Architecture Management
This textbook provides a comprehensive, holistic, scientifically precise, and practically relevant description of Enterprise Architecture Management (EAM). Based on state-of-the-art concepts, it also addresses current trends like disruptive digitization or agile methods. The book is structured in five chapters. The first chapter offers a comprehensive overview of EAM. It addresses questions like: what does EAM mean, what is the history of EAM, why do enterprises need EAM, what are its goals, and how is it related to digitalization? It also includes a short overview of essential EAM standards and literature. The second chapter provides an overview of Enterprise Architecture (EA). It starts with clarifying basic terminology and the difference between EA and EAM. It also gives a short summary of existing EA frameworks and methods for structuring the digital ecosystem into layers and views. The third chapter addresses the strategic and tactical context of the EAM capabilityin an enterprise. It defines essential terms and parameters in the context of enterprise strategy and tactics as well as the operative, organizational context of EAM. The fourth chapter specifies the detailed goals, processes, functions, artifacts, roles and tools of EAM, building the basis for an EAM process framework that provides a comprehensive overview of EAM processes and functions. Closing the circle, the last chapter describes how to evaluate EAM in an enterprise. It starts by laying out core terminology, like "metric" and "strategic performance measurement system" and ends with a framework that integrates the various measuring areas in the context of EA and EAM. This textbook focuses on two groups: First, EAM scholars, ie bachelor or master students of Business Information Systems, Business Administration or Computer Science. And second, EAM practitioners working in the field of IT strategy or EA who need a reliable, scientifically solid, and practically proven state-of-the-art description of essential EAM methods.
Information Modelling
This textbook provides solid guidance on how to produce information models in practice. Information modeling has become increasingly relevant as an approach for understanding the active role that data plays within business and management and promoting the planning of business activities. The text promotes a practical approach to information modelling based around the analysis of communicative practice within delimited domains of organization. The book chapters are designed to be read in sequence. The early chapters build an account of information modelling from the bedrock of a theory of information situations. Later chapters discuss a number of practical issues concerned with the application of this business analysis and design technique. The conclusion demonstrates a larger context for the application and importance of information modelling. Numerous in-text examples of the concepts of information modelling and their application are included throughout the text. A separate chapter is devoted to a range of exercises which the reader can use to test understanding and application of the technique. An appendix with solutions is also provided to support learning. Overall, this textbook provides a step-by-step introduction to information modelling for use in undergraduate and postgraduate modules in information systems, computer science and even digitally focused modules within business and management. No prerequisite knowledge is assumed on the part of the reader. Students and practitioners are tutored in the development of information modelling from first principles. The book covers all the core principles of both entity-relationship diagramming and class diagramming - the two major approaches to information modelling.
Digital Transformation in Norwegian Enterprises
This open access book presents a number of case studies on digital transformation in Norway, one of the fore-runners in the digital progress index established by the European Commission in 2020. They explore the process of adoption, diffusion and value generation from digital technologies, and how the use of different digital solutions has enabled Norwegian enterprises to digitally transform their operations and business models.The book starts with an introductory chapter summarizing a vast body of literature in order to synthesize what is already known about digital transformation before exploring the Norwegian context in more detail. Then a series of case studies from the private and public sector in Norway is presented. They document a process perspective which describes the sequence of events during and after adoption of digital solutions, as well as the types of business value that were realized. Through these single studies, the process of digital transformation is illustrated, a number of key findings highlighted, and eventually theoretical and practical recommendations based on these cases emphasized. The book closes with a brief overview of some emerging technologies, and comments on how they are likely to change different sectors. Digital transformation has been one of the priority areas for the Norwegian government over the past years and puts Norwegian enterprises upfront in adopting novel technologies and utilizing them for achieving organizational goals. This experience accumulated over the years makes the Norwegian context a particularly interesting one in understanding how private and public organizations make use of new digital solutions, what lessons can be learnt during the process, and what are some of the key success and failure factors. This way the book is written for practitioners who are currently involved in digital transformation projects in their organizations, researchers of information systems and management, aswell as master students in degrees of informatics and technology management.
Analytics for Retail
Examine select retail business scenarios to learn basic mathematics, as well as probability and statistics required to analyze big data. This book focuses on useful and imperative applied analytics needed to build a retail business and explains mathematical concepts essential for decision making and communication in retail business environments. Everyone is a buyer or seller of products these days whether through a physical department store, Amazon, or their own business website. This book is a step-by-step guide to understanding and managing the mechanics of markups, markdowns, and basic statistics, math and computers that will help in your retail business. You'll tackle what to do with data once it is has accumulated and see how to arrange the data using descriptive statistics, primarily means, median, and mode, and then how to read the corresponding charts and graphs. Analytics for Retail is your path to creating visualrepresentations that powerfully communicate information and drive decisions. What You'll LearnReview standard statistical concepts to enhance your understanding of retail dataUnderstand the concepts of markups, markdowns and profit margins, and probability Conduct an A/B testing email campaign with all the relevant analytics calculated and explainedWho This Book Is ForThis is a primer book for anyone in the field of retail that needs to learn or refresh their skills or for a reader who wants to move in their company to a more analytical position.
Bioinformatics with Python Cookbook - Third Edition
Discover modern, next-generation sequencing libraries from the powerful Python ecosystem to perform cutting-edge research and analyze large amounts of biological dataKey Features: Perform complex bioinformatics analysis using the most essential Python libraries and applicationsImplement next-generation sequencing, metagenomics, automating analysis, population genetics, and much moreExplore various statistical and machine learning techniques for bioinformatics data analysisBook Description: Bioinformatics is an active research field that uses a range of simple-to-advanced computations to extract valuable information from biological data, and this book will show you how to manage these tasks using Python.This updated third edition of the Bioinformatics with Python Cookbook begins with a quick overview of the various tools and libraries in the Python ecosystem that will help you convert, analyze, and visualize biological datasets. Next, you'll cover key techniques for next-generation sequencing, single-cell analysis, genomics, metagenomics, population genetics, phylogenetics, and proteomics with the help of real-world examples. You'll learn how to work with important pipeline systems, such as Galaxy servers and Snakemake, and understand the various modules in Python for functional and asynchronous programming. This book will also help you explore topics such as SNP discovery using statistical approaches under high-performance computing frameworks, including Dask and Spark. In addition to this, you'll explore the application of machine learning algorithms in bioinformatics.By the end of this bioinformatics Python book, you'll be equipped with the knowledge you need to implement the latest programming techniques and frameworks, empowering you to deal with bioinformatics data on every scale.What You Will Learn: Become well-versed with data processing libraries such as NumPy, pandas, arrow, and zarr in the context of bioinformatic analysisInteract with genomic databasesSolve real-world problems in the fields of population genetics, phylogenetics, and proteomicsBuild bioinformatics pipelines using a Galaxy server and SnakemakeWork with functools and itertools for functional programmingPerform parallel processing with Dask on biological dataExplore principal component analysis (PCA) techniques with scikit-learnWho this book is for: This book is for bioinformatics analysts, data scientists, computational biologists, researchers, and Python developers who want to address intermediate-to-advanced biological and bioinformatics problems. Working knowledge of the Python programming language is expected. Basic knowledge of biology will also be helpful.
Business Intelligence
This book constitutes the proceedings of the 7th International Conference on Business Intelligence, CBI 2022, which took place in Khouribga, Morocco, during May 26-28, 2022. The 23 full papers included in this book were carefully reviewed and selected from a total of 68 submissions. They were organized in topical sections as follows: decision support and artificial intelligence; business intelligence and database; and optimization and dynamic programming.
The Semantic Web - Iswc 2022
This book constitutes the proceedings of the 21st International Semantic Web Conference, ISWC 2022, which took place in October 2022 in a virtual mode. The 48 full papers presented in this volume were thoroughly reviewed and selected from 239 submissions. They deal with the latest advances in fundamental research, innovative technology, and applications of the Semantic Web, linked data, knowledge graphs, and knowledge processing on the Web. Papers are organized in a research track, resources and in-use track. The research track details theoretical, analytical and empirical aspects of the Semantic Web and its intersection with other disciplines. The resources track promotes the sharing of resources which support, enable or utilize semantic web research, including datasets, ontologies, software, and benchmarks. And finally, the in-use-track is dedicated to novel and significant research contributions addressing theoretical, analytical and empirical aspects of the Semantic Web and its intersection with other disciplines.The chapters "Hashing the Hypertrie: Space- and Time-Efficient Indexing for SPARQL in Tensors", "Agree to Disagree: Managing Ontological Perspectives using Standpoint Logic", "GNNQ: A Neuro-Symbolic Approach to Query Answering over Incomplete Knowledge Graphs", "ISSA: Generic Pipeline, Knowledge Model and Visualization tools to Help Scientists Search and Make Sense of a Scientific Archiveare" are licensed under the terms of the Creative Commons Attribution 4.0 International License.
Emerging Technologies in Computer Engineering: Cognitive Computing and Intelligent Iot
This book constitutes the refereed proceedings of the 5th International Conference on Emerging Technologies in Computer Engineering, ICETCE 2021, held in Jaipur, India, in February 2022.The 40 revised full papers along with 20 short papers presented were carefully reviewed and selected from 235 submissions. The papers are organized according to the following topical headings: ​cognitive computing; Internet of Things (IoT); machine learning and applications; soft computing; data science and big data analytics; blockchain and cyber security.
Data Warehouse Systems
With this textbook, Vaisman and Zim獺nyi deliver excellent coverage of data warehousing and business intelligence technologies ranging from the most basic principles to recent findings and applications. To this end, their work is structured into three parts. Part I describes "Fundamental Concepts" including conceptual and logical data warehouse design, as well as querying using MDX, DAX and SQL/OLAP. This part also covers data analytics using Power BI and Analysis Services. Part II details "Implementation and Deployment," including physical design, ETL and data warehouse design methodologies. Part III covers "Advanced Topics" and it is almost completely new in this second edition. This part includes chapters with an in-depth coverage of temporal, spatial, and mobility data warehousing. Graph data warehouses are also covered in detail using Neo4j. The last chapter extensively studies big data management and the usage of Hadoop, Spark, distributed, in-memory, columnar, NoSQL and NewSQLdatabase systems, and data lakes in the context of analytical data processing. As a key characteristic of the book, most of the topics are presented and illustrated using application tools. Specifically, a case study based on the well-known Northwind database illustrates how the concepts presented in the book can be implemented using Microsoft Analysis Services and Power BI. All chapters have been revised and updated to the latest versions of the software tools used. KPIs and Dashboards are now also developed using DAX and Power BI, and the chapter on ETL has been expanded with the implementation of ETL processes in PostgreSQL. Review questions and exercises complement each chapter to support comprehensive student learning. Supplemental material to assist instructors using this book as a course text is available online and includes electronic versions of the figures, solutions to all exercises, and a set of slides accompanying each chapter. Overall, students, practitioners and researchers alike will find this book the most comprehensive reference work on data warehouses, with key topics described in a clear and educational style. "I can only invite you to dive into the contents of the book, feeling certain that once you have completed its reading (or maybe, targeted parts of it), you will join me in expressing our gratitude to Alejandro and Esteban, for providing such a comprehensive textbook for the field of data warehousing in the first place, and for keeping it up to date with the recent developments, in this current second edition."From the foreword by Panos Vassiliadis, University of Ioannina, Greece.
Digital Transformation in Norwegian Enterprises
This open access book presents a number of case studies on digital transformation in Norway, one of the fore-runners in the digital progress index established by the European Commission in 2020. They explore the process of adoption, diffusion and value generation from digital technologies, and how the use of different digital solutions has enabled Norwegian enterprises to digitally transform their operations and business models.The book starts with an introductory chapter summarizing a vast body of literature in order to synthesize what is already known about digital transformation before exploring the Norwegian context in more detail. Then a series of case studies from the private and public sector in Norway is presented. They document a process perspective which describes the sequence of events during and after adoption of digital solutions, as well as the types of business value that were realized. Through these single studies, the process of digital transformation is illustrated, a number of key findings highlighted, and eventually theoretical and practical recommendations based on these cases emphasized. The book closes with a brief overview of some emerging technologies, and comments on how they are likely to change different sectors. Digital transformation has been one of the priority areas for the Norwegian government over the past years and puts Norwegian enterprises upfront in adopting novel technologies and utilizing them for achieving organizational goals. This experience accumulated over the years makes the Norwegian context a particularly interesting one in understanding how private and public organizations make use of new digital solutions, what lessons can be learnt during the process, and what are some of the key success and failure factors. This way the book is written for practitioners who are currently involved in digital transformation projects in their organizations, researchers of information systems and management, aswell as master students in degrees of informatics and technology management.
Decision Support Systems XII: Decision Support Addressing Modern Industry, Business, and Societal Needs
This book constitutes the proceedings of the 8th International Conference on Decision Support Systems Technologies, ICDSST 2022, held during May 23-25, 2022.The EWG-DSS series of International Conference on Decision Support System Technology (ICDSST) is planned to consolidate the tradition of annual events organized by the EWG-DSS in offering a platform for European and international DSS communities, comprising the academic and industrial sectors, to present state-of-the-art DSS research and developments, to discuss current challenges that surround decision-making processes, to exchange ideas about realistic and innovative solutions, and to co-develop potential business opportunities. The main aim of this year's conference is to investigate the role DSS and related technologies can play in mitigating the impact of pandemics and post-crisis recovery. The 15 papers presented in this volume were carefully reviewed and selected from 46 submissions. They were organized in topical sections as follows: decision support addressing modern industry; decision support addressing business and societal needs, and multiple criteria approaches.
Advances in Geospatial Data Science
This book presents a selection of manuscripts submitted to the 2nd International Conference on Geospatial Information Sciences 2021, a virtual conference held on November 3-5, 2021. These papers were selected by the Scientific Program Committee of the Conference after a rigorous peer-review process. They represent the vast scope of the interdisciplinary research areas that characterize the Geospatial Information Sciences that is done in the discipline. It especially represents a fabulous opportunity to showcase research carried out by young Mexican researchers and showcase it to the rest of the world and enhance the growth of the sciences in the country while, at the same time, enforces them to level up with other research at the international level.
The Azure Data Lakehouse Toolkit
Design and implement a modern data lakehouse on the Azure Data Platform using Delta Lake, Apache Spark, Azure Databricks, Azure Synapse Analytics, and Snowflake. This book teaches you the intricate details of the Data Lakehouse Paradigm and how to efficiently design a cloud-based data lakehouse using highly performant and cutting-edge Apache Spark capabilities using Azure Databricks, Azure Synapse Analytics, and Snowflake. You will learn to write efficient PySpark code for batch and streaming ELT jobs on Azure. And you will follow along with practical, scenario-based examples showing how to apply the capabilities of Delta Lake and Apache Spark to optimize performance, and secure, share, and manage a high volume, high velocity, and high variety of data in your lakehouse with ease.The patterns of success that you acquire from reading this book will help you hone your skills to build high-performing and scalable ACID-compliant lakehouses using flexible and cost-efficient decoupled storage and compute capabilities. Extensive coverage of Delta Lake ensures that you are aware of and can benefit from all that this new, open source storage layer can offer. In addition to the deep examples on Databricks in the book, there is coverage of alternative platforms such as Synapse Analytics and Snowflake so that you can make the right platform choice for your needs.After reading this book, you will be able to implement Delta Lake capabilities, including Schema Evolution, Change Feed, Live Tables, Sharing, and Clones to enable better business intelligence and advanced analytics on your data within the Azure Data Platform. What You Will LearnImplement the Data Lakehouse Paradigm on Microsoft's Azure cloud platformBenefit from the new Delta Lake open-source storage layer for data lakehouses Take advantage of schema evolution, change feeds, live tables, and moreWritefunctional PySpark code for data lakehouse ELT jobsOptimize Apache Spark performance through partitioning, indexing, and other tuning optionsChoose between alternatives such as Databricks, Synapse Analytics, and SnowflakeWho This Book Is ForData, analytics, and AI professionals at all levels, including data architect and data engineer practitioners. Also for data professionals seeking patterns of success by which to remain relevant as they learn to build scalable data lakehouses for their organizations and customers who are migrating into the modern Azure Data Platform.
IBM DB2 Administration Guide
Guidance for successful installation of a wide range of IBM Software KEY FEATURES ● Precise and step-by-step guidance for installation and configuration of IBM DB2 Solutions.● Specially Designed and Personalized for IT consultants and Systems and Solution Architects.● Includes illustrations and simplified guidelines for data developers and analysts for successful implementation. DESCRIPTION IT professionals and software architects are overloaded with knowledge due to the rapid advancement of technology and the launch of new Cloud-based services. This book provides helpful instructions for installing and configuring IBM Database on multiple operating systems and enterprise platforms. The book's troubleshooting sections are designed to increase IT support productivity and speed up problem resolution. Software Architects, Installation specialists, database developers, and IBM Document, Case, and Workflow management software developers can all benefit from this book. This book offers a centralised resource that discusses the most recent version of IBM software that readers can use on the most recent versions of Red Hat Linux and IBM Cloud platforms. This book is intended to provide a thorough introduction that will allow an IT expert to understand the installation of a wide range of IBM Software products. It includes information on online references and step-by-step processes for installing a wide range of IBM Software products. WHAT YOU WILL LEARN● Identify the prerequisite DB2 version for IBM Software Application Systems.● Identify Server platform versions, disc, memory, and network resources for DB2 and DB2 Graph installation.● Detect the DB2 prerequisite version necessary for installation on various server setups.● Install DB2 for Docker Container, RedHat OpenShift, IBM Cloud Private system 3.2.0 (Community Edition), and IBM Cloud.● Install DB2Graph Containers for analysis using Apache "TinkerPop" Graph Computing Framework.● Enable entire DB2 administration for backup and recovery of systems.WHO THIS BOOK IS FORThis book is intended for IT consultants, Solution Architects, System Developers, and Data Professionals with rudimentary technological knowledge. The book is also an excellent resource for banking and insurance experts creating database solutions for their companies.
Azure Data Engineering Cookbook - Second Edition
Nearly 80 recipes to help you collect and transform data from multiple sources into a single data source, making it way easier to perform analytics on the dataKey Features: Build data pipelines from scratch and find solutions to common data engineering problemsLearn how to work with Azure Data Factory, Data Lake, Databricks, and Synapse AnalyticsMonitor and maintain your data engineering pipelines using Log Analytics, Azure Monitor, and Azure PurviewBook Description: The famous quote 'Data is the new oil' seems more true every day as the key to most organizations' long-term success lies in extracting insights from raw data. One of the major challenges organizations face in leveraging value out of data is building performant data engineering pipelines for data visualization, ingestion, storage, and processing. This second edition of the immensely successful book by Ahmad Osama brings to you several recent enhancements in Azure data engineering and shares approximately 80 useful recipes covering common scenarios in building data engineering pipelines in Microsoft Azure.You'll explore recipes from Azure Synapse Analytics workspaces Gen 2 and get to grips with Synapse Spark pools, SQL Serverless pools, Synapse integration pipelines, and Synapse data flows. You'll also understand Synapse SQL Pool optimization techniques in this second edition. Besides Synapse enhancements, you'll discover helpful tips on managing Azure SQL Database and learn about security, high availability, and performance monitoring. Finally, the book takes you through overall data engineering pipeline management, focusing on monitoring using Log Analytics and tracking data lineage using Azure Purview.By the end of this book, you'll be able to build superior data engineering pipelines along with having an invaluable go-to guide.What You Will Learn: Process data using Azure Databricks and Azure Synapse AnalyticsPerform data transformation using Azure Synapse data flowsPerform common administrative tasks in Azure SQL DatabaseBuild effective Synapse SQL pools which can be consumed by Power BIMonitor Synapse SQL and Spark pools using Log AnalyticsTrack data lineage using Microsoft Purview integration with pipelinesWho this book is for: This book is for data engineers, data architects, database administrators, and data professionals who want to get well versed with the Azure data services for building data pipelines. Basic understanding of cloud and data engineering concepts will help in getting the most out of this book.
Model-Driven Development of Akoma Ntoso Application Profiles
This book presents a model-driven approach for creating a national application profile of the international legislative document standard Akoma Ntoso (AKN). AKN is an XML-based document standard that serves as the basis for modern machine-readable and fully digital legislative and judicial processes. The described model-driven development approach ensures consistent and error-proof application of AKN concepts and types, even when using different software tools. It allows for easy maintenance, is self-documenting, and facilitates stakeholder validation with nontechnical legal experts. The resulting application profile remains fully compliant to and compatible with AKN. For the sake of illustration, the approach is paradigmatically applied to the German federal legislative process, as a corresponding approach was used in the creation of the German AKN application profile, LegalDocML.de. We discuss how the methodology yields a model, schema definition and specification that correspond to the artefacts created by LegalDocML.de, using examples from Germany. The book is of interest to both legal and technical project teams on the cusp of introducing AKN in a legislative domain and intended as a practical guideline for teams preparing to create a custom application profile for their own domain. Furthermore, it can serve as both a resource and an inspiration for similar and yet to be developed methodologies in the public sector, the health sector or in defense, where international standardization and interoperability efforts are to be applied to a local level.
Business Process Management Forum
This book constitutes the proceedings of the BPM Forum held at the 20th International Conference on Business Process Management, BPM 2022, which took place in M羹nster, Germany, in September 2022. The BPM Forum hosts innovative research which has a high potential of stimulating discussions. The papers selected for the forum are expected to showcase fresh ideas from exciting and emerging topics in BPM, even if they are not yet as mature as the regular papers at the conference. The 13 full papers included in this volume were carefully reviewed and selected from 98 submissions. The papers were organized in topical sections named: modeling and design; process mining; and predictive process monitoring.
Business Process Management: Blockchain, Robotic Process Automation, and Central and Eastern Europe Forum
This book constitutes the proceedings of the Blockchain, Robotic Process Management (RPA), and Central and Eastern Europe (CEE) Forum which were held as part of the 20th International Conference on Business Process Management, BPM 2022, which took place in M羹nster, Germany, during September 11-15, 2022. The Blockchain Forum is dealing with techniques for and applications of blockchains, distributed ledger technologies, and related topics. "The RPA Forum brings together researchers from various communities to discuss challenges, opportunities, and new ideas related to robotic process automation and its application to business processes in private and public sectors." The CEE Forum provides a discussion platform for BPM academics from Central and Eastern Europe to disseminate their research, compare results and share experiences. The 20 papers presented in this volume were carefully reviewed and selected from a total of 40 submissions.
Data Science Concepts and Techniques with Applications
This textbook comprehensively covers both fundamental and advanced topics related to data science. Data science is an umbrella term that encompasses data analytics, data mining, machine learning, and several other related disciplines. The chapters of this book are organized into three parts: The first part (chapters 1 to 3) is a general introduction to data science. Starting from the basic concepts, the book will highlight the types of data, its use, its importance and issues that are normally faced in data analytics, followed by presentation of a wide range of applications and widely used techniques in data science. The second part, which has been updated and considerably extended compared to the first edition, is devoted to various techniques and tools applied in data science. Its chapters 4 to 10 detail data pre-processing, classification, clustering, text mining, deep learning, frequent pattern mining, and regression analysis. Eventually, the third part (chapters 11 and 12) present a brief introduction to Python and R, the two main data science programming languages, and shows in a completely new chapter practical data science in the WEKA (Waikato Environment for Knowledge Analysis), an open-source tool for performing different machine learning and data mining tasks. An appendix explaining the basic mathematical concepts of data science completes the book. This textbook is suitable for advanced undergraduate and graduate students as well as for industrial practitioners who carry out research in data science. They both will not only benefit from the comprehensive presentation of important topics, but also from the many application examples and the comprehensive list of further readings, which point to additional publications providing more in-depth research results or provide sources for a more detailed description of related topics. "This book delivers a systematic, carefully thoughtful material on Data Science." from the Foreword by Witold Pedrycz, U Alberta, Canada.
Chinese Computational Linguistics
This book constitutes the proceedings of the 21st China National Conference on Computational Linguistics, CCL 2022, held in Nanchang, China, in October 2022.The 22 full English-language papers in this volume were carefully reviewed and selected from 293 Chinese and English submissions.The conference papers are categorized into the following topical sub-headings: Linguistics and Cognitive Science; Fundamental Theory and Methods of Computational Linguistics; Information Retrieval, Dialogue and Question Answering; Text Generation and Summarization; Knowledge Graph and Information Extraction; Machine Translation and Multilingual Information Processing; Minority Language Information Processing; Language Resource and Evaluation; NLP Applications.
Combinatorial Optimization
This book constitutes thoroughly refereed and revised selected papers from the 7th International Symposium on Combinatorial Optimization, ISCO 2022, which was held online during May 18-20, 2022.The 24 full papers included in this book were carefully reviewed and selected from 50 submissions. They were organized in topical sections as follows: Polyhedra and algorithms; polyhedra and combinatorics; non-linear optimization; game theory; graphs and trees; cutting and packing; applications; and approximation algorithms.
Composite NUV Priors and Applications
Normal with unknown variance (NUV) priors are a central idea of sparse Bayesian learning and allow variational representations of non-Gaussian priors. More specifically, such variational representations can be seen as parameterized Gaussians, wherein the parameters are generally unknown. The advantage is apparent: for fixed parameters, NUV priors are Gaussian, and hence computationally compatible with Gaussian models. Moreover, working with (linear-)Gaussian models is particularly attractive since the Gaussian distribution is closed under affine transformations, marginalization, and conditioning. Interestingly, the variational representation proves to be rather universal than restrictive: many common sparsity-promoting priors (among them, in particular, the Laplace prior) can be represented in this manner. In estimation problems, parameters or variables of the underlying model are often subject to constraints (e.g., discrete-level constraints). Such constraints cannot adequately be represented by linear-Gaussian models and generally require special treatment. To handle such constraints within a linear-Gaussian setting, we extend the idea of NUV priors beyond its original use for sparsity. In particular, we study compositions of existing NUV priors, referred to as composite NUV priors, and show that many commonly used model constraints can be represented in this way.
New Trends in Database and Information Systems
This book constitutes the proceedings of the 26th European Conference on Advances in Databases and Information Systems, ADBIS 2022, held in Turin, Italy, in September 2022. The 29 short papers presented were carefully reviewed and selected from 90 submissions. The selected short papers are organized in the following sections: data understanding, modeling and visualization; fairness in data processing; data management pipeline, information and process retrieval; data access optimization; data pre-processing and cleaning; data science and machine learning. Further, papers from the following workshops and satellite events are provided in the volume: DOING: 3rd Workshop on Intelligent Data - From Data to Knowledge; K-GALS: 1st Workshop on Knowledge Graphs Analysis on a Large Scale; MADEISD: 4th Workshop on Modern Approaches in Data Engineering and Information System Design; MegaData: 2nd Workshop on Advanced Data Systems Management, Engineering, and Analytics; SWODCH: 2nd Workshop on Semantic Web and Ontology Design for Cultural Heritage; Doctoral Consortium.
Recent Trends in Analysis of Images, Social Networks and Texts
This book constitutes revised selected papers of the 10th International Conference on Analysis of Images, Social Networks and Texts, AIST 2021, held in Tbilisi, Georgia, in December 2021. Due to the COVID-19 pandemic the conference was held in hybrid mode. The 17 full papers were carefully reviewed and selected from 118 submissions, out of which 92 were sent to peer review. The papers are organized in topical sections on ​natural language processing; computer vision; data analysis and machine learning; social network analysis; theoretical machine learning and optimisation.
Blockchain Foundations and Applications
This monograph provides a comprehensive and rigorous exposition of the basic concepts and most important modern research results concerning blockchain and its applications. The book includes the required cryptographic fundamentals underpinning the blockchain technology, since understanding of the concepts of cryptography involved in the design of blockchain is necessary for mastering the security guarantees furnished by blockchain. It also contains an introduction to cryptographic primitives, and separate chapters on bitcoin, ethereum and smart contracts, public blockchain, private blockchain, cryptocurrencies, and blockchain applications.This volume is of great interest to active researchers who are keen to develop novel applications of blockchain in the field of their investigatio. Further, it is also beneficial for industry practitioners as well as undergraduate students in computing and information technology.