Database (1420)
Database management system (9)
Database systems (14)
How many times are the same or similar data, including the same pictures, registered in different databases?
and biometric information registered in a database. UID, branded as Aadhaar, guarantees only a person's identity, not rights, benefits or entitlements.
When completed, it will probably be the world's largest single-entity biometric database. This initiative is both technological and social.
PIA 91 PIATESKI G., FRAWLEY W., Knowledge Discovery in Databases, MIT Press, Cambridge, MA, 1991.
PLA 14 PLANETOSCOPE, available at http://www.planetoscope.com/Avion/109-nombre-de-vols-d-avions-dans-le-monde.html, 2014.
and Richard Welch Part III Driving Innovation Through Advanced Process Analytics Extracting Event Data from Databases to Unleash Process Mining...
Wil van der Aalst reports on Extracting Event Data from Databases to Unleash Process Mining. He introduces an approach to create event logs from underlying databases as a fundamental prerequisite for the application of process-mining techniques
when information systems do not explicitly record events. Jan Recker gives insights on Evidence-Based Business Process Management:
, in-memory database technologies. To sum up and to frame the research challenges of proactive value chains:
while rather narrow services such as an order-approval, database request, or an ERP-based shipping receipt event entry are at the other end.
Process Innovation with Disruptive Technology in Auto Insurance
Part III Driving Innovation Through Advanced Process Analytics
Extracting Event Data from Databases to Unleash Process Mining, Wil M
This paper uses a novel perspective to conceptualize a database view on event data. Starting from a class model and corresponding object models it is shown that events correspond to the creation, deletion,
The key idea is that events leave footprints by changing the underlying database. Based on this an approach is described that scopes
However, in most organizations one encounters information systems built on top of database technology. The IoE depends on a variety of databases (classical relational DBMSs or new NoSQL technologies).
Therefore, we provide a database view on event data and assume that events leave footprints by changing the underlying database.
Fortunately, database technology often provides so-called redo logs that can be used to reconstruct the history of database updates.
Footnote 2: For example, http://www.win.tue.nl/ieeetfpm/doku.php?id=shared:process_mining_case_studies lists over 20 successful case studies in industry.
This is what we would like to exploit systematically. Although the underlying databases are loaded with data, there are no explicit references to events, cases, and activities.
Instead, there are tables containing records and these tables are connected through key relationships. Hence, the challenge is to convert tables and records into event logs.
Obviously, this cannot be done in an automated manner. To understand why process-mining techniques need flat event logs (i.e.,
event logs with ordered events that explicitly refer to cases and activities) as input, consider any process model in one of the mainstream process modeling notations (e.g.,
we focus on the problem of extracting flat event logs from databases. First, we introduce process mining in a somewhat more detailed form (Sect. 2). Section 3 presents twelve guidelines for logging.
this paper aims to exploit the events hidden in existing databases. We use a database-centric view on processes:
the state of a process is reflected by the database content. Hence, events are merely changes of the database.
In the remainder we assume that data is stored in a database management system and that we can see all updates of the underlying database.
This assumption is realistic (see, e.g., the redo logs of Oracle). However, how can the problem of converting database updates into event logs be approached systematically?
Section 4 introduces class and object models as a basis to reason about the problem. In Sect. 5 we show that class models can be extended with a so-called event model.
The event model is used to capture changes of the underlying database. Section 6 describes a three-step approach (Scope, Bind,
and Classify) to create a collection of flat event logs. The results serve as input for conventional process-mining techniques.
The ProM framework provides an open-source process-mining infrastructure.
Before we present our database-centric approach, we introduce twelve guidelines for logging. These guidelines make no assumptions on the underlying technology used to record event data.
(d) Celonis Process Mining (Celonis GmbH) (Color figure online)
specific extensions (see for example the extension mechanism of XES (IEEE Task Force
For example, do not remove a student from the database after he dropped out since this may lead to misleading analysis results.
We aim to exploit the hidden event data already present in databases. The content of the database can be seen as the current state of one or more processes.
Updates of the database are therefore considered the primary events. This database-centric view on event logs is orthogonal to the above guidelines.
4 Class and Object Models
Most information systems do not record events explicitly. Only process-aware information systems (e.g., BPM/WFM systems) record event data in the format shown in Table 1. To create an event log
we often need to gather data from different data sources where events exist only implicitly.
In fact, for most process-mining projects event data need to be extracted from conventional databases. This is often done in an ad hoc manner.
but still the event logs need to be constructed by querying the database and converting database records (rows in tables) into events.
Moreover, the regular tables in a database only provide the current state of the information system.
deleted records are generally invisible. Taking the viewpoint that the database reflects the current state of one or more processes,
we define all changes of the database to be events. Below we conceptualize this viewpoint.
The event model relates coherent sets of changes to the underlying database to events used for process mining.
In this way all intermediate states of the database can be reconstructed. Moreover, marking objects as deleted instead of completely removing them from the database is often more natural, e.g., concerts are not deleted, they are canceled; employees are not deleted, they are fired; etc.
Definition 1 (Unconstrained Class Model) Assume V to be some universe of values (strings, numbers, etc.). An unconstrained class model is a tuple UCM = (C, A, R, val, key, attr,
there cannot be two concerts on the same day in the same concert hall
Fig. 2 Example of a constrained class model (Color figure online)
and class models in a database. However, it is easy to map any class model onto a set of related tables in a conventional relational database system.
but it is obvious that the conceptualization agrees with standard database technology.
5 Events and Their Effect on the Object Model
Examples of widely used database management systems (DBMSs) are Oracle RDBMS (Oracle), SQL Server (Microsoft), DB2 (IBM), Sybase (SAP),
all of these systems have facilities to record changes to the database. For example, in the Oracle RDBMS environment, redo logs comprise files in a proprietary format
which log a history of all changes made to the database. Oracle LogMiner, a utility provided by Oracle,
provides methods of querying logged changes made to an Oracle database. Every Microsoft SQL Server database has a transaction log that records all database modifications.
Sybase IQ also provides a transaction log. Such redo/transaction logs can be used to recover from a system failure.
if there are frequent changes to the database. In such cases, the redo/transaction logs need to be truncated regularly.
However, we assume that through redo/transaction logs we can monitor changes to the database.
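As an illustration only, the following sketch shows what "monitoring changes to the database" can look like in a small setting. It does not read Oracle redo logs or a SQL Server transaction log. Instead it swaps in SQLite triggers, through Python's standard `sqlite3` module, to record every insert, update and delete in a `change_log` table. The table, column and trigger names are hypothetical and do not come from the paper.

```python
import sqlite3

# In-memory database; the schema below is a hypothetical example.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE customer (id INTEGER PRIMARY KEY, name TEXT);

-- Stand-in for a redo/transaction log: one row per observed change.
CREATE TABLE change_log (
    seq    INTEGER PRIMARY KEY AUTOINCREMENT,
    ts     TEXT DEFAULT CURRENT_TIMESTAMP,
    tbl    TEXT,
    op     TEXT,
    row_id INTEGER
);

CREATE TRIGGER customer_ins AFTER INSERT ON customer BEGIN
    INSERT INTO change_log (tbl, op, row_id) VALUES ('customer', 'insert', NEW.id);
END;
CREATE TRIGGER customer_upd AFTER UPDATE ON customer BEGIN
    INSERT INTO change_log (tbl, op, row_id) VALUES ('customer', 'update', NEW.id);
END;
CREATE TRIGGER customer_del AFTER DELETE ON customer BEGIN
    INSERT INTO change_log (tbl, op, row_id) VALUES ('customer', 'delete', OLD.id);
END;
""")

# Ordinary updates; the triggers record them as a side effect.
conn.execute("INSERT INTO customer (id, name) VALUES (1, 'Ann')")
conn.execute("UPDATE customer SET name = 'Anna' WHERE id = 1")
conn.execute("DELETE FROM customer WHERE id = 1")

# The regular table is empty again, but the history survives in the log.
rows = conn.execute("SELECT op, row_id FROM change_log ORDER BY seq").fetchall()
remaining = conn.execute("SELECT COUNT(*) FROM customer").fetchone()[0]
```

After the three statements the `customer` table no longer contains the record, while `change_log` still holds its full history. This is the kind of information that regular tables alone do not provide.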
Definition 6 (Events) Let CM = (C,
If the customer is already in the database, the composite event cannot contain the creation of the customer object c6.
model (Color figure online)
Next we define the effect of an event occurrence, i.e.,
This is denoted by $OM_0 \xrightarrow{L} OM_n$. The formalizations above provide operational semantics for an abstract database system that processes a sequence of events.
However, the goal is not to model a database system. Instead, we aim to relate database updates to event logs that can be used for process mining.
Subsequently, we assume that we can witness a change log $L = \langle e_1, e_2, \ldots, e_n \rangle$. It is easy to see atomic events.
and/or user id). Definition 3 shows that this assumption allows us to reconstruct the state of the database system after each event, i.e.,
one may convert it into a conventional event by taking $ts_i$ as timestamp and $en_i$ as activity.
and classify approach allows for the transformation of database updates into events populating process cubes that can be used for a variety of process-mining analyses.
7 Related Work
The reader is referred to (Aalst, 2011) for an introduction
Next to the automated discovery of the underlying process based on raw event data,
We are not aware of any work systematically transforming database updates into event logs. Probably, there are process-mining case studies using redo/transaction logs from database management systems like Oracle RDBMS, Microsoft SQL Server, IBM DB2,
database systems. This paper focused on supporting the systematic extraction of event data from database systems.
Regular tables in a database provide a view of the actual state of the information system.
For process mining, however, it is interesting to know when a record was created, updated, or deleted.
Taking the viewpoint that the database reflects the current state of one or more processes,
we define all changes of the database to be events. In this paper, we conceptualized this viewpoint.
The event model relates changes to the underlying database to events used for process mining.
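The viewpoint above can be pictured with a minimal Python sketch. It assumes a hypothetical `Change` record (timestamp, table, operation, case identifier) standing in for one entry recovered from a redo or transaction log. None of these names come from the paper. Deciding which value serves as the case identifier is the kind of choice the approach leaves to the analyst, so here it is simply given.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class Change:
    """One observed database update (hypothetical record layout)."""
    timestamp: datetime
    table: str
    operation: str  # "insert", "update", or "delete"
    case_id: str    # value chosen to identify the process instance


def to_event_log(changes):
    """Turn database changes into a flat event log.

    Each change becomes one event whose activity name is derived from
    the operation and the table. Events are grouped per case and
    ordered by timestamp within each case.
    """
    traces = defaultdict(list)
    for change in sorted(changes, key=lambda c: c.timestamp):
        activity = f"{change.operation} {change.table}"
        traces[change.case_id].append((change.timestamp, activity))
    return dict(traces)


# Made-up changes, deliberately listed out of chronological order.
changes = [
    Change(datetime(2014, 1, 2, 9, 0), "ticket", "insert", "order-1"),
    Change(datetime(2014, 1, 1, 9, 0), "order", "insert", "order-1"),
    Change(datetime(2014, 1, 3, 9, 0), "order", "update", "order-2"),
]
log = to_event_log(changes)
activities_order_1 = [activity for _, activity in log["order-1"]]
```

Each trace in `log` is an ordered list of events that refer explicitly to a case and an activity, which is the flat shape that conventional process-mining techniques expect as input.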
Aalst, W. van der, Barthelmess, P., Ellis, C,
In Sixth International Conference on Extending Database Technology (Lecture Notes in Computer Science, Vol. 1377, pp. 469-483).
ACM Transactions on Database Systems, 1, 9-36. Cohn, D., & Hull, R. (2009). Business artifacts:
Distributed and Parallel Databases, 25(3), 193-240. Goedertier, S., Martens, D., Vanthienen, J., & Baesens, B. (2009).
Reichert, M., & Weber, B. (2012).
In Proceedings of the workshop on databases in networked information systems (DNIS) (pp. 1-14). Springer.
In Proceedings of the international conference on extending database technology (EDBT) (pp. 21-32). Springer.
and can comprise database schemata (e.g., Evermann, 2009) as well as arbitrary other model schemata. Process matching can be divided into two different fields: matching process models (1) and matching nodes of process models (2) (Thaler, Hake, Fettke, & Loos, 2014).
The Very Large Database Journal, 10, 334-350. Rahm, E., & Bernstein, P. A. (2001b). A survey of approaches to automatic schema matching.
International Journal on Very Large Data Bases, 10(4), 334-350. Rehse, J.-R., Fettke, P,
Extracting event data from databases to unleash process mining. In J. Brocke & T. Schmiedel (Eds.
or knowledge sharing databases.
4.1 Learnings
The process capability framework and the underlying maturity models illustrate that BPM can be approached from a technical perspective and/or a people perspective.
Database Marketing and Customer Strategy Management, 18(1), 31-38. Basu, S. C., & Palvia, P. C. (2000).
Database Marketing and Customer Strategy Management, 18(1), 50-64. Zachman, J. A. (1987). A framework for information systems architecture.
e.g. on IPR
INTERNAL NETWORKING AMONG CLUSTER MEMBERS: regular meetings, get-togethers, thematic events/workshops for cluster members; internal newsletters, databases, etc.
Furthermore, the database of 2012 has been extended by more cluster programs. They have been benchmarked with the same criteria as the programs analyzed in 2011.
and other types of evaluations as those databases do not contain all data in detail that is usually required for the analysis or evaluation of a certain program.
Key Indicators Database at: <http://www.gemconsortium.org/key-indicators> Accessed 04 May 2012.
24 Chart 1. Source:
A database of 150 private investors, serial investors and international investors has been developed who regularly receive profiles of client companies in sectors that match their knowledge domain.
Existing databases in the CEBs should be used to track the experience of grant recipients and course attendees so as to facilitate measurement of the impact of CEB activities.
A database of entrepreneurs could be established and managed by the LEOs or Enterprise Ireland, as a national registry.
KTI) KTI will create a searchable national database of licencing opportunities in order to increase the visibility of research opportunities to industry.
the DJEI will develop a simple open database to attract new mentors. A working group will implement changes to broaden its scope (particularly including mentoring to help enterprises scale),
Officehours.ie is a database of experienced Irish entrepreneurs who voluntarily give their time to advise startups.
- Create a simple mentors database to which new types of mentors (volunteer entrepreneurs or serving executives) would be attracted.
launch of an Open Data Portal for access to public sector databases; launch of an Energy Efficiency Fund and other measures such as raising standards in the retrofitting of homes;
Improved efficiency and accuracy of internal business processes as a result of improved accuracy and consistency of databases across public and private sectors;
which will improve logistical efficiency, the accuracy of databases across both the public and private sector and planning and analysis capabilities in both sectors.
and you can include excerpts from OECD publications, databases and multimedia products in your own documents, presentations, blogs,
In this respect, given improvements in electronic access to databases released by various sources, the issue of coherence across data sets can become relevant.
eventually making possible the creation of an open-access same-time information system: a Web-based secure database of transmission-system-related information.
questions and lessons learnt for SMEs based on the existing resources, notably specific assessment tools, checklists and databases,
Carbon Trust guide to carbon footprinting for organisations and products. www.carbontrust.com/resources/guides/carbon-footprinting-and-reporting/carbon-footprinting EC's LCA database
Energy productivity expresses the amount of economic value generated by one unit of energy input or consumption. 5 SERI 2011, Global Material Flow Database. 2011 Version. www.materialflows
A guide to eco-innovation for SMEs and business coaches. MATREC materials database showcases developments in new,
a repository of good practices and online database. www.eco-innovation.eu INNOWATER is a public-private partnership of public innovation agencies, water associations and technology specialists
Visit our website to get open access to our reports, interactive database with charts and maps,
LTE IPR declarations on the ETSI database number 1,860 as of January 11th, 2010. The following graph exemplifies the number of IPR declarations on the LTE project.
and an important feature for building up IPR databases at SSOs
o Ex-ante disclosure of FRAND licensing promise is important for necessary privilege under Art. 101(3) EC Treaty as every
Considerable concern has been raised about their over-reliance on international bibliometric and citation databases, e.g. Thomson Reuters Web of Science or Elsevier-Scopus.
the complexity of knowledge has led to a diverse range of output formats, inter alia, audio-visual recordings, computer software and databases, technical drawings, designs or working models,
with an abstract and citation database of research literature and quality web sources covering almost 18,
According to its webpage, the database includes extensive conference coverage (3.6 million conference papers), 600 trade publications and 350 book series plus 23 million patent records
Suitable databases for a variety of disciplines and research related outputs, especially in social sciences and humanities.
i.e. databases that do not only contain metadata on included
In the exact sciences, peers tend to consider citation impact a relevant aspect in
Citations reflect intellectual influence
Expansion of existing databases and creation of new databases (e.g. based on data from institutional repositories) will
INDICATORS | DESCRIPTION | PRO/POTENTIALITIES | CON/LIMITATIONS
Agree international comparative database. Number and percentage competitive grants won. Level of funding won competitively; this is a sub-set of the indicator above.
Agree international comparative database. Research income per academic staff or FTE. Research income per academic staff or FTE supports cross-institutional comparisons,
and overcome some of the limitations of traditional bibliometric databases. Digital repositories and web-based tools can facilitate scientific collaboration in line with the movement for open science.
they could eventually also help overcome some of the limitations inherent in currently available bibliometric and citation databases.
the creation of bibliometric databases from raw data from Thomson Scientific's Web of Science and Elsevier's Scopus;
Data usually comes from an international database, e.g. Thomson Reuters Web of Science or Elsevier-Scopus.
non-refereed, extracts of paper); edited volumes of conference proceedings; audiovisual recordings; computer software, databases; technical drawings, designs or working models; design (major works
or comes from an international database or increasingly from web-based technologies. University-based data normally requires direct entry by researchers,
Data usually comes from an international database, e.g. Thomson Reuters Web of Science or Elsevier-Scopus.
International bibliometric databases focus primarily on journals that publish full text in English or at very least, their bibliographic information in English.
Additionally the databases from which citations counts are assembled draw upon comprehensive publications sets (9 000 for Thomson Reuters;
Coverage of a certain discipline in the database used: citations might be found in publications not counted here.
However, collaborative research activity, especially with non-academic partners, is not easily reflected in the major international bibliometric databases,
Industry Employment of PhD Graduates: Universities track the career destinations of their PhD graduates and alumni via postgraduation Career Destination Surveys and Alumni Databases.
other research products, e.g. databases, software; third-stream projects; prizes (chemistry: work-done-at principle;
different databases used for the two disciplines) Impact/effectiveness: Number of publications; number of patents;
publication databases (Web of Science and national databases of scientific-scholarly publications) and a survey conducted among professors.
but only a certain sample which is selected on the basis of the database(s) used, the authors' names and the time window used.
If very heterogeneous databases are used, a weighting scheme is applied taking into account the number of pages and of authors.
E-mail, special Excel templates, MS Access database. Dissemination, incl. how much information is available regarding data and methods:
or other national or international databases, have improved the evaluation and peer review processes. As an unintended consequence we should highlight the fact that there has been a change in the behaviour of a substantial number of researchers,
but also taking into account other materials (courseware, seminars or workshops documentation, digital libraries, databases, multimedia, personal pages,)
7 general search engines (Google*, Yahoo Search*, Live (MSN) Search*, Exalead*, Ask (Teoma), Gigablast and Alexa) and 2 specialised scientific databases (Google Scholar* and Live Academic).
All of them have very large (huge) independent databases, but due to the availability of their data collection procedures (APIs), only those marked with an asterisk are used in compiling the Webometrics Ranking.'
and number of publications collected by the Google Scholar database. As already noted, the four indicators were obtained from the quantitative results provided by the main search engines as follows:
These results from the Scholar database represent papers, reports and other academic items. The four ranks were combined according to a formula where each one has a different weight:
fully integrated health information and communication system with a comprehensive database that includes more than 18 years of data on almost 2 million members.
the insertion of the membership card generated an online connection to the Maccabi database for verification of the patient's eligibility to receive services,
confirm, or extend the database with experts from the selected industries like firm representatives, representatives of industry associations and cluster managers.
http://www. strabag. com/databases/internet/public/files. nsf/Searchview/F78949bb3e 4e5796c1257959002c94e5/$file/STRABAG POLEN%20a2%20er%C3%B6ffnung d ez11 e. pdf?
Openelement http://www. euinfrastructure. com/article/Polished-up/http://www. strabag. com/databases/internet/public/files. nsf/Searchview/3ce29a42f9 E61f11c12575e5004d2785/$file/STRABAG
using secondary sources from OECD, Erawatch and the European Regional Monitor database. The European Regional Innovation Scoreboard (2012) provides a comparative assessment of innovation performance across 190 NUTS 1 and NUTS 2 regions of the European Union, Croatia,
Given that this database of 30 cases, together with the accompanying desk research, constitutes a rich resource,
Surfen zum Job operates a database of 8,000 institutions providing ICT facilities and support
Both cases use online platforms for content creation (e.g. databases and tools), and for matching assets (the job seekers and their skills and competences) with the needs of prospective employers.
Eslife also provides a database of the task provider's skills and requirements which are evaluated, rated
As in the other two cases, Skillandar provides a database of the task provider's skills and conditions,
This example uses databases, terminals, RFID tags with barcodes or transponders on items of laundry
It also allows linking to other databases and other groups as well as data and information sharing across all partners
civil groups, journalists to create new content & knowledge & hold corporates to account - Global database of companies, web scraping, reconciliation function, analysis and visualisation tools,
incubated in Open Data Institute, some foundation funding, other civil partners and civil operated - from 3 to 75 jurisdictions by 2014, 60 million companies in database, small fees given
It provides a global open data database of over 60 million companies in 75 jurisdictions, together with web scraping, reconciliation functions, analysis and visualisation tools,
which is made openly available under the share-alike attribution of the Open Database License. In return
now has access to PatientsLikeMe's full database for five years. PatientsLikeMe is an online network of some 250,000 people with chronic diseases who share information about symptoms, treatments,
or use the interactive online database.
112 www.q2l.org
113 http://izonenyc.org/ in New York
114 www.professor-why.pl
117 Focus area
The long term objective is a large database for research projects which makes creating different subsamples as well as control samples possible.
At the same time it collects data on real developments of young children to establish a database that can be used by researchers.
evidence is in principle available in the form of statistical data, analyses and regional innovation intermediaries' databases.
These activities contributed to compiling a continuously increasing regional innovation database to be used later for policy intelligence purposes.
RDA Statistical Database, author's calculations. Several CHOP project calls were announced in support of market-oriented R&D activity.
Actions include setting up an interactive company database, monitoring demand from multinationals, improving conditions of access to capital, promoting cooperation between companies (clusters and virtual companies).
Actions include setting up an interactive company database, monitoring demand from multinational companies, improving conditions of access to capital and current assets, promoting cooperation between companies (clusters and virtual companies),
and modern, networked databases open up countless new valuable research possibilities. In order to take advantage of such opportunities in the digital age,
the museum also implemented a database of all victim groups in Leipzig during the Nazi regime.
In addition to serving public requests, the museum's online database is used for several education services and projects.
The online database connects the museum with local and regional schools thus supporting explorative learning, developing students' research skills, preparing
while at the same time, the museum received about 700,000 online visitors to the collection database. In the end, the museum received about two-thirds more visitors to its digital collections than to its physical exhibition and collections.
that still need to be entered into the online database. To accomplish this, the museum will need another six to eight years
This could be achieved by selecting coaches with strong expertise in
7 E.g. statistics from the IMPROVE benchmarking database with 1,
%) Although the latest figures beyond 2008 are not available in the EUROSTAT database, the same development could be identified concerning the turnover from innovation.
The National Research Infrastructure Register is a searchable database providing information on major research infrastructures (RIs) in Hungary in all fields of science.
RIs include research facilities, resources, related services and their networks like instruments, gene banks, databases,
and regularly update their publications in the scientific bibliography database (i.e. List of Hungarian Scientific Works, MTMT in Hungarian) of the Hungarian Academy of Sciences' Library.
This database has the main purpose "to keep an inventory on Hungarian academic achievements and to make these valuable contributions known to the world in a high quality format.
The database became a central part of the Hungarian research-development information system. The knowledge transfer organisations are weak
(+) MTA and most Hungarian universities require their researchers to register their publications in the scientific bibliography database run by MTA
(-) Lack of national Open Access strategy
Action 2: