Software Engineering

Ontological Text Mining of Software Documents

Paris, France

Abstract

Documents written in natural languages constitute a major part of the software engineering lifecycle artifacts. Especially during software maintenance or reverse engineering, semantic information conveyed in these documents can provide important knowledge for the software engineer. In this paper, we present a text mining system capable of populating a software ontology with information detected in documents.

Task-Dependent Visualization of Coreference Resolution Results

A single coreference chain visualized as a Topic Map

Abstract

Graphical visualizations of coreference chains support a system developer in analyzing the behavior of a resolution algorithm. In this paper, we state explicit use cases for coreference chain visualizations and show how they can be addressed by transforming chains into other, standardized data formats, namely Topic Maps and Ontologies.
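
As a rough illustration of the transformation idea in the abstract above, the following sketch maps one coreference chain to a single topic-map-style topic. The `Mention` and `Topic` classes and the `chain_to_topic` function are hypothetical names chosen for this example. They are not taken from the paper or from any Topic Maps library, and the actual mapping used by the authors may differ.

```python
from dataclasses import dataclass, field


@dataclass
class Mention:
    """One textual mention in a coreference chain (hypothetical structure)."""
    text: str
    start: int  # character offset where the mention begins
    end: int    # character offset just past the mention


@dataclass
class Topic:
    """A topic-map-style topic: one subject, its names, and its occurrences."""
    topic_id: str
    names: list = field(default_factory=list)
    occurrences: list = field(default_factory=list)


def chain_to_topic(chain_id, mentions):
    """Map one coreference chain to one topic (an assumed, simplified mapping).

    Each distinct mention string becomes a name of the topic, and every
    mention position becomes an occurrence.
    """
    topic = Topic(topic_id=chain_id)
    for mention in mentions:
        if mention.text not in topic.names:
            topic.names.append(mention.text)
        topic.occurrences.append((mention.start, mention.end))
    return topic


# Made-up chain: three mentions that refer to the same entity.
chain = [Mention("the parser", 0, 10), Mention("it", 25, 27), Mention("the parser", 40, 50)]
topic = chain_to_topic("chain-1", chain)
print(topic.names)
print(len(topic.occurrences))
```

Treating the whole chain as one topic keeps all surface forms of an entity together, which is the property a developer would want when inspecting what a resolution algorithm has merged.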

Connecting Wikis and Natural Language Processing Systems

Palais des congrès, Montreal, Canada

Abstract

We investigate the integration of Wiki systems with automated natural language processing (NLP) techniques. The vision is that of a "self-aware" Wiki system reading, understanding, transforming, and writing its own content, as well as supporting its users in information analysis and content development. We provide a number of practical application examples, including index generation, question answering, and automatic summarization, which demonstrate the practicability and usefulness of this idea. A system architecture providing the integration is presented, as well as first results from an initial implementation based on the GATE framework for NLP and the MediaWiki system.

General Terms: Design, Human Factors, Languages
Keywords: Self-aware Wiki System, Wiki/NLP Integration
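
To make the index generation use case named in the abstract above more concrete, here is a minimal sketch that builds a term index over a handful of wiki pages. It uses plain regular-expression tokenization rather than GATE or the MediaWiki API, so it only illustrates the idea and not the system described in the paper. The function name `build_index`, the `min_length` parameter, and the sample pages are assumptions made for this example.

```python
import re
from collections import defaultdict


def build_index(pages, min_length=4):
    """Return a mapping from term to the sorted titles of pages containing it.

    `pages` maps a page title to its text. Tokens shorter than `min_length`
    are skipped as a crude stand-in for stop-word filtering.
    """
    index = defaultdict(set)
    for title, text in pages.items():
        for token in re.findall(r"[A-Za-z]+", text.lower()):
            if len(token) >= min_length:
                index[token].add(title)
    return {term: sorted(titles) for term, titles in sorted(index.items())}


# Made-up wiki content for illustration only.
pages = {
    "Ontology": "An ontology models concepts and relations.",
    "Wiki": "A wiki stores pages; an ontology can describe them.",
}
index = build_index(pages)
print(index["ontology"])  # ['Ontology', 'Wiki']
```

A real integration would replace the tokenizer with linguistic analysis (for example, noun phrase detection) and write the resulting index back into the wiki as a page of its own.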

LockMe! for PalmOS

Current version is 1.1.
  Works on PalmOS 2.x and higher
  Developed under Linux with gcc, pilrc and CoPilot.

(This web page is about an old PalmOS security utility of mine, LockMe! Although it is no longer maintained, the tool and its source code are still available.)

Description

LockMe! periodically locks your Palm, starting at a specified time.

An Integration Architecture for User-Centric Document Creation, Retrieval, and Analysis

Toronto

Abstract

The different stages in the life-cycle of content—creation, storage, retrieval, and analysis—are usually regarded as distinct and isolated steps. In this paper we examine the synergies resulting from their integration within a single architecture.

Our goal is to employ such an architecture to improve user support for knowledge-intensive tasks. We present an ongoing case study from the area of building architecture.

Supporting Reverse Engineering Tasks with a Fuzzy Repository Framework

Bad Honnef, the place to go!

Abstract

Software reverse engineering (RE) is often hindered not by the lack of available data, but by an overabundance of it: the (semi-)automatic analysis of static and dynamic code information, data, and documentation results in a huge heap of often incomparable data. Additionally, the gathered information is typically fraught with various kinds of imperfections, for example conflicting information found in software documentation vs. program code.

Our approach to this problem is twofold: for the management of the diverse RE results, we propose the use of a repository, which supports an iterative and incremental discovery process with the aid of a reverse engineer. To deal with imperfections, we propose to enhance the repository model with additional representation and processing capabilities based on fuzzy set theory and fuzzy belief revision.

Keywords

fuzzy reverse engineering, meta model, extension framework, iterative process, knowledge evolution
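
The fuzzy set part of the approach described in the abstract above can be illustrated with a small sketch. It represents what two sources (documentation and code analysis) suggest about the same question as fuzzy sets, and combines them with the standard minimum and maximum operators from fuzzy set theory. The class names, membership degrees, and the function names `fuzzy_union` and `fuzzy_intersection` are made up for this example. The paper's repository model and its fuzzy belief revision are not reproduced here.

```python
def fuzzy_union(a, b):
    """Standard fuzzy union: element-wise maximum of membership degrees."""
    return {k: max(a.get(k, 0.0), b.get(k, 0.0)) for k in a.keys() | b.keys()}


def fuzzy_intersection(a, b):
    """Standard fuzzy intersection: element-wise minimum of membership degrees."""
    return {k: min(a.get(k, 0.0), b.get(k, 0.0)) for k in a.keys() | b.keys()}


# Hypothetical degrees to which each source suggests that a class belongs
# to the "persistence layer" of the analyzed system.
from_docs = {"OrderStore": 0.9, "Cache": 0.6}
from_code = {"OrderStore": 0.7, "Cache": 0.1, "Mailer": 0.2}

agreed = fuzzy_intersection(from_docs, from_code)   # supported by both sources
possible = fuzzy_union(from_docs, from_code)        # supported by at least one

for name in sorted(possible):
    print(name, agreed[name], possible[name])
```

A large gap between the two degrees for the same element, as for `Cache` in this made-up data, is one simple way to flag a conflict between documentation and code for review by the reverse engineer.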

Engineering a Semantic Desktop for Building Historians and Architects

Page scan from 'Handbuch der Architektur'

Abstract

We analyse the requirements for an advanced semantic support of users—building historians and architects—of a multi-volume encyclopedia of architecture from the late 19th century. Novel requirements include the integration of content retrieval, content development, and automated content analysis based on natural language processing.

We present a system architecture for the detected requirements and its current implementation. A complex scenario demonstrates how a desktop supporting semantic analysis can contribute to specific, relevant user tasks.

Agents and Databases: Friends or Foes?

Friendly Meetings in Vancouver

Abstract

At first glance, agent technology seems more like a hostile intruder into the database world. On the other hand, the two could easily complement each other, since agents carry out information processes whereas databases supply information to processes. Nonetheless, viewing agent technology from a database perspective seems to question some of the basic paradigms of database technology, particularly the premise of semantic consistency of a database. The paper argues that the ensuing uncertainty in distributed databases can be modelled by beliefs, and develops the basic concepts for adjusting peer-to-peer databases to the individual beliefs in single nodes and collective beliefs in the entire distributed database.

Architektur von Fuzzy-Informationssystemen

(This web page is about my book, "Architecture of Fuzzy Information Systems", which is written in German. You can try a Google translation.)

Book cover

Architektur von Fuzzy-Informationssystemen

by René Witte

ISBN 3-8311-4149-5

330 pages, 82 figures

Copyright © 2002 René Witte
All rights reserved by the author.

Where to buy

Description

Owing to the models and technologies they employ, today's information systems assume that the data they manage is always precise, certain, and consistent. Reality looks different: information is in fact often imprecise, vague, uncertain, or inconsistent.

Especially in complex information systems that aim to model reality as faithfully as possible, one does not want to lose these so-called imperfections but rather to represent them explicitly, so that developers and users can benefit from them: a bank, for example, has a strong interest in a correct description of a customer's creditworthiness; an environmental information system must convey credible data about the pollution burden of a region, as must a traffic guidance system about possible congestion risks. Business-to-business marketplaces need information about the reliability of business partners, and digital libraries about the relevance of retrieved text passages.

Such fuzzy and uncertain data can be modeled with so-called fuzzy theory, which is already used successfully in industry in many other fields, such as control engineering. For information systems, however, there has so far been no systematic approach to extending existing models, technologies, and architectures that remains compatible with established standards and embeds the new capabilities in an orthogonal way. This book, which is based on the author's dissertation, presents for the first time a complete architecture model for the development of fuzzy information systems. After an introduction to the necessary foundations of fuzzy theory, a model suitable for information systems is constructed formally, and it is shown how this model can be realized with common object-oriented languages. Finally, for system development, a suitable reference architecture is presented that is oriented toward current multi-tier client/server architectures.

In addition, the book offers practitioners two concrete application examples, a fuzzy decision support system and a fuzzy text analysis system, which are used to describe the development of fuzzy applications in detail.

Favourite Framework (Architecture) for NLP/Text Mining?
