.w. “ \ .‘Z‘ ‘.. ' '1..-"
sagas “Egg—11% - - "‘2;
.35

”v. ‘2- f . “‘3 ’* .' U: 5‘:
mi '

_ .

¢1t4é§£§ 3.32“? "“3“
M¢LAc3~&&1%L“ sit-€17... ‘23:; "‘3'
.. ’i' u ‘ﬁ‘ZR’E-é .."‘"°'¥Z\"‘? 3‘ ’

A...
3:: M ,. f‘
gac\;:,a~.§.n_<w .. x:

. _, 4:33“ 3:- r - -
a“ a. a. .. _

‘ u- k‘ 0" KI. 9.
:a “: :*x39’u::z
”15%;? W

4. '2'.“ . 513% ' ‘3‘
—:' - 1 ‘JLa' V

A ‘l. i:~‘.,L‘,-..s.- 3

54L 1133‘ 7." ‘ ‘ '

a. .-.-.\r’.<~‘"‘;.-"“‘

w 57*...ﬁt-‘vtuh 75w

‘2: 1. “.u-A

‘.

Kg'.‘ 5-
:57.

3.
r;

«I

h'.‘2‘~’
.h R
=Lu2523r- 3?; ~
‘ AA,» «ha. .33.:173‘:
a ,‘1“{ - V‘
n‘

.107‘- '19

fg“

«W155

,

no. ‘ ‘L‘Y g
V ‘- “Awﬂﬁ

* gage.
ﬂask «a: "-

who“;
. g...
-u*

V! ;

‘ttfp. .333“ M:
H . “,8 .
ﬂ 247:3.“- .. .

wr- {'- zs; é? “ *
wing,

,.. .
49:. ~23! -'
..L. t: .:

caul- .

:‘u‘
;v I
1 «333?. _v

a
.um Irv :- ‘

. r

 

 

9 15:. M
‘ stf"v4%. J
A v. . .
»‘ ...‘,‘.
*g. {4;
i5, 93;"

 

§
M...
‘1 s

‘4

 

THESIS

llllll

 

J lllllllllllWill\llllll ll‘lL ll

This is to certify that the

dissertation entitled

Applying Formal Methods to Software Reuse
presented by

Jun-dang Jeng

has been accepted towards fulﬁllment
of the requirements for

Ph.D. Computer Science

 

 

degree in

- Lu

Major professor

12/23/93
Date

 

MS U is an Afﬁrmative Action/Equal Opportunity Institution 0-12771

 

 

 

 

 

LIBRARY
Michigan State
University

 

 

 

PLACE ll RETURII BOX to romovothlo checkout from your record.
To AVOID FINES Mum on or baton doto duo.

DATE DUE DATE DUE DATE DUE

 

 

 

 

 

I ”J T——‘ ——l '
L-:-_[———:_J
---J

--:J———
-—-J[-— ---J——-
flat—J

 

 

 

 

 

 

 

 

 

 

 

 

 

MSU Io An Atﬂnnotlvo ActIon/Equol Opportunity Institution
Wm

 

 

 

.L‘L

 

Applying Formal Methods to Software Reuse

By

Jun-Jung Jeng

A DISSERTATION

Submitted to
Michigan State University
in partial fulﬁllment of the requirements
for the degree of

DOCTOR OF PHILOSOPHY

Department of Computer Science

1993

ABSTRACT

Applying Formal Methods to Software Reuse
By

Jun-Jung Jeng

This dissertation presents an approach, based on formal methods, to the speciﬁca-
tion, classiﬁcation, retrieval, and modiﬁcation of reusable software components. From
a set of reusable components that are described by formal speciﬁcations, a two-tiered
hierarchy of software components is constructed. The hierarchical structure provides
a means for representing, storing, browsing, retrieving, and modifying the reusable
components; furthermore, the formal speciﬁcations provide a means for verifying that
a given software component correctly satisﬁes the current problem. The lower-level
hierarchy facilitates the application of logical reasoning techniques for a ﬁne-grained,
exact determination of reusable candidates. The higher-level hierarchy provides a
coarse-grained determination of reusable candidates. Based upon the framework of
the two-tiered component hierarchy, a set of candidate components, which are more
general than or analogous to the query speciﬁcation, can be retrieved from the hi-
erarchy. Two methods are proposed for the modiﬁcation of candidate components
in order to satisfy the query speciﬁcation. One is to modify the component that is
more general than the query speciﬁcation; the other is to modify a component based

on analogy. The graphics-based implementation of the reuse framework is described.

ACKNOWLED GMEN TS

First of all, I’d like to thank my advisor, Dr. Betty H. C. Cheng for her constant
support over the years. Her advice, encouragement, and willingness to allow me so
much freedom of direction, provided a wonderful research environment to work.

I am grateful to my committee members, Professors Abdol H. Esfahanian, Jacob
M. Plotkin, and John Geske, for their interests in my work, and their helpful sug-
gestions. I am also indebted to Dr. Moon-Jung Chung for his guidance at the early
stage of my doctoral research.

My appreciation also extends to my colleaques of Computer Science Department at
Michigan State University for providing a great research atmosphere, and my friends
for making my life bountiful during these years at East Lansing.

This dissertation would not have been possible without the support from my
parents, sisters, and brother. Last but not least, I want to thank my dear wife

Shuefung for her continued trust and love to make this dissertation a reality.

iii

TABLE OF CONTENTS

LIST OF FIGURES viii
1 Introduction 1
1.1 The Need for Software Reuse ...................... 1
1.2 The Need for Formal Methods ...................... 3
1.3 Research Contributions .......................... 5
1.4 Organization of Chapters ......................... 6

2 Component Reuse E'amework 7
2.1 Research Problems/ Challenges ...................... 7
2.1.1 Speciﬁcation ............................ 8

2.1.2 Classiﬁcation ........................... 8

2.1.3 Retrieval .............................. 8

2.1.4 Modiﬁcation ............................ 9

2.1.5 Other Issues ............................ 9

2.2 Reuse Framework ............................. 9
2.3 Reuse Tools ................................ 12

3 Speciﬁcation of Reusable Components 15
3.1 Introduction ................................ 15
3.2 Larch Shared Language .......................... 17
3.3 Larch Interface Language ......................... 19
3.4 Component Speciﬁcation ......................... 21
3.5 Method Speciﬁcation ........................... 24
3.6 Subtypes and Subclass .......................... 24
3.7 Deﬁning Generality ............................ 28

4 Construction of Hierarchical Component Library 31
4.1 Lower- Level Hierarchy .......................... 31
4.1.1 Determining Generality ...................... 32

4.1.2 Building Lower-Level Hierarchy ................. 33

iv

4.2 Higher-Level Hierarchy ..........................
4.2.1 Measure of Similarity between Components ..........
4.2.2 Hierarchical Clustering ......................
4.2.3 Hierarchical Clustering Algorithm ................

4.3 Implementation ..............................
4.3.1 Browsing Hierarchy ........................
4.3.2 Implementation of the Construction of Hierarchy .......

4.4 Summary .................................

Search and Retrieval of Reusable Components

5.1 Hashing Scheme for Software Components ...............
5.2 Retrieval Algorithm ............................
5.3 Implementation of Retrieval Process ...................

Modiﬁcation of Reusable Components Based on Generality

6.1 Introduction ................................
6.2 Predicate Transformer wp ........................
6.2.1 Simple C++ Statements ......................
6.2.2 The abort Statement .......................
6.2.3 The ship Statement ........................
6.2.4 The assignment (y := E) Statement ..............
6.2.5 The alternative (IF) Statement .................
6.2.6 The iterative (D0) Statement ..................
6.2.7 The compound (50; 51) Statement ................
6.3 Modifying a More General Component .................
6.4 Modiﬁcation Process ...........................
6.5 Modiﬁcation Example ..........................
6.6 Summary .................................

Modification of Reusable Components Based on Analogy

7.1 Introduction ................................
7.2 Analogical Matching ...........................
7.3 Heuristics for the Matching Process ...................
7.4 Top-Down Matching Approach ......................
7.5 Matching Algorithm ...........................
7.6 Matching Example ............................
7.7 Modiﬁcation Example ..........................
7.8 Summary .................................

38
39
43
46
47
47
49
53

55
55
58
62

65
65
66
68
68
69
69
69
72
73
73
76
78
84

86
86
89
90
92
95
99

108

8 Case Study 110

8.1 Introduction ................................ 110
8.2 Widgets and Larch ............................ 111
8.3 The Speciﬁcation for Widgets ...................... 113
8.3.1 Primitive Widget ......................... 114
8.3.2 Object and SetResource ...................... 115
8.3.3 Well-Formed Window ....................... 119
8.4 Speciﬁcation of Scrolled Text Editor with Popup Menus ........ 120
8.4.1 The X Model ........................... 121
8.4.2 The LSL Level .......................... 122
8.4.3 The Interface Level ........................ 129

8.5 Applying Reuse Processes to Speciﬁcation Components of Xt/Motif
Widgets .................................. 132
8.5.1 Construction Process ....................... 134
8.5.2 Retrieval Process ......................... 138
8.5.3 Modiﬁcation Process ....................... 138
8.6 Summary ................................. 141
9 Related Work 145
9.1 Related Work for Reuse ......................... 145
9.2 Related Work for Analogy ........................ 150
9.2.1 Analogy in Al ........................... 150
9.2.2 Analogy in Software Engineering ................ 153
9.3 Related Work of Specifying GUI ..................... 156
10 Concluding Remarks 158
10.1 Summary ................................. 158
10.1.1 Construction of Component Hierarchy ............. 158
10.1.2 Incorporation of Formal Methods to Software Reuse ...... 159
10.1.3 Retrieving Reusable Components from the Hierarchy ..... 159
10.1.4 Modifying More General Components .............. 160
10.1.5 Modifying Analogous Components ............... 160
10.2 Future Work ................................ 161
A Analogical Reasoning 163
B Bottom-Up Matching Approach 167
C The Speciﬁcations for Motif Widgets 169

vi

C.1 The Functionalities of Motif Widgets .................. 169

C.2 The LSL Traits of Motif Widgets .................... 171
C.3 The LIL Speciﬁcations of Widgets .................... 179
D Algorithm to ﬁnd LGCs 183
E Program Listing 186
El C++ Implementation for List ...................... 186
E2 C++ Implementation for Array ...................... 188
E3 C++ Implementation for Stack ...................... 189
EA C++ Implementation for DoubleList .................. 190
E5 A Window with Pulldown Menu (Existing Component) ........ 193
E.6 A Window with Popup Menu (Modiﬁed Component) ......... 196
BIBLIOGRAPHY 200

vii

2.1
2.2
2.3

3.1
3.2
3.3
3.4
3.5
3.6
3.7
3.8

4.1
4.2
4.3
4.4

4.5
4.6
4.7
4.8
4.9
4.10
4.11
4.12
4.13
4.14

4.15

LIST OF FIGURES

Development and Reuse Modules .....................
Overview of Component Reuse Module ..................
Reuse Tools ................................

An Example Trait: Table. ........................
More Examples of Traits ..........................
Basic Construct for Component Speciﬁcations .............
Component interfaces for intSet ......................
Grammar for Component Methods. ...................
The Borland C++ Subclass Relationship. ................
Relations Among the Traits .......................
Subsumption Test Algorithm based on 2...... .............

Resolution Rule ..............................
P1111 Resolution Rule ...........................
Modiﬁed Resolution Rule .........................
Using MST to decide the generality relationship between components
Camp; and Comps. ...........................
Building the lower-level hierarchy by pair-wise comparison .......
Building lower-level hierarchy by recursive comparison. ........
Example of building hierarchy by recursive comparison .........
Example of comparing two SOLs. ....................
Matrices for components X and Y. ...................
Similarity matrix 3’ (X, Y) ........................
Reﬁnement of partitions in an agglomerative clustering algorithm

Agglomerative Hierarchical Clustering Algorithm ...........
Hierarchical Clustering Algorithm ....................
Two-tiered hierarchy formed by the subsumption test and the cluster-
ing algorithms ..............................
Sample application of subsumption test algorithm ...........

viii

10
13
14

22
23
25
26
29
30

32
32
33

34
35
37
38
39
42
43
45
46
48

50

4.16
4.17
4.18

5.1
5.2
5.3
5.4

6.1
6.2
6.3
6.4

7.1
7.2
7.3
7.4
7.5
7.6

8.1
8.2
8.3
8.4
8.5
8.6
8.7
8.8
8.9
8.10
8.11
8.12
8.13
8.14
8.15
8.16
8.17
8.18

Sample application of clustering algorithm ............... 51

Speciﬁcation of Component Queue. ................... 52
Speciﬁcation of Component DoubleQueue. ............... 54
Knuth-Bendix Ordering ......................... 59
The Process of Retrieving Software Components. ........... 60
The Algorithm for Retrieval by Subsumption Test. .......... 61
Sample application of construction and retrieval processes ...... 63
Modifying Implementation Based on Speciﬁcation Changes ....... 75
The Process of Modifying More a more general method ......... 77
The LCL Speciﬁcation of List Class. .................. 79
The LCL Speciﬁcation of Array Class ................... 81
Analogical Reuse Modiﬁcation Process .................. 88
Matching Algorithm ........................... 96
Simple example of the matching algorithm ................ 100
Larch speciﬁcation of Stack class ..................... 102
Larch speciﬁcation of DoubleList class. ................ 103
An implementation for matching process ................. 105
The Manager Window trait. ....................... 112
Part of the ScrolledTezt speciﬁcation ................... 113
The dependencies of the Window’s traits ................. 114
The Basic Window trait. ......................... 115
The Primitive trait ............................. 116
The Object trait. ............................. 118
The SetResource trait. .......................... 118
The WFWidget trait ............................ 120
An Example of Scrolled Text Editor with Popup Menus. ....... 121
Parent-child relationships between widgets ................ 123
The LtScrolled trait. .......................... 124
The LtTextField trait. ......................... 126
The Lt'l‘ext trait .............................. 127
The LtMenu trait .............................. 128
The LtScrolledText trait. ....................... 129
The LtScrollod‘l‘oxtMenu trait ...................... 130
Part of the Interface Speciﬁcation of Primitive Widget. ....... 131
The Larch Interface Speciﬁcation of Popup ................ 132

ix

8.19
8.20
8.21
8.22
8.23
8.24
8.25
8.26
8.27

A.1
A.2
A.3

D.l
D.2

The Larch Interface Speciﬁcation of Popupmenu. ............ 133
The unstructured widget components ................... 135
The lower—level hierarchy of widget components ............. 136
The two-tiered hierarchy of widget components. ............ 137
An example of retrieving an exact—matched component ......... 139
An example of retrieving a set of more general components ....... 140
An example of computing matching between two methods. ...... 142
The output of the pulldown menu implementation. .......... 143
The output of the popup menu implementation. ............ 143
Gentner’s Model of Analogy ....................... 164
P1 Model ................................. 165
Analogy and Generalization ....................... 166
The algorithm to ﬁnd a set of LGCs. .................. 184
Coloring, joining, and collecting a partial ordering set. ........ 185

CHAPTER 1

Introduction

Software reuse has been claimed to be a means for overcoming the software crisis
[1, 2, 3, 4]. Some of the major issues that make software reuse difﬁcult are com-
ponent classiﬁcation, retrieval, modiﬁcation, and library maintenance. Research in
Software Reuse has great potential to facilitate an improvement in the productivity,
the quality, and the reliability of software development. Results from this research
serve to enhance the practice of software reuse by using formal methods to provide a
means to construct a hierarchical structure of reusable software components, retrieve
reusable software components from the constructed hierarchy, and modify retrieved

reusable components based on a comparison between existing and user-supplied spec-

iﬁcations.

1.1 The Need for Software Reuse

The rapid advancement in the hardware industry has provided users and developers
with increased and less expensive processing power and storage capacity in a rela-
tively short period of time. This new technology facilitates new applications, thus
leading to a growing demand for reliable and functionally complete systems. In to-

day’s technology race, software has proven to be the bottleneck in the production

process [5, 6, 7, 8, 9]. Delivered software is often of poor quality and is very difﬁcult
to maintain. Software reuse has the potential to increase the quality of developed
software. Software reuse has been practiced for years, for example, the libraries of
mathematical subroutines, report generators, and many application packages. How-
ever, in order to achieve the desired beneﬁts, software reuse must be expended beyond
small- or medium-sized software systems. In order to handle the size and complexity
of modern software, we are obliged to solve the problem of Very Large Scale Reuse
(VLSR). VLSR introduces a new area of research problems centered around the is-
sue of making the software representation sufﬁciently general to allow reuse over a
broad range of target systems, and of automating the reuse process, including the
understanding and adaptation of reusable components for a new system.

For more than twenty years, programmers dreamed of developing software system
that are engineered like traditional physical systems. They expressed great interest in
having a software components catalog from which software parts can be assembled,
much as is done with mechanical engineering or electronic engineering. Reusable
software component libraries have been suggested as a means for facilitating software
reuse [2, 3, 10]. The idea of a software component library was ﬁrst invented by Mcllroy
in the NATO Software Engineering Conference of 1968 [11]. McIlroy proposed a
library of reusable components and automated techniques for customizing components
for'diﬁ'erent degrees of precision and robustness. A similar view based on software
103 has been proposed by Cox [12].

There is considerable room for transferring general purpose engineering design
and management techniques to reusable software component’s design and manage-
ment. Reusable hardware components, such as ‘I'TL integrated circuits are universally
used in hardware design [4]. In the Texas Instruments’ 'l'l'L Data Book, a diverse
collection of device abstractions are presented to help the users understand and select

among candidate components. The analogy is often drawn between 'I'TI. components

 

and offothe-shelf reusable software components. Some researchers believe that ﬁnd-
ing an analogous diverse collection of expositions for the understanding of reusable
software components is the fundamental operational problem that must be solved in
the development of any reuse system. However, general purpose reusable components
libraries will likely consist of several orders of magnitude more components than the
libraries of hardware components. The issue of scale will make large scale reusable
component libraries more difﬁcult to understand and use than the hardware analogy.

There are notable differences between software and products from other engineer-
ing disciplines, and these differences induce some technical and non-technical issues
of reusable software components. Despite some progress in the development of reuse
systems in recent years, most software systems are not developed from reusable com-
ponents.

There are both technical and non-technical impediments to achieving software
reuse. The examples of non-technical impediments include management resistance,
psychological barriers etc [13]. Serious efforts must be expanded in addressing these
two issues in order to achieve success, much as is true for software engineering. How-
ever, in this dissertation, only the technical issues will be addressed. The premise is
that with a solid foundation for software reuse, the tasks for the managers, designers,

programmers, and users will be greatly facilitated in making software reuse a reality.

1.2 The Need for Formal Methods

The goal of software reuse is to increase the productivity of programmers and improve
the quality of developed software [2]. Software reuse has been impeded by the lack of
effective techniques for representing and managing software component libraries. In
current industrial settings, software reuse is often based on the availability of the origi-

nal authors to act as consultants or the existence of relevant and descriptive documen-

 

tation. Another approach is to exploit information retrieval methods that are based
on analyses of natural language documentation. These methods have also been pro-
posed for constructing software libraries [14, 15]. Unfortunately, software components
represented by natural language may hinder the retrieval process due to the problems
of ambiguity, incompleteness, and inconsistency inherent to natural languages. Nei-
ther of these options facilitate the automation of software reuse determination, nor do
they provide a means for verifying that a given software component correctly satisﬁes
a new problem speciﬁcation. The above mentioned problems can be minimized by
using formal speciﬁcations to represent software components [16, 17, 18, 19] and ap-
plying formal methods to the software artifacts that are formally described. Formal
methods provide a means for establishing properties by formal reasoning. This formal
reasoning is not possible by means of informal methods. Rigorous reasoning guar-
antees the correctness of some properties via strict argumentation. Formality also
allows for showing consequences of a speciﬁcation. Therefore, implicit contradictions
of requirements that are not obvious in the process of software development can be
detected in an early stage of software production.

Formal methods enable people to write speciﬁcations that have precise meanings.
Precision demands deeper insight into the problem domain than informal speciﬁca-
tions. Speciﬁcations usually become much clearer when expressed in a formal notation
and then rephrased into natural language. Thus, formal speciﬁcations can help clients
comprehend what they want to purchase. Formal speciﬁcations can be used to guide
the implementation of a system [20, 21]. Because of the precision of the description,
the programmers know exactly what needs to be implemented. Formal speciﬁcations
help clarify what the purpose of each component of a system, therefore increasing the
maintainability property. Formal methods also lead to software components that are
easier to reuse. A precise and clear description as provided by a formal speciﬁcation

can provide assistance to the user of a software library.

 

 

As a consequence of all the advantages mentioned above, formal methods decrease
the cost of development. The investment in formal methods in the early stages of
production aims to eliminate errors and ambiguity and to discover missing properties

in the requirements and speciﬁcations. These assets minimizes costs impoud during

maintenance and implementation.

1.3 Research Contributions

This dissertation presents an approach, based on formal methods, to the speciﬁcation,
classiﬁcation, retrieval, and modiﬁcation of reusable software components. Rom a
set of reusable components that are described by formal speciﬁcations, a two-tiered
hierarchy of software components is constructed. The hierarchical structure provides
a means for representing, storing, browsing, retrieving, and modifying the reusable
components; furthermore, the formal speciﬁcations provide a means for verifying that
a given software component correctly satisﬁes the current problem. The lower-level
hierarchy is created by a subsumption test algorithm; this level facilitates the ap-
plication of logical reasoning techniques for a ﬁne-grained, exact determination of
reusable candidates. The higher-level hierarchy provides a coarse-grained determina-
tion of reusable candidates and is constructed by applying a hierarchical clustering
algorithm to the most general components from the lower-level hierarchy. Based upon
the framework of the two-tiered component hierarchy, a set of candidate components,
which are more general than or analogous to the query speciﬁcation, can be retrieved
from the hierarchy. Two methods are proposed for the modiﬁcation of candidate
components in order to satisfy the query speciﬁcation. One is to modify the com-
ponent that is more general than the query speciﬁcation; the other is to modify an
analogous component. All of the above reuse tools are incorporated and integrated

within a graphical reuse framework. In addition, we present several examples, includ-

ing a large scale example, to illustrate the applicability of the reuse framework on

real problems.

1.4 Organization of Chapters

The remainder of the dissertation is organized as follows. Chapter 2 gives the frame-
work of our reuse system for construction, retrieval, and modiﬁcation of reusable
software components. Major research issues associated with component reuse are
identiﬁed. Chapter 3 describes the formal speciﬁcation language that we use to spec-
ify the reusable software components. Chapter 4 focuses on the task of constructing
the two-tiered hierarchy of reusable components. Chapter 5 presents retrieval pro-
cesses for ﬁnding the candidate components from the two-tiered hierarchy. Chapter 6
describes a method to modify those retrieved components that are more general than
the query speciﬁcation. Chapter 7 describes a new approach to modifying retrieved
components that are analogous to the problem speciﬁcation. Chapter 8 presents a
moderate-sized example to illustrate the application of our reuse processes. Chap-
ter 9 reviews other projects that are related to our work. Chapter 10 summarizes the

dissertation and suggests future investigations.

 

CHAPTER 2

Component Reuse Framework

In order to make software reuse successful, several problems must be resolved includ
ing software classiﬁcation, retrieval, modiﬁcation, and library maintenance [22, 23].
Software component libraries have been suggested as a means for facilitating software
reuse [2, 3, 10, 24]. The potential beneﬁts of software reuse and of large component
libraries are obvious but the promise of increased productivity and quality remain
elusive because several issues (problems) have not been resolved. In this chapter, we
describe the research issues associated with component reuse and give an overview
of our component reuse framework. Based on this framework, we have developed
several reuse processes to facilitate the reusability of software components. The reuse

tools associated with the reuse processes are also brieﬂy described at the end of this

chapter.

2.1 Research Problems / Challenges

Reuse is a simple idea: attempt to use something for one purpose that was originally
intended and used for another purpose. It can be a procedure invoked several times
with different parameters. However, it is nontrivial to deﬁne the object that we will

reuse. A software component may consist of one or more of the following related items:

 

source code, speciﬁcations, requirement, architectures, and test cases, thus indicating
that we need to reuse more than code [25]. This dissertation focuses on the reusability
of software components based on formal speciﬁcations of the software components. In
general, to build a reuse system based on software components involves the following
techniques: speciﬁcation, classiﬁcation, retrieval, and modiﬁcation. Each of these

topics will be described in turn.

2.1.1 Speciﬁcation

Speciﬁcation describe the behavior of reusable software components. One of the major
tasks in developing large reusable component libraries is in providing concise semantic
abstractions for components. It is not reasonable to present a large number of source
code components to a user and expect the source code to serve as the only description
of the component behavior. What is required is an abstraction speciﬁcation level that
describes the behavior of the component in the library in a more succinct form. A
higher level than source code should be utilized because we want to emphasize what

the component does rather than how it does for the purpose of reuse.

2.1.2 Classiﬁcation

Classiﬁcation addresses the process how the system/ designer catalogs the reusable
components in the software component library. Techniques may include the con-
struction of a component hierarchy and domain analysis. This dissertation focuses on

automating the former technique. The latter one has largely been addressed in [26].

2.1.3 Retrieval

Retrieval concerns the process how the programmers identify and retrieve a set of

reusable components from the software component library that could be unstructured

or hierarchical depending on the scheme of the construction process [27]. A retrieval

process should also provide some type of browsing mechanism to assist the user in

navigating through the library.

2.1.4 Modiﬁcation

' Modiﬁcation refers to changes to existing components needed to satisfy new/current
problem needs. Direct modiﬁcation of source code is a difﬁcult task for most reusers
of the software components from a component library. Therefore, we are interested
in modifying the speciﬁcations of retrieved components that are more general than
or analogous to the query speciﬁcation. A user can modify the source code based on

the information of the speciﬁcation that is required to be changed.

2.1.5 Other Issues

Decomposition of the query speciﬁcation into simpler queries reduces the retrieval
complexity and enhances the reusability of the component library. Integration of
reusable components concerns the framework in which a collection of components
can be combined to form a set of reusable components and an executable system.
Synthesis of the complete reusable component represented formally speciﬁed refers to

the development of an executable code component from the speciﬁcation component.

2.2 Reuse Framework

Our model for reusing software components divides the conventional life-cycle models
into two phases: ﬁrst, the component development module delivers developed com-
ponents; second, the component reuse module supplies reusable components to com-
ponent development module and stores the developed components into the software

components library. The latter module primarily addresses the reusability of existing

 

 

10

library components and can be decomposed into several processes. Our approach to

reusability is based on software models and formal methods. The overview of com-

 

 

 

formaI/informal requirements

Component Component

Development ‘ reusable components Reuse

MOdUIC , developgd components I MOdulc

 

 

 

 

 

 

Figure 2.1. Development and Reuse Modules.

 

ponent reuse module is shown in Figure 2.2. Each box represents a different process
of the component reuse module.

Reusable Software Component Library contains the reusable software com-
ponents that include the requirements, speciﬁcation, source code, and other sup-
porting documents. Components must be organized under a classiﬁcation scheme to
facilitate any searching technique. The process for the Construction of a Hierarchy
supports to build up the hierarchies to classify the components in the library. Au-
tomation for component selection will be very essential since a manual search through
a library containing thousands of components is not practical. The Retrieval pro-
cess narrows the choice of candidate reusable components by applying automated
reasoning and hashing techniques to the speciﬁcations of software components in the
component library. The Modiﬁcation process modiﬁes the candidate components
to suit the required speciﬁcation. The modiﬁcation needs to be accomplished at a
higher level than the code level since direct editing of source code should be avoided as

much as possible. The above four processes (speciﬁcation, construction, retrieval, and

ll

modiﬁcation) are speciﬁcally addressed in this dissertation. However, other processes
are shown in Figure 2.2 in order to give a complete overview of our reuse framework.

The Component Development Module is project oriented. Its goal is to
deliver the products required by the customer and can be modeled by the traditional
software model, e.g., waterfall model. Two kinds of requirements may be given from
this module: informal or formal. Informal requirements need to be converted into
formal requirements, represented in the Larch speciﬁcation language for our system,
by two processes, Requirement Analysis and Speciﬁcation Editor. We believe
the full automation of these two processes are beyond current technology. But some
progress has been made in helping users create formal speciﬁcation from informal
requirements [28, 29].

In order to reduce the complexity of the Retrieval and Modiﬁcation processes,
decomposing the required speciﬁcation into “smaller” speciﬁcations is necessary. But
some issues are still not clear, for example, “how to deﬁne the size of a speciﬁca-
tion 7”, “how to automate the decomposition of a speciﬁcation ?” and “what is the
cptimal level of granularity ?” etc. The Integration process attempts to integrate
the modiﬁed components to a reusable component. The correctness of components
should be preserved after integration. After obtaining a reusable component that ﬁts
the requirement from the Component Development Module, the process Synthesis
transforms the reusable speciﬁcations into executable code such as C or Pascal based
upon the information from the modiﬁcation process since we do not want to reim-
plement software that already exists. The classiﬁcation of reusable components in a
library can be based on the application domain: each component hierarchy belongs
to a distinct domain. Examples of domains are ﬂight control system, information
management software for insurance system, and so on. The purpose of the Domain
Analysis process is to catalog the reusable software components into different appli-

cation domains and to ensure that the components within a given hierarchy are in

12

the same domain.

2.3 Reuse Tools

The reuse processes of our system can be realized by the reuse tools shown in Fig—
ure 2.3. The Constructor performs the construction process with input compo-
nents from either existing libraries or a group of newly developed components. The
Browser enables the user browse through the constructed component hierarchy. The
Retriever retrieves a set of candidate components from the component library based
on the user’s query and returns them to the user. The user then can either accept
the candidate components or submit them to the Modiﬁer, thus requesting assis-
tance from the system in the modiﬁcation of the candidate components. Figure 2.3
only gives the reuse tools in which our investigations address. Other tools such as
the component speciﬁcation editors, component synthesizer, and query speciﬁcation
decomposer are also important in the Component Reuse Framework, but they are

beyond the scope of this dissertation.

 

 

reusable components

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 
  
 

 

 

 

  
 

 

 

 

 

 

: informal query requirementu . .
Component : '1 Requirement Analysrs]
l
Development l formal query speciﬁcations S if .
Module : Query pee icatIon ] '
developed I l . .
components 5 Decomposition l
l!
Domain '"
Analysis
h.

 
 

. znmjjggngpi. f‘. y" - ,. Integration
I. ., ‘deu '3 ,V"“. t

_- . 'j- I
‘ V l | Documentation Fl

Component Reuse Module

 

 

 

 

Figure 2.2. Overview of Component Reuse Module.

 

 

l4

 

 

 

 

components

 

  

Constructor

 

 

component hierarchy

existing
components

 

J

 

User Brow
Modifier
candidate

components

1

{

 

 

 

l
@

Retriever

Figure 2.3. Reuse Tools

 

 

Component
Library
retrieved

components

 

CHAPTER 3
Speciﬁcation of Reusable

Components

3.1 Introduction

A formal speciﬁcation is a description that is expressed in a notation whose syntax and
semantics are well-deﬁned. Musical compositions are commonly speciﬁed in standard
notation, which is understood by musicians everywhere. Such notations help us to
specify things very precisely, completely, and unambiguously.

Most software is made up of procedural and data abstractions, that is, procedures
and user-speciﬁed and system-deﬁned data structures [30]. Object-oriented analysis
can be used to decompose complex software, which involves deﬁning a set of user-
speciﬁed data abstractions or abstract data types (ADTs) [31, 32, 33, 34, 35, 36]. Thus,
in order to apply an object-oriented approach to software reuse, we focus on data ab-
straction, where it is assumed that procedural abstractions are implicitly addressed
when discussing the operations that are applicable to the data abstractions. The
speciﬁcation for a software component corresponds to the speciﬁcation of an ADT and
a set of methods that operate on that ADT. The specifications of object-oriented (or

object-based) languages have been explored by many researchers [37, 38, 39]. There-

15

16

fore, we intend to build on what has already been done when choosing a speciﬁcation
language for ob ject-oriented programming languages.

The formal speciﬁcation language used in this work is Larch [39, 40]. Larch was
designed to specify the properties of ADTs. The advantage gained in using Larch is
the explicit separation of concerns between state-independent and state-dependent
properties. Larch provides a two-tiered approach to speciﬁcation. In the ﬁrst tier, the
speciﬁer writes traits in the Larch Shared Language (1.31.) to assert state-independent
properties of a program. Each trait introduces sorts and operators and deﬁnes equality
between terms composed of the operators.

In the second tier, the speciﬁer writes module specification in a Larch interface
language to describe state-dependent effects of a program, such as GCIL [41]. A
requires clause states each procedure’s precondition; a modiﬁes clause lists those
objects whose values may possibly change; an ensures clause gives its postcondition.
The assertion language for the pre- and postconditions is drawn from LSL traits.
Through based on clauses, a Larch interface links to LSL traits by specifying a
correspondence between programming-language speciﬁc types and LSL sorts. An
object has a type and a value that ranges over terms of the corresponding sort.

The Larch Shared Language is presented in Section 3.2. The Larch interface
language is described in Section 3.3, where Larch/CH is used as an example to
demonstrate the concept of the interface language. Section 3.4 and Section 3.5 present
the speciﬁcations for component and method, respectively. The distinction between
subclass and subtype is discussed in Section 3.6. Finally, the generality relationship

between a pair of components is deﬁned in Section 3.7.

 

17

3.2 Larch Shared Language

We use the interface languages to specify reusable program components (e.g. C++
classes).i Since each LIL deals with specifying the behavior of components observable
in a particular programming language, it provides a mechanism for writing assertions,
in terms of pre- and postconditions, about the program states; these states can be
translated into predicate calculus formulas that are the speciﬁcations used for reusable
components in our system [42, 43, 44]. It incorporates programming-languagespeciﬁc
notations for constructs such as side effects, exception handling, and iterators. Several
programming languages have been speciﬁed by Larch interface languages, such as
C [45], Modula-a [37], and CLU [40]. been chosen as such role

The Larch family of speciﬁcation languages supports many useful features for
designing modular and reusable systems. Many language-independent abstractions
are useful in a wide variety of speciﬁcations, for example, integers, lists, sets, queues,
arrays, relations, mappings, and orders. Larch facilitates the accumulation of open-
ended collections of reusable speciﬁcation components in the form of LSL handbooks.
Based on the set of reusable speciﬁcations, larger reusable speciﬁcation components
can be built. Data and functional abstractions play an important role in the program
design of Larch’s speciﬁcations from which modular programming components are
relatively easily constructed. In LSL, all the data types and terms appearing in inter-
face languages are explicitly deﬁned to ensure that the understanding of programmers
matches the speciﬁer’s. This feature is exploited in the reuse system when determin-
ing the candidate reusable components derived from the deﬁned terms in LSL library.
The rest of this section presents an overview of the traits in the LSL. If the reader is
familiar with LSL then the remainder of this section may be skipped.

The trait is the basic unit of speciﬁcation in the LSL. Sometimes the collection of

operations will correspond to an ADT. Frequently, however. it deﬁnes some type that is

18

not fully characterized. Figure 3.1 shows an example of a trait Table specifying a class
of tables that stores values in indexed places. It is similar to conventional algebraic
speciﬁcations. The part of the speciﬁcations following the introduces keyword gives

a list of operators, each with its signature. Each trait deﬁnes a theory in typed ﬁrst-

 

Table: trait

introduces
new: —. Tab
# E # : Ind, Tab -’ Bool
lookup: Tab, Ind: _. Val
isEmpty: Tab rightarrow 8001
size: Tab -v Card

assert Vi,i’: Ind, val : Val. t : Tab

lookup(add(t,i,val),i’ == ifi = i’ then val else lookup(t, i’)
ﬁ(i 6 new)

i6 add(t,i’,val) == i= 3" V i E t

size(new) ==

size(add(t,i,val)) == ifi E t then size(t) else lookup(t, i’)
isEmpty(t) == size(t) = 0

Figure 3.1. An Example Trait: Table.

 

order logic with equality. A theory is a set of formulas without free variables. It
contains the trait’s assertions, the conventional axioms of ﬁrst-order logic, everything
that follows from them and nothing else. Equational theories are useful, but a stronger
theory is often needed, for example, when specifying ADTs. The clause generated
by asserts that all values of a sort can be generated by a given list of operators,
thus providing a generator induction schema for the sort. This clause is useful for
Specifying constructors of ADTs. The clause partitioned by asserts that all distinct

Vallues of a sort can be distinguished by a given list of operators. Terms that are not

19

distinguishable using any of the partitioning operators of their sort are equal.

For modularity, it is often useful to include a separate trait by reference. This
convention makes it easier to reuse pieces of others speciﬁcations and handbooks. We
might add a clause following the keyword include or import to some trait. The
theory with the including (importing) trait is the theory associated with the union
of all of the introduces and asserts clauses of the trait body and the included
traits. Figure 3.2 gives two more trait examples, Container and Member. A new
clause beginning with assumes is introduced. The trait Member builds on Container
by assuming it. It constrains the new and insert operators that it inherits from
Container, as well as the operators that it introduces, E and ¢. Similarly, the trait
Set assumes the traits Container and Member. The theories associated with any

trait includes the theories of the traits that is assumes, includes, or imports.

3.3 Larch Interface Language

The Larch Interface Language (LIL) describes the effects of a program component’s
loperations. In this section, we deﬁne the speciﬁcations related to a component of
object-oriented programming languages. Leavens and Choen have designed a pre-
liminary version of Larch/CH [38]. We use similar speciﬁcations to specify program

components (classes).

Most notations of Larch/CH are taken from LCL [45] that is a Larch interface
language for the C language. The basic notations from LCL are described as follows.
The states are mappings from locs to objects, where lacs refer to the locations of

variables. Each variable identiﬁer names a loc. The major kinds of objects are

0 basic values are mathematical abstractions deﬁned by LSL.
o locs are abstractions of computer memory cells.

0 structs are collections of locs.

 

Container (E, C): trait
introduces
new: -v C
insert: C, E -+ C
asserts
C generated by new, insert

Member (E, C): trait

assumes Container

introduces
#€#: E,C—>Bool
#¢#: E,C—’Bool

asserts V c: C, e1, e2: E
-(e1 6 new)
e1 6 insert(c, e2) :2 el = e2 V e1 6 c
e1¢ c == «(e1 6 C)

implies converts 6, ¢

Set (E, C): trait

assumes Container, Member

introduces
# U #: C, C —* C
# n #: C, C -r C
# \ #1 C. C -+ C
delete: C, E: -> C
isempty: C -r 8001
size: C -> Card

asserts V c: C, e1, e2: E
C partitioned by E.
e1€(slU52)==e1651Ve1652
e16(slﬁs2)==el€sl/\e1€s2
elE(sl\sZ)==e1€sl/\el¢s2
e1 6 delete(sl, e2) == e1 E 51 A e1 # e2
isEmpty({}) = a isEmpty(insert(s1, e1))
size(m ==
size(insert(sl, e1) == if e1 6 51 then size(sl) else succ(size(sl))
s1 Q 82 == isEmpty(sl \ 32)

Figure 3.2. More Examples of Traits.

 

21

o unions are similar to structs, except that their locs overlap.
o arrays are bounded vectors of adjacent locs, indexed from 0.

s pointers are references to collections of one or more adjacent locs.

The following LCL primitives are available for accessing the initial and ﬁnal states.

o ” ”’ can be applied to locs, arrays and structs. It is used to extract their values

from the ﬁnal states. If no symbol is applied then the extracted values are from

initial states.
0 t is used to dereference a pointer, producing its locs with offset 0.

o -> is a syntactic shorthand to deference a pointer to a struct and then select

one of its members.
o [i] is used to index an array, producing a loc.

[] is applied to a pointer to cast it into an array.

Each identiﬁer’s type deﬁnes the kind of objects which may be in any state. Each
LSL value has a unique sort. There is a mapping from LCL abstract types to LSL sorts.
Each abstract data types deﬁned in LCL is based on an LSL sort. LCL speciﬁcations
are written using types and values. The properties of these values are deﬁned in LSL,
using operators on the sorts on which those types are based. The mapping between

LCL (or Larch/CH) ADTs and LSL sorts enhances the reusability of speciﬁcations

and veriﬁcations.

3.4 Component Speciﬁcation

The component construct is used as the vehicle for encapsulation and data hiding.

Objects are deﬁned in terms of components. A component is a user-deﬁned type. A

 

 

th

Cal
lug
ant

SPCI

Cons

22

 

component componentmameidentiﬁer

{ .

inherit: componentmameidentiﬁer’“

private:

members (data and methods)
protected:

members (data and methods)
public:

members (data and methods)

Figure 3.3. Basic Construct for Component Speciﬁcations

 

component description deﬁnes the characteristics of all objects declared to be in the
same component. The component construct also provides the basic unit of reusability.
It binds an underlying data type with a set of public, protected, and private methods
(functions) that allow the underlying data type to be manipulated by passing messages
to the object. The skeleton of a component speciﬁcation is shown in Figure 3.4. The
key word inherit indicates that the current component inherits the properties from
the components of the following components. The speciﬁcation of inheritance will
be discussed shortly. The speciﬁcations in the private section deﬁne the behavior of
private methods in this component. Similarly, protected and public sections specify
the corresponding methods in this component.

The objects declared to be a part of a given component may communicate with its
callers by returning a result, by accessing objects accessible to the caller, or by modify-
ing such objects. The speciﬁcation of each method in an object can be comprehended
and used without reference to the speciﬁcations of other methods. An example of
specifying several methods in the component intSet is shown in Figure 3.4, which

consists of a function prototype followed by a body speciﬁed in terms of of pre- and

 

component intSet based on S from Set (Int for E)
{
public:
insert()
ensures new(return) A return’ = {}
insert(int e)
modiﬁes (this)
requires this’ = add(this,e)

delete( int e)

modiﬁes (this)
ensures this’ = rem(this,e)

Figure 3.4. Component interfaces for intSet.

 

postconditions. The requires clause describes restrictions on the arguments, thereby
deﬁning how the caller may use the method. We interpret an omitted requires clause
‘as equivalent to “requires true.” The ensures clause places constraints on its be-
havior when it is called properly. They relate two states, the state when the method
is called, which we call preCondition, and the state when it terminates, which we
call postCondition. A requires only refers to the values in the preCondition. An
ensures clause may also refer to values in the postCondition. As in C++, this denotes
the object at which a component Operation is invoked. The reserved symbol return
is used as an implicit result to denote the object returned as a result of executing
an operation. An omitted modiﬁes clause is equivalent to the assertion modiﬁes
nothing, meaning no objects are allowed to change in value. A new(object_list)
predicate that appears in a constructor’s postcondition asserts that fresh storage is

allocated for each object listed in object-list. The values speciﬁed in the members

:24

of private segment are not allowed to be changed by the callers. It says that the

method must not change the value of any objects visible to the callers in the private

segment.

3.5 Method Speciﬁcation

The expressions used in method of components are based on ﬁrst-order predicate
logic. Figure 3.5 gives the grammar of the speciﬁcation of expressions in methods.
In this grammar, symbols expressed in the Roman font represent terminals, italicized
symbols represent terminals, bold-faced symbols denote keywords, the Kleene star
(*) denotes zero or more repetitions of the preceding unit, and parentheses (‘0’)
indicate groupings. The symbol “::” separates an identiﬁer from a description of the
value denoted by the identiﬁer, and the symbol “z” separates identiﬁer declarations
from a description of the type associated with the identiﬁer. The operators obey
the following decreasing precedence order: negation (a), conjunction (A), disjunction
(V), implication (=>), and if and only if (4:). Primitive types, e.g. Bool, Int, and

Real, are predeﬁned in Larch traits and can be referenced by the users.

3.6 Subtypes and Subclass

The features that most distinguish ob ject-oriented programming languages from those
only supporting data abstraction are message passing and inheritance. In this disser-
tation, only inheritance speciﬁcation will be addressed since speciﬁcation for message
passing is beyond the scope of this dissertation. A useful abstraction in an object-
oriented programming language method is the use of supertypes as abstractions of
their subtypes [46]. Because of the complex inheritance mechanism of C++, the sub-

typing relationship is abandoned in Larch/CH [38]. Since it is an interesting issue

 

met hod method .name( ( variable: type_name) *)

requires: expression

modiﬁes: expression

ensures: expression
expression = true

| false

I ( expression )

I -. expression

| expression A expression

| expression V expression
| expression => expression
| expression 4: expression
| ( V variable : type :: expression )
| ( 3 variable : type :: expression )
| predicatemame [( term (, term)*)]
I

def .
term 2 expression

term = variable
I functionmame [( term (, term)‘ )]

Figure 3.5. Grammar for Component Methods.

 

when we attempt to deﬁne speciﬁcations of classes in an ob ject-oriented programming
language, This section presents a discussion on the topic of subtyping [38, 47].

A type, that is, an 1101', is a behavioral notion and may be implemented by many
different classes. A class is a program module that implements an AD‘l‘. A subtype
is also an AD’I‘, each of whose objects behave in a way similar to some objects of its
supertypes. A subclass is an implementation that is derived by inheritance from its
superclass. In contrast, a subtype represents a behavioral relationship. When a sub-
type is derived from some supertype, the object’s behavior with this subtype can be
veriﬁed by the objects with its supertype instead of reverifying this subtype, whereas
a subclass relationship is a purely implementation relationship. The generality rela-
tionship in our system [42, 44] is similar to the supertype-subtype relationship.

The C++ type system does not distinguish completely between subtypes and sub-

‘26

classes. Each class name is the name of a type for type checking purposes. The only
way to declare S as a subtype of T is to declare class 8 as a public subclass of T. S
and T may have an implementation relationship but have no subtype relationship.
Conversely, if S and T have a subtype relationship then they must have a subclass
relationship. Because the designers of Larch/0H tried to make Larch/C++ match
C++, there is no type keyword in the class speciﬁcation of Larch/C++. Therefore,
S and T have a subclass relationship in C++ but they may not have the subtype re-
lationship at the speciﬁcation level. For example, in the Borland C++ library, class
Set is a subclass of class Hash Table (See Figure 3.6) but we would not intuitively
consider HashTable to be a supertype of Set at the speciﬁcation level. In the C++

language, the subclass relationship is implementation-speciﬁc and cannot represent

the true supertype-subtype relationship.

 

 

 

 

 

 

 

Figure 3.6. The Borland C++ Subclass Relationship.

 

._.
"‘Tt
ital.

I'm

sea

era
on
51*
:io

mC

p—- u
(I)
A

27

There are many ways to interpret what inheritance of speciﬁcations means for a
class speciﬁcation. The goal of the semantics of inheritance should be to deﬁne the
interface of a class that implements the speciﬁcation and to describe the effect of each
deﬁned message sent to objects of that type or its subtype. This goal can be satisﬁed
by overloading the trait functions to interpret the inherited member functions, but
this kind of overloading is only meaningful for subtypes that are public subclasses.
The other goal is to construct a speciﬁcation for a class that uses no speciﬁcation in-
heritance, thus reducing the speciﬁcation problem to an already speciﬁed component.
This goal can be achieved by in-line expansion of the inherited speciﬁcation, and then
reinterpret the trait function in the speciﬁcation by the used traits of the subclass.

Regarding the speciﬁcations of the semantics of multiple inheritance, current re-
search has not completely solved this problem so far. We say conﬂict happens when a
class inherits two methods with the same name from different classes. There are sev-
eral ways to eliminate the conflicts. The user is forced to give an explicit speciﬁcation,
overriding all other speciﬁcations. Or the user may choose the most “representative”
speciﬁcation. Finally, we can use some scheme of composing the inherited speciﬁca-
tions. for example, taking the disjunction of the preconditions, the intersection of the
modiﬁes clauses, or the conjunction of postconditions. It is not clear what approach
is appropriate for reusing the inherited speciﬁcations. Currently, the third alternative
is used as the semantics of multiple inheritance.

As for the deﬁnition of subclass, two classes A and B are used to explain the
concept of subclass. Ideally, a subclass should be designed to implement a subtype.
If r is an operation of A and B is a subclass of A, then the precondition of r in
the speciﬁcation of B may be no stronger than the precondition of r in A, and the
postcondition of r in the speciﬁcation of B must be no weaker than the postcondition
of r in A. This rule ensures that the implementation of an operation in subclass B

satisﬁes the speciﬁcation of that operation in the superclass A. However, this case

IQ

of If

28

does not hold in C++ because, in C++, an operation in subclass B may not exist in the
superclass A. Therefore, the subtype relationship is changed to a new relationship

called generality that is deﬁned in next section in order to match the C++ subclass

relationship.

3.7 Deﬁning Generality

Chapter 4 will describe the construction of the lower-level hierarchy that serves to clas-
sify a set of software components according to the subsumption relationship between
reusable components. In simple terms, component A is said to subsume component
B if A is more general than B, denoted by A 230m, B. A new resolution rule is
described that increases the range of candidates, as compared to the number of exact
matches, that can be retrieved using automated reasoning techniques.

We use LSL traits as the basic reusable units in specifying program components.
All the terms (including predicates and functions) are assumed to be well deﬁned in
the Larch trait library. Therefore, reusing traits drawn from a generally accessible
handbook will serve to standardize the notation. A Larch trait browser [48] is being
developed to facilitate the user’s search for suitable reusable components or the con-
struction of a query speciﬁcation. Figure 3.7 shows the relations among some traits

with respect to a new relation 2“,", which is deﬁned as follows:

Deﬁnition 3.1 Let the relationships of assumption, inclusion or importation among
traits be called the relationships of inheritance. Assume S and T are two traits in

the Larch library: 5 2,"... T if S has the reﬂexive and transitive closure of inheritance
with respect to T.
In Figure 3.7, a trait inherits the properties of its parent traits. From the deﬁnition

of the relation 2,..." a partial ordering is imposed upon the basic traits of the LSL

library. This partial ordering can be applied to the subsumption test algorithm,

inI
be?
Ira

orde

Ineﬁ
Pone
fls
Pau(
1111101
Condj
C11,,
”Hills
in

0135383

 

 

 

Figure 3.7. Relations Among the Traits

 

in the construction and retrieval processes, to determine the generality relationship
between every pair of speciﬁcation components that are built from the set of LSL
traits. Figure 3.8 shows a modiﬁed subsumption test algorithm based on the partial
ordering 2......

Component A is more general than component B (A 2mm, 3) if, for every
method (operation) f in component A, there exists at least one method f’ in com-
ponent B such that f is more general. than f’, denoted by f 2mm»: f’. Method
f is said to be more general than another method f’ if pre( f’ ) 2mm. pre( f) and
post( f) 2d...“ post( f’), where pre( f) and post( f) represent the pre- and postcon-
ditions of f. pre(f') 24“,, pre( f) (post( f) 2d,“, post(f’)) means that the pre-
condition(postcondition) of f ( f’ ) subsumes the precondition(postcondition) of f' ( f)
Chapter 4 gives an algorithm that builds the lower-level hierarchy based on the gen-
erality relationship between components (gimp).

An ADT, is a behavioral notion and may be implemented by many different

classes. A class is a program module that implements an ADT. A subtype is also

»—i

Cl

10 [h

30

 

The input is the set of expressions W. The output is either an empty set or the empty
clause, represented as D.

1. L813 W ={"L10’, . . . , “Lm0’}.
2. Set I: = 0 and 20 = {A}, where A is the original set of clauses.

3. If Z" contains Cl, terminate; A subsumes B, denoted as A 2cm“, B.

Otherwise
let 2"“ = {Resolvents of C1 and C2 w.r.t. 2m", [[01 E Z“ and C; E W}.

4. If 2““ is empty, terminate; A does not subsume B. Otherwise, set k = k + 1
and go to (3).

Figure 3.8. Subsumption Test Algorithm based on 2......

 

an ADT, each of whose objects behave in a way similar to objects of its supertypes.
A subclass is an implementation that is derived by inheritance from its superclass.
A subtype represents a behavioral relationship. When a subtype is derived from
some supertype, the object’s behavior with this subtype can be veriﬁed according
to the objects with its supertype instead of reverifying this subtype. In contrast, a
subclass relationship is a purely implementation relationship. In the C++ language,
the subclass relationship is implementation-speciﬁc and cannot represent the true
supertype-subtype relationship. The generality relationship in our system is similar

to the supertype-subtype relationship.

OIEa
scri't
Cons
and

Paem

com;

4.1

The 0

01 1911
Search
illlCab]
51...?

10 the

CHAPTER 4

Construction of Hierarchical

Component Library

This chapter presents an approach, based on formal methods, to the classiﬁcation and
organization of reusable software components. From a set of reusable components de-
scribed by formal speciﬁcations, a two-tiered hierarchy of software components is
constructed. The formal speciﬁcations represent software that has been implemented
and veriﬁed for correctness. The hierarchical organization of the software compo-
nent speciﬁcations provides a means for storing, browsing, and retrieving reusable

components that are amenable to automation.

4.1 Lower-Level Hierarchy

The objective of the construction process is to construct a hierarchical organization
of reusable components that will provide a fast means for browsing, retrieving, and
searching of software components exploiting the automated reasoning techniques ap-
plicable to formal speciﬁcations. The lower-level hierarchy provides a means for a
ﬁne-grained, precise determination of reuse, where logical reasoning can be applied

to the speciﬁcations.

31

tot

rt‘SC

32

4. 1 . 1 Determining Generality

Chang and Lee’s subsumption test algorithm [49] is used to decide the subsumption
relationship between clauses, that is, whether clause A subsumes clause B, denoted
by A 2d,“, B. We exploit the traditional resolution strategy [50] to compute the

resolvents of two clauses, say 01 and C; (Figure 4.1). The full resolution rule involves

 

Cl: L,K1,...,K,,
Cg: qL, M1, ..., Mm

resolvent: K1, ..., Kn, M1, ..., Mm.

Figure 4.1. Resolution Rule

 

an instantiation of the formulas by a substitution 0', which is a mapping of variables
to terms. Application of this substitution must result in the same atoms for both

resolution literals [51]. In Figure 4.2, if a is the most general substitution that makes

 

clause 1: L, K1,...,K,,
clause 2: -L’, M1, ...,Mm Lo = L’o

 

Figure 4.2. Full Resolution Rule

 

the atoms L and L’ equal, then it is the most general uniﬁer for the two expressions,

where the resolvent is obtained by instantiating the remaining variables of clauses 1

”Mb

of [WC

4.1.2

8%

“Wm

33

and 2. Atom L is said to be congruent to atom L', denoted by L 2:: L', when both
L and L' are in an equivalence class partition eq-class that may be deﬁned by the
user or the system (see Section 4.2.1 for further details). Following the approach of
the resolution rule, if a congruity relationship exists between L and L’, then L and
HL’ can be eliminated in order to obtain a c-resolvent, a resolvent with respect to the
congruity relationship. As a result, a modiﬁed resolution rule given in Figure 4.3 is

derived, where o is a substitution that maps variables to terms. Using the modiﬁed

 

Cl: L,1{1,...,1{n
Cg: "L’,M1,...,Mm L0 2 L’U

 

resolvent: K10, ..., Kno, M10, ..., Mmo.

Figure 4.3. Modiﬁed Resolution Rule

 

resolution rule, the subsumption test algorithm [49] is modiﬁed to ﬁnd the c-resolvent
of two clauses Cl and 02 rather than their resolvent. The modiﬁed subsumption test
(MST) algorithm can be applied to every pair of methods of the two components being
compared in order to determine the generality relationship between two components.
The MST algorithm between two sets of methods is shown in Figure 4.4, where
methodsA (methodsg) is the set containing the methods of the component CompA

(Comps). The cardinality of methodsA (methodsg) is m (n).

4.1.2 Building Lower-Level Hierarchy

Based on Algorithm 1, the generality relationship can be determined between any pair

of components in order to build the lower-level hierarchy. The straightforward ap-

pro
bet]
Hon
OIde
ﬁnd

to Cc

34

 

Algorithm 1 More_General-Component
Input: Two sets methodsA = {A1, A2, ..., Am} and methodsg = {81, B2, ..., Bu}.

Output: The generality relationship between components Camp, and Comps.

Procedure:
begin
ﬁnd «— true;
while methodsA 74 {} and ﬁnd = true do
select some A.- 6 methods A;
methodsA «— method-9A \ At;
setg «— methodsg;
ﬁnd «— false;
while sets # {} and ﬁnd = false do
select some B, 6 sets;
setg «— setg \ B);
if Ai amethod Bj
then ﬁnd «— true;
endwhile;
endwhile;
if ﬁnd = false
then return(“-‘(C'0mm geomp C 0771113) ”)i
2182 TCt‘llTﬂ(‘COTnPA acorn}: Campy ’7;
end.

Figure 4.4. Using MST to decide the generality relationship between components
CompA and Comps.

 

proach is to construct the lower-level hierarchy by performing a pair-wise comparison
between all components. The pairowise comparison algorithm is shown in Figure 4.5.
However, the transitivity property of the generality relationship can be exploited in
order to reduce the execution time of building the lower-level hierarchy. If A _:_J B
and B Q C then the relation A Q C is automatically established without having
to compare components A and C. A few deﬁnitions are given before presenting the

improved algorithm. For some set of lattices (SOL) III, the set of top nodes in \I!

35

 

Algorithm 2 Pairwise.Comparison
Input: A set of components SET = {C1, Cg, ..., Cu}.
Output: A hierarchy of components based on the generality relationship.

Procedure:
begin
while SET # {}

select some component C.- 6 SET;
SET 0—- SET \ 0;;
set «— SET;
while set 96 {}
select some C,- 6 set;
set «— set \ 0,;
if C,- Qm, C,-
/"r More-General.Component algorithm will be used to
compare C,- and C,- */
then make C,- a parent of C 1
else if C; 2mm, C;
then make C, a parent of C,-

endif
endif
endwhile;
endwhile;
end.

Figure 4.5. Building the lower-level hierarchy by pair-wise comparison.

 

36

is denoted by T op(\Il) and the set of bottom nodes by Bottom(\II). If node a has
no parent nodes in the SOL \II, then a E Top(\Il). Similarly, if a has no children
nodes in the SOL ‘11, then a E Bottom(\P). The internal nodes in \II are deﬁned
as I nternal(\II) = \II \ (Top(IIl) U Bottom(\II)), where ‘\’ represents set subtraction.
For some node a 6 \II, the set of parent nodes of a is denoted by parent(a) and
the set of children nodes by child(a). The set of the descendants of a, denoted by

descendant(a), is deﬁned as follows:
ﬂ 6 descendant(a) 4: ((ﬂ 6 child(a)) V (37 : 7 6 child(a) : ﬂ 6 descendant(7))

The set of the ancestors of a, denoted by ancestor(a), has a similar deﬁnition. A par-
allel algorithm to build the lower-level hierarchy based on recursive comparisons and
the generality relationship is given in Figure 4.6. A pictorial representation of an ex-
ample construction of the lower-level hierarchy by procedure Recursive_Comparison
is shown in Figure 4.7, where dashed lines represent the application of the procedure
Recursive.Comparison, solid lines represent the generality relationship, and the dot-
ted lines encapsulate SOLs. Initially, the example contains eight SOLs and each SOL
contains only one component. These eight SOLs are merged into one SOL after ap-
plying the two procedures Compare and Merge. Compare(\II,-, ‘11,) determines the
generality relationship between nodes in SOLs III,- and W,- by using a recursive ap-
proach. For example, if some node a is more general than some top node 19 of III,
then it is not necessary to compare a with the descendants of ,6. However, if some
top node ,3 is more general than a then the comparison between a and the descen-
dants of ,6 is required. The same reasoning can be applied to the comparison between
a and the bottom nodes of the SOL \II. The procedure M erge(\I’,~, \Ilj) “connects”
the newly generated generality relationship between SOLs ‘1',- and W,- to form a new

SOL. Recursive.Comparison can be implemented as a parallel algorithm since the

Al

Cu
Pr:

37

 

Algorithm 3 Recursive-Comparison

Input: A set {\Po,\P1,...,\II,,-1}, where W,- represents a set of lattices and assume
it = 2'”.
Initially, \I': = {C,-} where C,- is a component.

Output: \Ilo contains a hierarchy of components based on the generality relationship.

Procedure:
begin
for i := 0 to m-I do
(I ‘__ 2:".
do all III), where 0 S k S 2'” — 1 /* Parallel execution of all iterations */
if 1: mod 2"H = 0
then
Compare(\I'k, \Pk+d);
\Pk «— Merge(\llk, \I’Hd);
endif;
end.do.all;
endfor;
end.

Figure 4.6. Building lower-level hierarchy by recursive comparison.

 

comparisons between the SOLs are independent of each other. Only the nodes in Top
and Bottom sets are compared in the procedure Compare.

Applying algorithm Compare(SOLA,SOL3) to two SOLs SOLA and SOLB is
illustrated in Figure 4.8. For discussion purposes, attention is focused on the top
node E in SOLA and the bottom node F in SOLE. If F 2 E, then make node F
a parent of node E since all nodes in ancestor(F) U {F} must subsume the nodes
in descendant(E) U {E}. However, if E 2 F, then node E needs to be compared
with the nodes in ancestor(F) and node F needs to be compared with the nodes
in descendant(E) in order to obtain complete generality relationships. Using the

recursive method to build the lower-level hierarchy may reduce the computational

38

 

  

o ...a"’/.

K ',
~7“u-.--..-...unm
\
I, \
I \

 

Figure 4.7. Example of building hierarchy by recursive comparison.

 

time of construction since the comparisons of the internal nodes in the SOL can be

eliminated.

4.2 Higher-Level Hierarchy

After applying the MST, the software components may be grouped into disjoint clus-
ters in a set of graphs (ASG'). In order to form a connected hierarchy of software
components, a conventional clustering algorithm [52] is applied to the most general

components obtained from the MST, that is, the roots of trees and the top elements

 

 

Figure 4.8. Example of comparing two SOLs.

 

of the lattices in ASG.

Classiﬁcation by clustering techniques has been used in many areas of research,
including information retrieval and image processing [53]. Typically, the objective
.of clustering is to form a set of clusters such that the intercluster similarity is low,
and the intracluster similarity is high. Applying a clustering algorithm to the most
general components of the lower-level hierarchy leads to the generation of the higher-
level hierarchy of the component library. The similarity between two components X
and X ' , denoted by s(X, X’), is used as the basic criterion to determine clusters. In

general, the criterion used to evaluate similarity determines the shape of the resultant

clusters.

4.2.1 Measure of Similarity between Components

In this section, a simple evaluation method for computing similarity is given. The

similarity between a pair of components, X and Y, is denoted by s(X, Y). Similarity

'lhe (

Where

Th.
Order a
leprese
inequal
”SOCia:
9’60tgr

The equ
The

40

is symmetric, thus for any two components, X and Y:
s(X, Y) = s(Y, X).

In addition, similarity s is said to be normalized if 0 S s(X, Y) S 1. Each predicate
formula in the library expressed in DN F represents a software component and is
regarded as one of the input objects that are to be classiﬁed by the hierarchical
clustering algorithm.

If s(X, Y) represents the similarity between two software components X and Y,

then it is assumed that X and Y are of the following forms.

X = xIngv...me and

Y = yIVygv...Vy,,.
The disjuncts x,- and y,- are deﬁned in terms of conjuncts, that is,

x.- = p;,/\p;,/\...Ap.~u._, lgigm and

yt = q,-,/\q,-,A...Aq,-vi,1<ign,

where u; and v,- are the number of conjuncts within disjunct x,- and y,-, respectively.

The disjuncts of each object are ordered from left to right in a nondecreasing
order according to the number of conjuncts in each disjunct. Given that u,- and v,-
represent the number of conjuncts for disjuncts x,- and y,-, respectively, the following
inequalities are true: u,-_1 S u,- and v,-_1 S v,-, for all i. Moreover, each conjunct pg, is
associated with an equivalence class eq.class. For example, if p:, = greater(a, b) and
greater is in the equivalence class for comparison, then eq.class(p,-,,) = comparison.
The equivalence classes may be speciﬁed by the users or be system-deﬁned.

The number of equivalence classes in a software component library is assumed

from

Slf‘dCl

Where
The si
is cgc
Win.

0“ 53's

41

to be a known value, say T. Using the above deﬁnitions, a matrix me(1~+1) is
constructed for every component X. The matrix me(T+1) derived from component
X has m rows and T+l columns. X (i, j ) represents the entry in row i and column

j. Row i represents the i“ disjunct of X as follows, where there are u,- conjuncts in

disjunct x,-.

X(i,0) = u,-, 05ism—land

X(i,j) = l, x,- has 1 terms in eq_class j.

Similarly, for component Y containing v,- conjuncts in disjunct y,-, the corresponding

matrix is deﬁned by

Y(i,0) = v,-, OSiSn-land

Y(i,j) = l, y,- has 1 terms in eq.class j.

From the derived matrices me(1+1) and Ynx(T+l): the similarity matrix 51“,, is con-

structed. The following expression deﬁnes Sian'

for all i,j if X(i,0) = Y(j,0)
,1, 2 * min(X(i, t), Y( j, t))
ELKXUJ) + YUM)
else s'(i,j) = 0 (4.1)

 

then s'(i,j) =

where s’(i, j) is the similarity of the i“ disjunct of X and the j‘h disjunct of Y.
The similarity between two conjunctive expressions from two software components
is calculated according to the minimum number of common occurrences of a given
equivalence class. Since the results from the clustering process are purely based

on syntactic similarities, only the disjuncts with the same number of conjuncts are

{II

(I)

42

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

0 l 2 3 4 5

0 l 2 3 4 5 l I 0 0 I 0 0

l 2 1 1 0 0 0 2 2 0 1 1 0 0

2 3 0 1 2 0 0 3 3 0 0 2 0 1

3 3 0 0 I I I 4 3 0 I 0 0 2
(a) Matrix for X (b) Matrix for Y

Figure 4.9. Matrices for components X and Y.

 

selected for comparison. The semantic similarities are used in the construction of
the lower-level hierarchy. Assume N is the number of nonzero entries in sin“. The

similarity between software components X and Y is calculated as follows:

:21 Z?=l $’(i,j)
N .

 

s(X, Y) = (4.2)

Here, s(X, Y) is a normalized similarity since 0 _<_ s( X, Y) S l. The following example

is presented for clariﬁcation purposes.

Example 4.1 Suppose the similarity of two components X and Y is to be computed,
where both speciﬁcations are in DNF. Let X = (01 A C2) V (C2 A C3 A C3) V (0;, A
C4 A Cs) and Y = (C3) V (Cg A Ca) V (0;; A C3 A C5) V (C; A Cs A C5), where C.- refers
to the term that corresponds to the i“ equivalence class. There are 5 equivalence
classes in this case, so T = 5. X has 3 disjuncts and Y has 4 disjuncts. The
corresponding matrices for X and Y are shown in Figure 4.9a and 4.96, where the
vertical axis represents the disjuncts in each component and the horizontal axis refers
to the equivalence classes. From Formula (4.1), the similarity matrix s’(X, Y) can
be computed yielding results shown in Figure 4.9c, where the vertical axis represents

the disjuncts in component X and the horizontal axis refers to the disjuncts in the Y

In,

U6:

{ind

43

 

 

 

 

 

1 2 3 4
1 0 2/4 0 0
2 o 0 4/6 2/6
3 o 0 4/6 3/6

 

 

 

 

 

Figure 4.10. Similarity matrix s’(X, Y)

 

2/4+4/6+2/6+4[6+2@ __ _1_ is
5 _

N

component. From Formula (4.2), the similarity s(X, Y) =

obtained. This value is used as input to the clustering algorithm when determining

which software components should be merged into one cluster.

A similarity value automatically determines the distance between two components.
That is, the distance between two components X and Y may be deﬁned as d(X, Y) =
1 - s(X, Y). The distance between two components within a cluster should be small
and distance between two clusters should be large. However, a distance (computed
from similarity) does not necessarily satisfy the triangle inequality, and therefore is

unable to form a metric space [14].

4.2.2 Hierarchical Clustering

Input to a clustering algorithm is a set of components and the similarity val-
ues between each pair of components. A ﬁnite set of components is denoted by
X = {x1,xg,...,x,,}. Output from the clustering algorithm is a partition I‘ =

{01,632, ...,GN}, where Gk, k = 1, ..., N is a subset of X such that
GlUGgU...UGN=X, Vl,lc,l;ék,GmGk=0, (4.3)

and GI, Gg, ..., GN are the clusters of I‘.

44

The relationship between the partition of clusters generated from the intermediate

stages of reﬁnement, denoted by I“, i = l, ..., K , is expressed as follows:
I“ = {G;,...,G;',,,}, W: {G{,..., ’51,}, i: 1,...K, i <j < K+ 1, (4.4)

where for all I, N; Z N, and N is the ﬁnal number of partitions. I" is a reﬁnement
of I“, i < j, that is, for any member subset G}; E I“, there exists G? 6 I‘5 such that
G}; Q G{ . Such groups formed by intermediate partitions yield a hierarchy of clusters.
A method for generating such a hierarchy is termed hierarchical clustering [52].

In general, hierarchical clustering algorithms are divided into two categories: di-
visive algorithms and agglomerative algorithms. A divisive algorithm starts with the
set X and divides it into a partition F" = {05, ...,G’ﬁk}, then each cluster G,“ is
subdivided to form a ﬁner partition PK '1, and so on. An agglomerative algorithm
initially regards each component as a single cluster: I‘1 = {{xl}, {x2}, ..., {and}. The
clusters are merged into a coarser partition I", and the merging process continues
until the trivial partition I‘K = {X} is obtained. Thus an agglomerative clustering
algorithm generates a sequence of partitions I‘1 —+ I" -+ ...I‘K that is ordered from
a ﬁner partition to a coarser one. This algorithm can be stopped at any partition
I",l S l S K, if the maximum value of computed similarities is below a speciﬁed
threshold or if the number of clusters generated for a partition is equal to a user-
speciﬁed or system-defined value.

In most agglomerative algorithms, only one pair of clusters is merged at a time.
Hence if I”i = {Gj,...,G’}vi} and I".+1 = {G§+1,...,G)G;1I}, then N,“ = N; - 1. That
is, N; = n — i + 1,i = 1,...,n and I‘1 = {{xl},{xg},...,{x,,}},I‘N = {X}. Figure
4.11 gives a pictorial representation of the reﬁnement process. Similarity between
clusters is used as the criterion for the selection of a pair of clusters in I“ that are

to be merged. A pair of clusters (GWG'Q) is selected to be merged if it has the

45

 

 

 

 

,. O...OO

6' O G“ 4 O".

 

6: on“: ON

r... O "MOO

/\ l
.. oo- - co

0| G: a GM, -l on,

.. @- --000

0i

 

 

 

 

 

 

 

 

 

 

 

 

an, .1 0N,-i on,

Figure 4.11. Reﬁnement of partitions in an agglomerative clustering algorithm

 

maximum value of similarity among all pairs of clusters. Let the current partition
be P = {G1,...,GN}. The similarity value between two clusters is the maximum
value of all similarities calculated between disjuncts from the respective components.

Formally, the sim relationship is expressed as

s(X, Y)).

sim(G,,,Gq) = max ( max

An agglomerative procedure is given in Figure 4.12. The similarities between the
new cluster and other clusters are computed as follows: if G, and G, are merged into

a new cluster G,, then

Vi, sim(G,.,G'.-) = min(sim(Gp,G,),sim(Gq, 0.)).

he

,8:

46

 

Algorithm 4 Agglomerative Clustering
Input: A set of disjoint lattices.
Output: A uniﬁed cluster.

Procedure:
1. Let each root of a tree or the top element of a lattice of the ASG be an
initial cluster consisting of the single element.

2. Find the pair of clusters that has the maximum value of similarity among
all pairs of clusters.

3. Merge the pair of clusters found in step 2 into a new cluster.

4. If there is only one cluster remaining, then stop. Otherwise, update simi-
larity values between clusters; go back to step 2.

Figure 4.12. Agglomerative Hierarchical Clustering Algorithm

 

4.2.3 Hierarchical Clustering Algorithm

The hierarchical clustering algorithm used is similar to Kruskal’s algorithm for ﬁnd—
ing a minimal spanning tree [54], which always chooses an edge with the least weight
in the construction of the spanning tree. In this case, weights are replaced by simi-
larity values for software components and the maximal weight rather than the least
weight is sought. After applying the algorithm, a tree-like hierarchical clustering is
obtained. Figure 4.13 contains the detailed description of the hierarchical clustering
algorithm where X is the set of predicate components, s(X,, X5) is the similarity be-
tween components X; and X j, and sim(Gk, G") is the similarity between clusters G].
and 0;. The algorithm begins by creating a cluster for each software component to
be classiﬁed, that is, the most general components found in the lower-level hierarchy,
and the ﬁrst partition contains all of the initial clusters. Next, a pairwise calculation

of similarity between the clusters is made. Based on the similarity values, two clus-

47

ters yielding the greatest value are selected to be merged. After the two clusters are
merged, the similarity values between clusters is updated, thus deﬁning the partition
for the next iteration of the clustering algorithm. The user may specify an upper
bound on the number of iterations (reﬁnements) or stop the clustering algorithm
‘ while viewing the clustering process. This ﬂexibility allows the user to incorporate
background experience in order to determine when further reﬁnements will fail to
yield substantial changes between partitions. The ﬁnal hierarchically organized li-
brary could be of the form given in Figure 4.14, where ﬁlled nodes, termed real nodes,
represent software components and unﬁlled nodes are newly generated nodes created
by the hierarchical clustering algorithm, called meta-nodes. A meta-node acts as a
container for the software components from it which it was derived. Dashed lines
represent relationships formed by the MST algorithm and the solid lines are formed

by the hierarchical clustering algorithm representing similarity relationships.

4.3 Implementation

In order to facilitate the user’s involvement in determining reuse based on formal
speciﬁcations, the construction of the software component hierarchy has been imple-
mented in the framework of a graphical browser. This section discusses the imple-
mentation framework and describes an example construction of a hierarchy from a

set of software component speciﬁcations.

4.3.1 Browsing Hierarchy

A prototype system for constructing the hierarchical library has been implemented
in the Quintus ProWindows language ;‘ a dialect of Prolog that supports the object-

oriented organization of graphical elements. There are several advantages to using

 

'A product of Quintus Computer Systems, Inc.

48

 

Algorithm 5 Hierarchical Agglomerative Algorithm
Input: The set X = {x1,x2,...,x,,} and the similarities s(x,,x,-), 1 S i,j S n.

Output: one or more clusters.

Procedure:
begin
N = n;
for i = 1,...,N do
G.“ = {1;}

endfor;
F] = {61,015 .... GN};
Limit = I;

for 1 Si,j g ;\’,i;éj do
sim(G.-,G,) = s(x,,x,-)
endfor; /* Initialization */

/* If there is more than one cluster then iterate, otherwise stop. */
while (N > Limit) do
N = N - l;
/* Select the pair of clusters to be merged */
find a pair of clusters G, and G, such that
sim(G,,, Gq) = maxa,,ajepn_,,,,-¢,- sim(G.-, Gj)
= mach.G,EFn-N.i¢j maxrereG,‘ 3(zi 31);
G, = G, U 0,;
Fn-N+l = (Fn-N - {Gpqu}) U {Gr}
/* Update the similarity values */
for all G.- 6 Ilka/+1, G.- # G“ do
calculate sim(G,, G.) = max,ea,,yeg,. s(x, y)
endfor;
Limit = query.user.for.number.of.clusters;
/* Query user for a limit on the number of generated clusters*/
endwhile;
return I‘MNH
end.

Figure 4.13. Hierarchical Clustering Algorithm

 

49

 

 

Figure 4.14. Two-tiered hierarchy formed by the subsumption test and the clustering
algorithms

 

Prolog, and speciﬁcally ProWindows, as the implementation language. First, Pro-
log’s declarative properties facilitate the handling of ﬁrst-order predicate logic spec-
iﬁcations and the application of automated reasoning capabilities. Second, Prolog’s
procedural properties facilitate the implementation of backtracking algorithms such
as those used in the search for reusable components. Third, the support for graphics
in ProWindows using Prolog predicates facilitates a homogeneous implementation of
a system for handling the construction, searching, and browsing of a software compo-
nent library. Speciﬁcally,ProWindows provides direct support for high-level features

of the user interface such as dialog boxes, scrolling menus of sorted items, and term

editing windows.

4.3.2 Implementation of the Construction of Hierarchy

Figures 4.15 and 4.16 show screen dumps of sample applications of the subsump-

tion test algorithm and the hierarchical clustering algorithm, respectively, to a

50

set of software components. Originally, there were ﬁfteen components in the li-
brary speciﬁed in the format described in Chapter 3. A larger example will be
described in Chapter 8. Selecting the Clustering option shown in the pop-up
menu in Figure 4.15 will execute the subsumption test algorithm, which causes
the four subwindows shown in Figure 4.15 to be displayed, where each window

contains one cluster. In the example, component DoubleQueue is a child of

re loose but!”

@u-WDCEc—mm c——3....Q....
@ﬁumsdon for!) Wbﬁfubdn) (Two-Tiered whereby) ( Um Search) (Hierarchical Search) ( Comput- Analog)

El

F'""'j["“"‘”° ﬂ alum ” um- “ L“... H and lLeJom H «Janus J

_ J

 

 

 

 

  

 

 

 

 

 

 

 

 

Figure 4.15. Sample application of subsumption test algorithm

 

.. A . ___.i .;;l

«1 [ON]

 

 

ll

51

 

louse lanes 38

GE) (15) @ CM") 0"" QM) OWE (5770 (WE
(Se—bum Test) (Watchful ClusbrlnD (Two-Tiered "£0th ( Unsv Search) (Hhmchlcal South) ( Compute Ailey)

ET

 

 

I“! um:

 

 

 

 

 

Figure 4.16. Sample application of clustering algorithm

 

the component Queue, where DoubleQueue and Queue represent the classes (or
types) DoubleQueue and Queue, respectively. Figure 4.17 gives the speciﬁcation of
Queue, which contains ﬁve methods: Queue, ~Queue, Queue: :appendAtEnd, and
Queue: :clear. The ﬁrst two methods Queue and ~Queue represent the construc-
tor and destructor of the component Queue, respectively. Queue: :appendAtEnd ap-

pends an element to Queue. Queue: :clear removes all objects that Queue con-

52

tains. Similarly, the speciﬁcation of DoubleQueue is given in Figure 4.18, which

 

component Queue
abstract type Queue;
public
void Queue(Queue It-queue) {
modiﬁes tqueue;
ensures (*queue)’ - NULL.Queue;
}
void ~Queue(Queue IInqueue) {
modiﬁes Iliqueue;
ensures trashedhqueue) ;
}
void Queue::appendAtEnd(Queue tqueue. Objectt element) {
requires ﬁfuIICqueue);
modiﬁes I«queue;
ensures 1ast(queue’) 3' element A
butLast(queue’) 8' butLast(queue) A
}

void Queue: :clear(Queue *queue) {
modiﬁes I“queue;
ensures isEmpty(*queue)

}

Figure 4.17. Speciﬁcation of Component Queue.

 

contains six methods: DoubleQueue, ~Doub1eQueue, DoubleQueue: :appendAtI-an,
DoubleQueue: :appendAtHead and DoubleQueue::c1ear. For data types Double-
Queue and Queue, (Queue: :appendAtEnd 2mg,“ DoubleQueue: :appendAtEnd), and
(Queue: :clear Qmethod DoubleQueue: :clear). Therefore, Queue is more general
than DoubleQueue as a whole, and Queue is displayed as a parent of DoubleQueue in
the browser graphically.

After determining the generality relationships between components, the clustering

l8

53

algorithm is applied to the four groups, displayed in Figure 4.15, in order to create a
connected hierarchy in the library. At this stage, only the roots of the trees from the
output of the MST algorithm are chosen as input to the clustering algorithm. In the
example, components Table and Sequence are merged and the hierarchical clustering
algorithm yields a meta-node metal for them. Figure 4.16 shows the results of the
clustering algorithm, where the components metal, meta2, and meta3 are represented
by newly~created meta-nodes. Finally, a two-tiered hierarchy of software components
is constructed, shown in Figure 4.16, that can be used in the retrieval process. Upon
completion of the construction process, the user may choose to rename meta-nodes

(e.g. metal and meta2) to more descriptive names.

4.4 Summary

A classiﬁcation scheme of software components expressed in Larch speciﬁcations has
been presented in this chapter. We have also described algorithms for implement-
ing this scheme. The classiﬁcation algorithms, implemented in Prolog, are able to
construct a two—tiered hierarchical library from formal speciﬁcations. Thus, the hi-
erarchy can help users store, browse, and search existing reusable components. The
two-tiered hierarchy of reusable components provides a framework for the following

reuse processes: retrieval and modiﬁcation.

54

 

component DoubleQueue
abstract type DoubleQueue;
pubﬁe
void DoubleQueue(Doub1eQueue tdeque) {
modiﬁes tdeque;
ensures (*deque)’ I NULLDoubleQueue;
}
void ~Doub1eQueue(Doub1eQueue lIndeque) {
modiﬁes Itdeque;
ensures trashed(tdeque);
}
void DoubleQueue: :appendAtEndwoubleQueue Itrdeque. Objectt element) {
requires ntull(deque);
modiﬁes tdeque;
ensures 1ast(deque’) " element A
butLast(deque’) '8 butLast(deque) A
}
void DoubleQueue: :appendAtHeadwoubleQueue Itideque, Objectl element) {
requires wrul1(d.que);
modiﬁes Itideque;
ensures first(deque’) 8- element A
butFirst(deque’) -- butFirst(deque) A
}
void DoubleQueue::clear(Doub1eQueue *deque) {
modiﬁes #deque;
ensures isEmpty(sdeque)

}

Figure 4.18. Speciﬁcation of Component DoubleQueue.

 

CHAPTER 5

Search and Retrieval of Reusable

Components

In the previous chapter, we proposed a model to classify the reusable components by
forming a two-tiered hierarchy of software components that are speciﬁed in terms of
the Larch speciﬁcation language. Based upon this framework, we address the issue
of identiﬁcation and retrieval of reusable components in this chapter. We present a
_hashing scheme to provide an initial indication of components for a query. A retrieval
algorithm is described in detail in Section 5.2. The output of the retrieval algorithm

can be used as input to the modiﬁcation process that examines and modiﬁes the

extracted components for reusability.

5.1 Hashing Scheme for Software Components

In this section, the method used to retrieve software components is described. A
hashing function HF: Component —+ R is deﬁned, which maps a component to a

unique real number used in the retrieval process to reduce the search space of reusable

components.

The following auxiliary deﬁnitions are necessary for the hashing function deﬁni-

55

56

tion. For a given component C that is speciﬁed by the predicate P, the closure of P

has the following deﬁnition.

Definition 5.1 For some predicate P, let closure(P) denote the closure of P, a set
of predicates and symbols. Let pred-fune be a predicate name or a function name.
closure(P) is deﬁned as follows:

0 P E closure(P).

e ifQ = pred_func(arg1.arg2,...,argn) and Q E closure(P), then argl 6
closure(P), dry: 6 closure(P),..., and argn E closure(P).

The level of a term T with respect to predicate P is deﬁned as follows:

Definition 5.2 Let pred_func be a predicate name or a function name. The level of
some term T upon predicate P is denoted as level(P, T) and

e If T é closure(P) then level(P,T) = 0.
e If T E closure(P) and T is a variable or a constant, then level(P, T) = I.

e If T e closure(P) and T is a predicate or a function and
T = pred-func(arg1,argg, ...,argn), then level(P, T) =
max{level(P,arg1),level(P,arg2),...,level(P,arg,,)} + 1.

For a predicate P, the set of variables X:, i > 0, contains the variables in
closure(P) with the i“ type, where it is assumed that all possible data types can
be referenced by an index and the indices range from 1 to the total number of data
types deﬁned. Let X = U00 X.- and E = closure(P) \ X (set subtraction). The arity
of the element f of E is denoted by a(f). Ifa,t 6 closure(P) and level(t) Z level(a),
then n(a, t) is used to represent the number of occurrences of'a in t.

A partial ordering on a set of terms is an irreﬂexive transitive binary relation. If
>- is a partial ordering, then the symbol 2': is used to denote a >- b or a = b. Now a
partial ordering >1: is chosen for 2, and a non-negative number, or weight, w( f) is
assigned to each element f 6 S. and a positive weight w.- is assigned to each variable

of type i subject to the following conditions:

57

e If a(f) = 0 then w(f) >5 w,- for all i.
e If a(f) Z 1 and w(f) = 0 then f >~g g for all elements 9.

For each such choice of weights and partial ordering, the Knuth Bendix Ordering
(KBO) [55] is deﬁned on the terms, which is denoted by >- (>g,w) or just >-. For

this ordering, a weight is assigned to each term as follows. For any term t, let

w(t) = Z Z n(x,t) *w; + Zn(g,t) *w(g).

i 36X. 962

Thus, for example, the weight of a variable of type i is w;, and the weight of a term
is the sum of the weights of all symbols appearing in it. Suppose some component
C is represented by a ﬁrst-order predicate P. In order to incorporate the structural
information of P into the computation of the weight of P, the levels of the terms
(Deﬁnition 5.2) in P are determined, where the value of the level is used to redeﬁne
the weight of the predicate P. For this case, a term can be a variable, a constant, a

predicate, or a function. Now the weight of the predicate P is deﬁned as follows:

leuel(P,P)
w(P) = Z z: n($,P)*wi+ Z Z n(g,P)*w(g)*a,-
i xEX¢ j=l leuel(P,g)=jAg€E

where 0:,- denotes the input parameter, which emphasizes how the level of the terms
in P inﬂuences the weight of P, that is, part of the structural information of P is
incorporated in the calculation of w(P).

Since the software components in the framework are expressed in ﬁrst-order logic,
which contains a relatively small set of different types of symbols, it is simple to
determine their weights. However, it is possible that two terms may have the same
weight, thereby allowing two components to be mapped to the same value. In order
to resolve this conﬂict, each component C is labeled with an index, ind(C) = (W, r)

containing a pair of real numbers, where W denotes the weight of the component and

58

r denotes the rank of this component with respect to the other components having

the same weight. The rank can be deterministically decided by the Knuth-Bendix

Ordering described in Figure 5.1.

Deﬁnition 5.3 For some component C with weight W, where no other component
has the same weight then ind(C) = (W, 1). If there exist 1: other components with
weight W and all of them are greater than C according to the [(80, then ind(C) =

(w,k+1).

Similarly, the partial ordering for an index is deﬁned as follows:

Deﬁnition 5.4 For any two components C and C’, ind(C) = (w,i) and ind(C’) =
(w’,i’). We say ind(C) >- ind(C’) if either w >- w’ or w = w’ and i < i’.

Finally, the hashing function HF is deﬁned as follows.

Definition 5.5 For some component C with index ind(C) = (w, i), the hashing func-
tion is deﬁned as H F (C) = w as at + i, where 43 is a parameter that can be set to a
value that is either pre-deﬁned by the system or set by the user.

5.2 Retrieval Algorithm

The previous sections explained how software components are indexed, that is,
hashed to real numbers by the hashing function HF. We also need to be able to
retrieVe the components that satisfy the requests if one exists and to help the user
select the most similar components via the two—tiered hierarchy. In order to retrieve
the candidates for a given query, we devise the algorithm in Figure 5.2. The hashing
scheme in steps 1 and 2 has been illustrated in Section 5.1. The following explains
how the last two stages are supported in our approach. According to the two-tiered

hierarchy of software components described in Chapter 4, there are two kinds of

59

 

If s and t are two terms, then s >- t if and only if n(x,s) :- n(x,t) for all variables x 6 X

and either
1. w(s) >- w(t), or
2. w(s) = w(t)
and either
(a) s = f"(x) and t = x for some n 21, or
(b) s = f(sl,...sa(n) and t = f(t1,...tam) and f >- g,
(>- represents well-founded ordering) or
(C) 8 = f(81,...Sau)) and t = fa], ...tau)) and 81 =t1,... ,Sk-1 =tk-1
and s], >- t), for some lc with 1 S k S a(f).

Figure 5.1. Knuth-Bendix Ordering

 

nodes in the hierarchy. The ﬁrst kind of nodes, called real nodes, are the original
components, located in the nodes of the lower-level hierarchy. The second kind of
nodes, called meta nodes, are those generated by the clustering algorithm and located
in the higher-level hierarchy. In step 2 of the algorithm in Figure 5.2, the components
retrieved are always located in the real nodes since only real nodes are mapped to
real numbers by the hashing function HF.

The subsumption relationship is the major structuring mechanism of the
lower-level hierarchy. For the real nodes, a parent node subsumes any one of its
child nodes, that is a parent node is more general than its child node. This property
can be exploited in the retrieval process. Whenever a component is retrieved, the
components located in its child nodes can also be suggested as reusable candidates
by traversing the component hierarchy. However, since we emphasize the retrieval of
large scale software components, we need to automate the retrieval process as much as
possible. Figure 5.3 describes the retrieval algorithm that applies the subsumption

test given a query speciﬁcation and a set of candidate components. The output of

60

 

Algorithm 6 Retrieval Process

1.
2.

The query speciﬁcation Q is hashed to some number H F (Q)

Given a ﬁxed value 6, a set of components 9 is returned to the user and
(VC : C69 :IHF(C)-HF(Q)I S 6),

that is, the diﬂerence between the hashing function value of each retrieved com-
ponent and the hashing function value for the current problem is less than some

value 6 .

Find the nearest common ancestor of all nodes in (I, that is, the retrieved nodes.

Depending on the type of ancestor node, one of the following steps should be
followed:

Case 1: If the nearest common ancestor is a meta-node, then compute the sim-
ilarity values for all speciﬁcations C: contained in the node with the current
speciﬁcation and {2’ = {Cf}.

Case 2: If the nearest common ancestor is a real node R, then {2’ = {R}.
For all C 60’, apply subsumption test algorithm to C and Q to ﬁnd the compo-

nents that subsume or are subsumed by Q. If there are such components, then
return them to the user otherwise go to step 6.

Using background information, the user may choose to browse through the hi-
erarchy of the software components in order to retrieve the components most

suitable for the new speciﬁcation.

Figure 5.2. The Process of Retrieving Software Components.

 

61

 

Algorithm 7 Retrieval by Subsumption Test

Input: A set of candidate components 9 and Query Speciﬁcation Q.
Output: Three sets of components Agemmz, Away,” and Anfnmce.
Procedure:
begin
Ageneral = Aspecific = Areference ‘— 0
for each real node C let mark(C) «— false; endfor;
while (I 75 {}
for each component C E Q
mark(C) «— true;
9 ‘- n\ {0};
switch (Q,C):
Case (Q Q C and C Q Q): return(C);
Case {Q 21 C and C 23 Q):
for each component C’ 6 parent(C)
if C’ is a real node and mark( C’) = false
then Q o— n U {C’}
else if C’ is a meta-node then [inference «— Anfﬂmc, U {C ’}
endfor;
Case (Q Q C):
Aspecifie h Aspecific U {C};
for each component C’ 6 parent( C)
.if mark(C’) = false then It *— 0 U {C’}
endfor;
Case (C Q Q):
Ageneral i" Ageneral U {C}:
for each component C’ 6 children( C)
if mark(C’) = false then It ._ Q U {C’}
endfor;
endswitch;
endfor;
endwhile;
return Ageneral, Aspecific; and Areferencei
end.

Figure 5.3. The Algorithm for Retrieval by Subsumption Test.

 

62

this algorithm is a set of updated candidate components that are more similar to the
query requirements. For two predicates A and B, A '__'_l 8 means that A subsumes B.
For some node A, in our two—tiered hierarchy, child(A) denotes the set of its child
nodes (components) and parent(A) denotes the set of the parent nodes of A. The
input to this algorithm is the query speciﬁcation Q and a set of candidate components
0 derived from hashing scheme H C . The output from this algorithm contains three
sets of components: Agmd, Amdﬁc, and Anfmna. Agnew; includes those com-
ponents that subsume Q and Amen-ﬁe includes those components that are subsumed
by Q. Anjﬂem, holds a group of meta nodes produced from the retrieval algorithm.
Therefore, in our approach the user may obtain two sets of components, Agmd and
AMIic, that have a subsumption relationship to the user’s query speciﬁcation. Oth-
erwise, the user may communicate interactively with the system’s browser to conduct
the browsing when A,,,,,,.; and Amdfgc return no reusable components. Typically,
the user starts from the nodes contained in the set Anhmm that contains a group
of meta nodes as references. Single meta node represents at least two real nodes.
Consequently, all meta nodes of Arden“, can be replaced by a number of real nodes
ranked by their similarity with the user’s query speciﬁcation. Ranking is achieved
by comparing analogous functionalities shared by the new and reusable components.

The computation of similarity is described in further detail in Section 4.2.1.

5.3 Implementation of Retrieval Process

Figure 5.4 contains an example application of the construction and retrieval processes.
A group of reusable components are classiﬁed to from a two-tiered hierarchy. The
lower-level hierarchy generated by the subsumption test algorithm represents the
generality relationships among the components, where the parent component is more

general then the child component. The elliptic nodes are nodes newly created by the

63

 

Issse “use! as ‘l

(Subsumption Test) thlclmurlna (Two-Tiered Hierarchy) ( Llnesr chh’ (Hierarchical Such) (Compute Mule!)

 

 

 

a
-(E

' eon-postmssLe-ou ' Hierarchical mes Isa.

 

 

 

 

Win-rebut sou-as as? H

 

     
  

Essa listened Commas
Hen Ceml Conan-nu
More Speciﬁc Commas

 

““80"“-n

“cm I
CW’I’UCB (.00....) -> ID...
wee [Ia-see. n-. glue-us]

 

.memm...“ l

 

 

 

 

 

 

 

"L CONDITION:

 

 

 

(was) i E masses» 1
7. c. sue
me

MILCOIOITION:

mullﬁwmlm

numnmusmmuasnmsnms
times: he.”

teem

smut“: been) -) boobs.
m messes. k Useless]

 

 

 

 

 

 

 

 

 

 

 

 

Figure 5.4. Sample application of construction and retrieval processes

 

64

clustering algorithm to generate the higher-level hierarchy. On the right hand side,
a retrieval option is selected from the main menu to execute the retrieval process. In
this example, the user wants to ﬁnd those components that are more general than
the query speciﬁcation adt_queue, which is partially displayed on the left window
of Figure 5.4. Below the search menu, a window displays the result obtained by the
retrieval process. The result, in this case, is a set of components that are more general

than the query speciﬁcation adt.queue.

CHAPTER 6

Modiﬁcation of Reusable

Components Based on Generality

6.1 Introduction

In an attempt to perform software reuse, it is often the case that a given reuse can-
didate closely matches the needs of a query speciﬁcation. A challenging problem is
to determine what modiﬁcations are necessary to an existing speciﬁcation in order to
make it satisfy the query speciﬁcation. Accordingly, the software implementation of
the existing speciﬁcation must have corresponding changes. Modiﬁcation differs from
transformation in that correctness with respect to the original speciﬁcation is not
necessarily preserved. What we want is for the resultant program to be correct with
respect to the transformed speciﬁcation. Correctness-preserving transformations and
speciﬁcation-changing modiﬁcations are thus complementary. The objective of pro-
gram modiﬁcation is to obtain a program that satisﬁes its input-output speciﬁcation
along with the speciﬁcation for a new program. Comparison of two speciﬁcations may
suggest a transformation of the given program that will bring it in line with the new
speciﬁcation. Even if the transformed program does not fulﬁll the goal, it may serve

as a basis for constructing the desired program.

65

66

One of our reuse processes, modiﬁcation, will be described in this chapter. This
process is based on our previous work in the construction and retrieval processes.
We will ﬁrst present the speciﬁcation of a C++ class in items of an existing speciﬁca-
tion language Larch. Section 6.2 illustrates how to deﬁne the the expanded weakest
precondition semantics of the C++ language. Section 6.4 discusses the process of mod-

ifying a component that is more general than than a query speciﬁcation. Section 6.6

summarizes this chapter.

6.2 Predicate Transformer wp

There are several approaches to describe the semantics of a given program. One
approach is to regard the ﬁnal state in which statement 5 terminates as a function of
the initial state in which S has been started [56, 57]. For a given postcondition R and
a statement , the weakest precondition wp(S, R) describes the set of states in which
statement S can begin execution and terminate with R true and the weakest liberal
precondition wlp(S, R) is the weakest precondition under which 5 is guaranteed to
establish the postcondition R if the computation terminates. Generally, wlp(S, R) is
concerned with the partial correctness of S and wp( S, R) is concerned with the total
correctness of S. Since wp is more useful in program design, in this chapter, we only
use wp for the purpose of transforming predicates.

The following properties are from Gries’ book [56] except that the “Law of the
Excluded Miracle” is omitted, since in this law wp(S, false) holds precisely in those
initial states for which no computation under control of S exists. The “Law” erected
a considerable barrier for the conception of those initial states in which starting S
would be simply “inappropriate” or “impossible”.

Law of Monotonicity:

 

67

if Q ——+ R then wp(S,Q) —+ wp(S, R)

For two predicates Q and R, if the relation Q -> R holds then the relation wp(S, Q) —>

wp(S, R) also holds.

Distributivity of Conjunction:

wp(S, Q) A wp(S, R) = wp(S, Q A R)

Given a conjunction of two wps for one statement S with the two postconditions Q
and R, the conjunctive predicate can be converted into the wp of S with respect to

QAR.

Distributivity of Disjunction:

wp(S, Q) V wp(S, R) -* wp(ﬁQ V R)

A disjunction of two wp’s of a statement S with respect to different postcondition, Q
and R, implies the wp of S with respect to the disjunction of the two postconditions.

The syntax of a programming language deﬁnes the set of all programs that are
representable in it. In this chapter, the semantics of a programming language has to
deﬁne the semantics of each representable program, that is, it has to deﬁne the predi-
cate transformers wp and wlp of S. The deﬁnition of the semantics of a programming
language can be simpliﬁed to a deﬁnition of the functions wp and wlp, which are
functions from a representable program to predicate transformers. The domain, the
program, being recursively deﬁned by the grammar, wlp and wp need to be deﬁned
recursively over the grammar. Similarly, only the wp is deﬁned in this chapter and

its semantics in Gries’ pseudo-language and C (C++) is summarized here.

 

68

6.2.1 Simple C++ Statements

The following is a list of the allowed simple statements in Borland C++ [58].

1. Expression statements: assignment and function call.
2. Alternative statements: if-else, switch, case, and default.
3. Iterative statements: while, do while, and for.

4. Others: break, continue, goto, and return.

A compound statement consists of a series of statements. A null statement per-
forms no operation and is the same as the skip statement. We will describe the wp
semantics of Gries’ primitive programming structures (statements), i.e. abort, skip,
assignment, alternative, iterative, and CMpound statements [56, 21]. For each
statement, we attempt to deﬁne the wp semantics of corresponding C (C++) state-

ments. The wp semantics of recursive functions is not described here because it is

tedious and difﬁcult to use [59].

6.2.2 The abort Statement

For a given postcondition R, the abort command has the wp:
wp(abort, R) 5 false.

The operational interpretation of abort is that for all initial states its execution fails
to terminate. One of the corresponding C (C++) statements of abort is an inﬁnite loop

of the form

:899;
uhile(x < 100){

x";

69
6.2.3 The skip Statement

The operational interpretation of skip is that its execution, which is guaranteed to
terminate, leaves the values of all variables unchanged. Its wp(S, R) is an identity

function as follows

wp(skip. R) E R.

6.2.4 The assignment (y := E) Statement

wp(“y := E”,R) E (y 2,, E) A domain(E) A 12%,,

where the predicate (y _>_,,,, E) is true when y is greater than E in the partial ordering
of symbols, and the predicate domain(E) is true when E is deﬁned in the domain
of the current state. If both of the predicates are true, then all the occurrences of
y are replaced with E in R. The statement y := E is known as “the assignment
statement”. Its operational interpretation is that its execution, which is guaranteed
to terminate, leaves the values of all variables, except y, unchanged, whereas the ﬁnal
value of y equals the initial value of E. In order to apply wp to C++ programming
language, the expression E can be generalized. For example, if S Ey++ then S can
be transformed into y := y + 1, hence E E y + 1. However, generalization sacriﬁces

the elegancy of predicate transformers.

6.2.5 The alternative (IF) Statement

The alternative statement is traditionally written as a list of guarded commands
Surrounded by the parenthesis pair “if-ﬁ”. A fat bar “0” separates the guarded

commands in the list. A typical alternative statement is of the form:

ZfBl '451
082%52

Cl 8,, —-) 5,,
fi.

With the abbreviation BB given by

BB

(3i i: B.)

where ‘::’ indicates the range of i is not needed for the discussion. The semantics of

the alternative statement 5 satisﬁes
wp(S, R) 5 BB A (Vi :: B, —. wp(S,-,R)),

It is straightforward to convert/ transform an alternative statement in C++ into the

above general form. The if-else statement of the form

if (81)
81;

else if (B2)
82;

else 53;

is converted to

1TB! "" SI
Cl (“Bl A 82) -> 52
U (“181 A ﬁBg) —* S3

fi.

In C++, the test condition is considered true if it is nonzero and false otherwise.
This means that even negative values are considered true but, in Gries’ primitive
programming language (GPPG), the test condition must be a Boolean expression.
For the purpose of brevity, we omit the information of type checking when we deﬁne

the wp semantics. For the switch statement of the form

switchCExpr) {

case BI:

81;

break;
case 82:

S2;

break;
case B3:

33;

is converted to

if (Expr 8 B1) —-> Si;
O (Expr 8 82 —> S2;
Cl (Expr =- 83) -* 83;
fi
This type of statement is designed to facilitate the handling of multivalued branch-
ing. The expression Expr in C++ must be reducible or convertible to type int. The
constants B1, B2, and so on, must all be integers or must be unambiguously con-
vertible to integers. When a break expression is encountered in processing a case
.or default statement, the switch block is exited. In GPPG, if any of the guarded
command statements are executed successfully then the alternative structure is said
to have executed properly. The wp semantics of if-ﬁ in GPPG can not describe the
default statement in C++. Let us consider another switch statement of the form
suitch<Expr) {
case Bl:
Si;
break;
case B2:
52;
default:

33;
break;

which is the same as

 

72

svitch(Expr) {
case Bl:
SI;
break;
case B2:
82;
case DEF: \\ if (Expr !- Bi /\ Expr !- B2)
33; \\ then Expr = DEF
break;
}

Its corresponding GPPG structure is

if (Expr - Bl) -> Si;
[3 (Expr 8 B2) —» S2;

[2] (Expr = DEF) —-+ 83;
fi

6.2.6 The iterative (D0) Statement

The iterative command is of the form

dOBl—tsl
DBz-tSz

Cl 8,, —+ 5,,
0d

The wp of the iterative command with respect to a postcondition R is implied by the

predicate
wp(do-od, R) +— (ElP :: P A HBB —+ R A(P A E.- -v wp(S,-,P)))

where P is an invariant of the do-od command and BB is the disjunction of the

guards. The while statement is one of the major constructs for iteration in C++. The

general format is

uhile(condition)
statement;

 

73

which can be converted to the GPPG iteration statement of the form

do condition —> statement
0d.

The most common iteration construct is the for statement. Here is the general form:

for(init,statement; condition; control_statement)
loop_statenent;

The for statement can be converted to the iterative GPPG statement of the form

do first.iteration —> init.statement
0 condition —+ loop_statement; controljtatement

0d

where the predicate first .iterat ion is true when the program is in the ﬁrst iteration

of the 100p.

6.2.7 The compound (So;Sl) Statement

The expression So; 31 indicates that the statement SI is executed after So. The wp

of So is dependent on the wp of SI with respect to R. The wp semantics is

wp(So; 51, R) E wp(So, wp(Su R))

6.3 Modifying a More General Component

The modiﬁcation process is part of our reuse system that applies formal methods to
the reuse of existing software. In this section, the problem deﬁnition, algorithm, and
example of program modiﬁcation based on speciﬁcation changes are presented.
After the retrieval process, if an exact-match component has been found then we
return it directly to the users for reuse. However, it is unlikely that we can retrieve the

components that are exactly the same as the input speciﬁcation every time. A reuse

 

74

system supporting ones this approach will provide a minimal amount of assistance for
reusing existing software components, since generated speciﬁcation components are
limited to at most combinations of existing software components. In order to overcome
this weakness, a mechanism for modifying a retrieved component to satisfy the query
speciﬁcation is needed. In this section, we present the process of transforming some
retrieved component that is more general than the query speciﬁcation with respect
to the Q,,.,,, relationship (See Chapter 4 for details regarding this relationship).

The modiﬁcation of software components can be accomplished by direct editing
of the source code. However, editing source code by intuition may invalidate the
correctness of the original components and force the developer or user to work at a
low level of abstraction. The effort of veriﬁcation and modiﬁcation of source code
components offsets a signiﬁcant amount of the effort in component reuse.

Our approach is to modify the reusable component at the speciﬁcation level instead
of the code level. Once the necessary changes the old speciﬁcation needed to satisfy
the query speciﬁcation have been determined, the information needed to modify the
speciﬁcation is applied to some program synthesizer (or programmers) to generate
the required program. The degree of modiﬁcation can be achieved through the use of
automated reasoning techniques since our components are represented by ﬁrst order
logic. Using our approach reduces the drawbacks noted above for modiﬁcation via
direct source editing. Use of automated reasoning and a program synthesizer preserve
the correctness of software components for all possible query speciﬁcations. Thus the
user will be less concerned about the validation of a reused component. Also, given
the behavior of what a software component does, it is possible to use and modify
a software component without having to understand the low level implementation
details in the component. Since fully automated modiﬁcation is not possible now,
Semi-automatic modiﬁcation is an alternative to expedite the whole process.

A query speciﬁcation is a set of requirements that have a form similar to the

75

component speciﬁcations. We deﬁne that some existing speciﬁcation Old_Spec is

reusable for the query speciﬁcation Query.Spec if they satisfy the following conditions:
0 0ld.Spec.pre —> Query.Spec.pre
s Query.Speepost —* 0ld-Spec.post

In terms of logical reasoning, the above two conditions can be regarded as that the
preconditions of Query.Spec are a superset of Old-Spec preconditions and the post-
conditions of Query.Spec are a subset of Old-Spec postconditions.

The problem of modifying an implemented module based on the speciﬁcation
changes is given in Figure 6.1. This diagram illustrates how the generality relationship
can be incorporated into the modiﬁcation process. The refinement relationship deﬁnes
the relationship between the speciﬁcation and implementation modules. Therefore,
this diagram can be applied to any level of program abstraction only if the generality
and reﬁnement relationships hold. What we want to ﬁnd is a mapping that transforms

the old implementation module to a new implementation module that satisﬁes the

query speciﬁcation.

 

Recognition of Speciﬁcation Changes

 

 

Old_Spec _ : Query_Spec
t t
reﬁnement E refinement E
' Program Modiﬁcation '
Old_Progrsm : QueryJ’rogrsm

Figure 6.1. Modifying Implementation Based on Speciﬁcation Changes.

 

76

6.4 Modiﬁcation Process

Three sets of candidate components are output from the retrieval process for some
query speciﬁcation [42] and they are Agencmz, Auna'fic, and Anfmnce. The set Agnew;
(Amen-ﬁg) contains the candidate components that are more general (specific) than
the query speciﬁcation. The set Anhmm is used as a reference set for browsing the
component hierarchy so we regard it as a reusable set. Since the implementation of
a more speciﬁc component satisﬁes the query speciﬁcation, it can be reused directly.
Only the generality relationship is used to facilitate the modiﬁcation in Figure 6.1.
More helpful relationships will be added to our modiﬁcation process. for example,
modifying the analogous components [60].

A process for modifying a more general component is presented in Figure 6.2.
From the deﬁnition of the generality relationship, component A is more general than
component B, i.e., A 2cm, B, only if all methods of A are more general than the
subset of the methods in B. Therefore, some methods in B may have no relationship
with any method of A, even if component A is more general than component B.
Hence, the modiﬁcation process shown in Figure 6.2 can only give a partial imple-
mentation of a query speciﬁcation. For the method Mold in an old component that is
more general than some method in a query component (speciﬁcation), we attempt to
ﬁnd the weakest precondition of its implementation with respect to the postcondition
of the subsumed query method M,,,,,,,. The method for ﬁnding the wp for a C++
language construct is illustrated in Section 6.2. Then construct a conjunctive expres-
sion from the found weakest precondition with the precondition of Mold that is a new
precondition for the old implementation. Finally, input the new precondition, the old
implementation and the new query postcondition to some program synthesizer, either
a semi-automated program synthesis system or a programmer, in order to obtain a

new implementation satisfying the query speciﬁcation.

 

Algorithm 8 Modifying a More General Component

Input: Two components SPECOM, SPECqucry, where the relationship
(SPECOM 2m, SPEC’quﬂy) holds.

Output: Partial implementation of SPECqucry.

begin
for every method Maid of S PECOM
for every method MW", of S PECq,my
if (Mold Qmethod Mquery)
then
Let PROGOM be the implementation of Mom.
Find WP E wp(PROGou,Mquc,y.post).
Let NEW.pre 4— Moldpre A WP.
Let PROunm, «— synthesizer(NEWpre,PR0G°14,Mq,,,,,,.post).
Assign PROun", as the implementation of M,,,,,,.
endif
endfor;
endfor;
end.

Figure 6.2. The Process of Modifying More a more general method.

 

Program synthesis will not be discussed here since it involves some techniques that
are beyond the scope of this report but one example will be shown shortly. In the
modiﬁcation process, the relationships among components and methods have already
been established in the retrieval process so that the complexity of the modiﬁcation
process only arises from the calculation of weakest preconditions and the process of
program synthesis. The former uses polynomial time in terms of the size of the query
speciﬁcation plus the length of the old implementation, i.e., source code. It is difﬁcult
to measure the complexity of program synthesis, however, we believe the complexity
is highly reduced since the needed changes at speciﬁcation level are determined before

the source code is edited.

78
6.5 Modiﬁcation Example

An example of the modiﬁcation problem shown in Figure 6.1 is given in this section.
An old component, the speciﬁcation of List class is given in Figure 6.3. Object in the
interface represents any kind of mathematical numbers, e.g., real number, characters,
strings, and so on. This component deﬁnes ﬁve methods and all of them will be
implemented in the public segment of a C++ class. The methods List and ~List are
the constructors and destructors of class List, respectively. By convention, an object
of a class is initiated by the constructor before any other use. A client of an object
should call the destructor when it knows that the object will never be referenced
again. The clause ensures trashed(*s) says that upon return from the destructor
nothing can be assumed about the storage pointed to by s in the precondition state.
A good implementation of a destructor will free storage that is no longer needed,
although the speciﬁcation does not require it. The method List: :insert adds an
Object element to the tail of a List. The method List: :detach deletes an Object
element from List. The method List: :isA reports the user that this object belongs
to List class. Lists that are not aliased to any object visible in the precondition.
Thus the Lists that they return can be modiﬁed without affecting the values of other
list. One way of implementing this is to allocate new storage. The operators U and
n are assumed to be deﬁned in LSL trait Set.

C++ currently has no standard classes or hierarchies other than those used with
streams. C++ compiler vendors used the NIH classes [61] to test their own compiler.
The Borland company decided to include a variation of the NIH classes with Borland
C++, which they called the Borland C++ container class library. The library contains
classes that are organized into three logical groups: simple classes, container classes,
and iterator classes. Simple classes are classes that cannot be iterated upon such as

class String. In contrast, the container classes can contain a whole series of other

79

 

component List
abstract type List;
public
void List(List *list) {
modiﬁes I«list;
ensures (*list)’ = NulLList;
}
void ~List(List *list) {
modiﬁes *list;
ensures trashed(*list);
}
void List: :add(List IIllist, Objects e) {
modiﬁes l“list;
ensures e1ement(list’,tail(list’)) 8= e A
1ength(1ist’) 88 1ength(list) + 1;
}

bool List::detach(List *list, Objecta e) {
requires e 6 (*list);
modiﬁes l“list;
ensures in(list’,e) =8 FALSE;
}
string List::isA() {
ensures result 8 ”list class";

}

Figure 6.3. The LCL Speciﬁcation of List Class.

 

80

objects, which in turn can be containers themselves. Examples of containers are
classes of Collection, Stack, Bag, and Set.

The container class library has several classes that are capable of storing multiple
objects. In order to handle the objects inside these containers, we need a way to
iterate through them. Borland C++ chose to do this with special iterator classes,
which allow us to iterate through the objects in a container. The implementation of
List class is given in Appendix E.1 (except for the destructor). List inherits some
operations from the Container class.

A query speciﬁcation, the speciﬁcation of Array class, given in Figure 6.4. This
component deﬁnes six methods: Array, ~Array, Array: :addEnd, Array: :addAt,
Array: :clear, and *Array: :isA. The method Array: :addAt has no corresponding
method speciﬁed in the speciﬁcation of List class. Similarly, the operations ‘6’ and
‘delete’ are assumed to be deﬁned in the LSL traits library. Let us consider the two
classes List and Array. From the deﬁnition of generality, List 2cm, Array since
the following relationships hold:

List 2mg.“ Array
~List QM,“ ~Array
List: :add 2mm“ Array: :addEnd

List: :detach 2mg,“ Array: :clear
List: :isA 2mg,“ Array: :isA

For the purpose of conciseness, we only consider the relationship between two
methods - List: :add and Array: :addEnd. Following the procedure of modiﬁca-
tion process for more general component, we ﬁrst ﬁnd the weakest precondition
of the statements in the method List: :add with respect to the postcondition of

Array: :addEnd. The statements in the method List: :add are

81

 

component Array
abstract type Array;
public
void Array(Array tarray) {
modiﬁes 8array;
ensures (It-array) ’ 8 NULLJirray;
}
void ~Array(Array *array) {
modiﬁes *array;
ensures trashed(*array);
}
void Array::addEnd(Array 8array, Objectt element) {
modiﬁes 8array;
ensures array’(maxIndex(array’)) 88 element A
size(array’) 88 size(array) + 1 A
naxIndex(array’) 88 narIndexCarray) + 1;
}
void Array::addAt(Array 8array, Objects element, int atIndex) {
modiﬁes I“array;
ensures array’(atIndex) 88 element A
size(array’) 88 size(array) + 1;
}

bool Array::clear(Array *array, int Index) {
modiﬁes I«array;
ensures array’(Inder) 88 NULL;
}
string Array::isA() {
ensures result 8 "array class";

}

Figure 6.4. The LCL Speciﬁcation of Array Class.

 

82

void List::add( Objecta neuElement )

{
ListElenent 8neeElenent 8
nee ListElenent( anevElenent );
newElenent->next 8 head;
head 8 newElenent;
itensInContainer++;
}
The corresponding GPPG structure of the body statements is
List-Prog:
{

nevElenent->next :8 head;

head :8 newElenent;

1ength(1ist) :8 1ength(1ist) + 1;
}

The postcondition of List: :add in Figure 6.3 is

List_Post:

{
element(list’,head(list’)) 88 newElement /\

1ength(1ist’) 88 1ength(1ist) + 1
l‘
The postcondition of Array: :addEnd in Figure 6.4 is

Array_Post:

{

elenent(array’ ,naxIndex(array’)) 88 newElenent /\
size(array’) 88 size(array) + 1 /\
naxIndex(array’) 88 naxIndex(array) + 1

1*
The relationship

(List :: add) 2mm“ (Array :: addEnd)

holds because the following relationships hold:

list 3“,", array
head(1ist) 2m... narIndex(array)
length 2“,,” size

The relationship (list 2m", array) refers to the generality relationship deﬁned

in the LSL level. The second and the third relationships hold since the corresponding

 

83

terms are deﬁned in the same equivalence classes, for example, length and size are
both in the class referring to the cardinality of the objects in some ADT. Replacing

the terms in List.Prog by the more speciﬁc terms, we have

Array-Prog,1:
{

nevElenent->next :8 arrayEmaxIndex(array)];
array[naxIndex(array)] :8 nevElement;
size(array) :8 size(array) + 1;

}

The ﬁrst statement is unexecutable since the assignment command is applied to two
objects with different types, so we discard this statement momentarily and have the

following candidate program for Array: :addEnd:

Array_Prog-2:
{

array [maxIndex(array)] :8 newElement ;
size(array) :8 size(array) + 1;
}
Applying predicate transformer wp to Array_Prog_2 with respect to ArrayJ’ost, we

have

wp(Array.Prog.2, Array_Post)
= (newElement == newElement) A
(size(array) + 1 == size(array) + I) A
(maxIndex(array’) == maxIndex(array) + 1)
= TRUE A TRUE A (maxIndex(array’) == maxIndex(array) + 1)

= (maxIndex(array) == maxIndex(array) + I )

Since the precondition of List: :add it TRUE, the NEW. pre of the modiﬁed pro-

gram will be

maxIndex(array’) == maxIndex(array) + 1.

Based on the information of the above weakest precondition, we know that the value of

maxIndex of some array needs to be incremented by one ﬁrst before Array.Prog_2

84

executes to compute the correct result. Therefore, we have the following modiﬁed

GPPG statements that satisfy Array_Post:

Array_Prog_3:

{
maxIndex(array) 8 maxIndex(array) + 1;
array [narIndex (array)] : 8 newElement ;
size(array) :8 size(array) + 1;

}

And we convert Arrayfrogrj into C++ program and have the implementation for

ArrayzaddEnd:

void Array: :addEnd( Obj ectt newElement )

‘I
while( theArray[ whereToAdd ] !8 ZERO as
whereToAdd <8 upperbound )
{
whereToAdd++;
}
if( whereToAdd > upperbound )
{
reallocate( whereToAdd - lowerbound + 1 );
}
theArray[ vhereToAdd - loverbound ] 8 tnevElenent;
whereToAdd++;
itemsInContainer++;
}

Other parts of the implementation of Array class are illustrated in Appendix 13.2.
Clearly, modifying the source code of a reusable class such as List is not fully au-
tomated yet. However, the modiﬁcation process can be greatly facilitated given the

information from the speciﬁcation changes as described in the above example.

6.6 Summary

This chapter showed how C++ statements can be generalized to GPPG and deﬁned the

wp semantics for simple GPPG constructs. More deﬁnitions will be required for the wp

85

semantics like function and recursion in the future. We also described the modiﬁca-
tion process as applied to modify the source code whose speciﬁcation is more general
than the query speciﬁcation. A general issue of modifying a reusable program-com-
ponent from the speciﬁcation changes is the incompatibility of subclass deﬁnition in
the application language and the generality deﬁnition in the speciﬁcation level. The
subclass deﬁnitions in C++ are so implementation speciﬁc such that they cannot be
used in the speciﬁcation level. In the next chapter, we will generalize our modiﬁca-
tion process to other types of speciﬁcation revision problems, i.e., using analogy to

determine program modiﬁcation [60].

 

CHAPTER 7

Modiﬁcation of Reusable

Components Based on Analogy

This chapter presents an approach, based on formal methods, to the modiﬁcation
of reusable software components. Phom the framework of a two-tiered hierarchy of
reusable software components, the candidate components that are analogous to the
query speciﬁcation are retrieved from the hierarchy. An analogous retrieved compo-
nent is compared to the query speciﬁcation to determine what changes need to be

applied to the corresponding program component in order to make it satisfy the query

speciﬁcation.

7 .1 Introduction

Analogy is often used in everyday situations to assist in decision making. The main
objective of analogy is to make use of past experiences to solve similar problems.
Analogical reasoning has long been recognized as an important tool to overcome
the search complexity of ﬁnding solutions to novel problems or inducing generalized
knowledge from experience. Analogy presents a basic and challenging question: when

are two speciﬁcations (problem representations), for some purpose, alike? [62]. This

86

87

section outlines the scope of analogical reasoning techniques that we are using to
enhance our software reuse system [43, 42].

We regard the modiﬁcation process as a problem solving process, where-Figure 7.1
contains a framework for modifying components based on the analogy of formal spec-
iﬁcations and is similar to the problem of modifying an implemented program pre-
sented in Chapter 6. The problems in this process are the speciﬁcations that represent
reusable software components and the solutions become the executable implementa-
tions of the corresponding speciﬁcations. The objective of software development in
our context is to “solve” speciﬁcation Query.Spec which is referred to as a query
speciﬁcation. Old.Spec is referred to as a candidate speciﬁcation whose implementa-
tion 0ld.Program is known. A match that is found between the two speciﬁcations
represents the similarity of the two speciﬁcations. Based on this match, the analogy is
used to guide the modiﬁcation process, i.e., the software developer. Figure 7.1 shows
how the match can be incorporated into the modiﬁcation process. The reﬁnement
relationship deﬁnes the relationship between the speciﬁcation and implementation
modules.

If a “good” analogical match between the candidate and query speciﬁcations
can be found, then the effort required to develop the appropriate implementation,
Query.Program, will be reduced. We call this approach to modify existing program
based on an analogical match between two speciﬁcations, the Analogical Reuse Mod-
iﬁcation Process (ARMP). The problem of ﬁnding promising candidate speciﬁcations
for some query speciﬁcation from large knowledge bases of known and implemented
speciﬁcations is referred to as the base ﬁltering problem [63]. In Chapter 5, a retrieval
scheme based on the similarities among reusable components ﬁnds a set of candi-
date speciﬁcations that are similar to the query speciﬁcation. Our retrieval process
augments the ARMP model in the form of a pre-processing phase/ stage.

The idea that programs should be constructed by a series of transformations has

88

 

 

Find Analogical Matches
O|d_Spec : : Query_Spec
. e
reﬁnement E reﬁnement :
' Program Modiﬁcation ' ’
O|d_Progrsm : QueryJ’rogrsm

 

Figure 7.1. Analogical Reuse Modiﬁcation Process.

 

been widely promoted. Modiﬁcation is different from traditional program transfor-
mation because a program transformation is typically correctness preserving with
respect to the original speciﬁcation, but the program modiﬁcation approach needs a
program that satisﬁes its input-output speciﬁcation along with the speciﬁcation for a

new program. Program modiﬁcation process can be divided into the following phases:

0 Match: discover an analogy between two speciﬁcations.

Veriﬁcation: check the validity of the proposed modiﬁcation.

Replacement: apply the proposed modiﬁcation to the program.

Recoding: rewrite any unexecutable statements.

Synthesis: synthesize new segments.

Optimization: make the new program optimized, e.g., more efficient.

We emphasize the matching process in the ARMP model, i.e., the ﬁrst phase
of the modiﬁcation process because we try to ﬁnd an effective way to reduce the
effort of modiﬁcation by providing a set of “useful” matches to the programmers.
Dershowitz [64] presents an appealing approach .to program construction by modiﬁ-
cation. His method reﬂects the observation that programmers only devoted a limited
amount of time and effort to create code for a given speciﬁcation from scratch. Pro-

grammers often apply their knowledge about earlier programs to the development of

89

similar problems. Our work focuses on augmenting Dershowitz’s methods in order to
make it amenable to automatic applications.

In- attempt to analyze and design programs to perform the analogical matching
process. We found it useful to consider the phenomenon of analogical reasoning as
a whole. Both the brief accounts of analogy in human reasoning and the difference

between analogy and generalization are described in Appendix A.

7 .2 Analogical Matching

An analogical match is deﬁned to be a group of pairings between symbols in
terms of candidate and query speciﬁcations. It consists of the following form:
(symboll, symbolg,position1,positiong) where symbol; is located at positionl of term
term“ and similarly for symbolg, positiong, and termg. The two symbols in any
associated pair in their corresponding positions are called the match. For instance,
for the two terms max(a,array(b)) and min(queue(c),a). An analogical matching

process may generate the following match:
( (max! min) i], ll), (““3 a: ll], [2])? (array, queue, [2]) [1]), (bici[291]3[1,1]))'

The above example exhibits a bijective mapping between terms max(a,array(b))
and min(queue(c),a). However, as far as the ﬂexibility of a match is concerned,
some required features of match are needed to enhance the power of the matcher.
’ For example, (1) predicates, function, and constant symbols may be matched with
different predicate, function, and constant symbols, respectively; (2) the arguments
for predicates and functions that are matched may be permuted by the matching
process. Since the argument order of functions and predicates is often arbitrary, it is
obviously unreasonable to insist that matches preserve argument order; (3) matches

may contain inconsistencies where a symbol on one side is matched with two distinct

90

symbols on the other; (4) ﬁnally, symbols and subterms may be left unmatched in an
analogy.

According to Hesse’s theory [65], analogy is most useful when the domain of
query speciﬁcation is not well understood. Applying his theory to the software reuse
problem, if the relationships between the query and candidate speciﬁcations are not
well known, e.g. no generality relationship [43], then we apply the analogical matching
process because we are uncertain as to whether we can reuse the old programs to
satisfy the new speciﬁcation. Furthermore, it seems that the nature of analogy is
empirical, that is, only after analogy has been applied to a query speciﬁcation can
we determine whether a speciﬁc match is good or bad. There is no formal theory or
rule that rigorously describes the process for generating a reliable analogical match.
Therefore, most analogical matching algorithms use heuristics to direct searches for
good analogical matches. A given heuristic captures system-deﬁned criteria as to what
constitutes a reasonable analogy. It is believed that the notion of analogical heuristics
is a useful tool in building analogical reasoning systems. A top-down matching process

is described in this chapter and a bottom-up approach can be found in Appendix B.

7 .3 Heuristics for the Matching Process

Since the search for analogical matches can potentially be combinatorially explosive,

using an exhaustive search mechanism is certainly not feasible. Consider the following

two expressions from propositional logic:
{aV(bAc) = (aVb)A(aVc), aA(ch) = (aAb)V(aAc) }.

A matching is an association between the two expressions; i.e., a subset of the Carte-
sian product of the sets of symbol occurrences in the expressions. In this example, each

expression contains 13 symbols, so the Cartesian product contains 13 x 13 = 169

l.—

 

91

symbols, and hence has 2169 subsets. Obviously, some heuristics are deﬁnitely needed
to prune the search space.

-- When aheuristic is used in analogy systems, issue is to determine-what. kindef
information should the system have to enhance the applicability of the heuristics,
that is, what contextual knowledge should be included in the heuristics. A powerful
automated analogy system should be able to operate without much contextual knowl-
edge. However, if the context is very important, a “pure” syntactic analogy system
may fail on some intuitively straightforward examples. Therefore, an ideal analogy
system should allow some contextual knowledge to be supplied interactively by the
users or domain experts.

Identical associations are often believed to make good analogues, similarly, the
matchings containing high proportions of identical associations make good analogies.
We call this approach the identical symbols heuristic. While it is obvious that most
interesting analogies involve a signiﬁcant proportion of non-identical associations.
However, the identical symbols heuristic is valuable in ﬁnding similar speciﬁcations
(problems). This heuristic has already been incorporated in the construction and
retrieval processes in our system. In the analogical matching process, we also use this
heuristic since the candidates to be matched are retrieved from a hierarchy by the
retrieval process (See Chapter 5).

Another promising analogical approach is to consider matchings that respect the
structure of the terms. We call this approach a homomorphism heuristic. A homomor-
phism is a mapping that obeys the structure of the entities to which it relates. Most
analogy systems use some form of structural mapping. For example, Gentner [63]
introduces the notion of systematicity of a structural mapping; this property refers to
the extent to which matched objects are mutually constrained by identical relations.

For some heuristic criteria preferring matchings between items (predicates, func-

tion symbols, or atoms) of the same or similar semantic type according to an equiv-

92

alence class partition of symbols, we call this approach the semantic type heuristic,
which requires some kind of type hierarchy or other means to make a similarity judge-
ment. In Chapter 3, we used the Larch LSL trait hierarchy as the semantic type hier-
archy to determine the generality relations between two terms. Since the semantics of
an LSL trait hierarchy is well-deﬁned, an LSL trait hierarchy can be used as the basis
for a heuristic to determine the similarity between any two logical entities. In Chap—
ter 4, the construction process accesses the semantic class (or equivalence class) for
each of the predicates and functions in the pre- and postconditions that are clustered
into a uniﬁed hierarchy. For example, the predicate bigger(a, b) denoting that atom
a is bigger than atom b, belongs to the semantic class comparison(atom1,atom3).
The intended meaning is that bigger is a comparison operator applied to two atoms.
Similarly, the predicate smaller(a, b) belongs to the same semantic class. In all the
. analogies considered in ARMP, predicates are only mapped to predicates, and propo-
sitional connectives are mapped only to propositional connectives. All functions and

constants are converted to predicates. Thus there are no functions and constants.

7 .4 Top-Down Matching Approach

In the context of recognizing reusable candidate speciﬁcations for solving query spec-
iﬁcations, the analogical matching process should have a ﬂexible notion of analogy-
based match rather than imposing a design bias towards any single heuristic. It
should be easily tailorable to any particular domain-speciﬁc heuristic (knowledge).
The heuristics mentioned in Section 7.3 should be reﬁned to precisely deﬁned val-
ues. That is, some numerical metric should he invented to measure the usefulness
of a given analogy. Once a precise deﬁnition for the goodness of an analogy match
is given, the analogy problem can be regarded as an optimization problem. For an

optimization problem, ﬁnding a global optimum is not feasible at most of the time

93

therefore we might reasonably want to generate a set of local optimal analogies.

Since predicate logic is used to express the behavior of software components, the
object language of our analogy system is based upon ﬁrst-order logic. A semantic
type heuristic suggests the match of two terms but the pairings are not limited to
the terms in the same semantic group. We explore how to include the properties of
associativity and commutativity in predicate logic in the analogy matching process.
For example, it is unsatisfactory that the analogical matches from a set of paired
atoms is based on an arbitrary preservation of argument order.

The analogical matching can be regarded as a recursive problem solving process.
The initial problem is to match two terms expressed in predicate logic. As the match-
ing process continues, new subproblems are produced and recursively solved. No more
subproblems will be produced in two cases: (1) one of the subproblem’s terms is a
constant or variable; (2) no new analogical pairing is applicable for this subproblem.

The generation of subproblems from the matching process can be classiﬁed into
two kinds of branches: or—braneh and and-branch. When we want to match terms
containing a commutative operator, the set of derived subproblems may suggest more .
than one way of solving the problem. Therefore, this case applies, the current problem

Should branch into a set of new subproblems, each generating a new group of pairings.

For example, consider the following problem:

{ f1($1ay1)/\ f2($2,y2),gl(a1,bl) A 92(021 b2) }'

Since A is a commutative operator, the matching process may produce two sets of

In at ches:

((AiAillillli (flaglilllilllla (f27921[2]’[21)>
< (Ar/Milt”): (flig2’[1]’[2]), (f2)91,[2]’[1])>

Respectively, the matching process generates the following two sets of subproblems:

;_

 

94

{ {($1,91): (a1, bllla {(172,312), (02, 172”}

and

f {(31,311), (029 ’92)}, {($2,y2),(01, bill}

Thus, the current state of matching between a pair of terms involves a set of partial
matches and two sets of or-branch subproblems. We only need one of the or-branch
subproblems to be solved in order to proceed to later stages of the matching process.

The or-branch subproblems are generated by permuting the order of arguments to

obtain new sets of argument mappings.

In contrast, if the match process tries to match two terms where one or both
of them are predicates, then the problem is branched into two subproblems, each

yielding one set of pairings. For the purpose of illustration, we change the operator

A of the previous example to an equality predicate as follows:
{ fi($1,y1) > f2($2,y2): 91011.51) 2 92(02,52) }

Where > and Z are not commutative operators. Hence, only one match is generated

<(>’Ztﬂ,[])’ (fliglilllilllli (fZigzilzlilzlll

And the matching process generates the following two sets of subproblems:

{(31ay1),(01a 51)}
{(xh y2)i (“2, b2)}-

31101) that the current problem is split into two new sets of subproblems, with each
represented as an and-branch together with a partial mapping. The newly generated
matches from these two subproblems should not conﬂict with each other, that is, no
inconsistent pairings will be generated. We will deﬁne inconsistency shortly.

Fram the above examples, we may conclude that and-branch subproblems are

g("Etherated whenever the argument pairing within an identical argument mapping is

‘—

95

performed; or-branch subproblems are generated whenever the matching process en-
counters commutative terms and attempts to perform argument pairings of permuted

argument mappings of the terms. The terms in the latter case include ordinary pred-

icative connectives, for example, A and V.

Deﬁnition 7 .1 Conﬂict. Some pairing 01 = (x,y,P,,P,,) has a conﬂict with an
existing mapping Q {g (1 ) there is some pairing 0; E Q, and 02 consists of either x
or y but not both; or (2) there exists some pairing or; = (x, y, Q,, Q”) E Q and both
01 and 02 are argument pairings within the same predicates or functions, but either

P29£Qz 0er7in'

Deﬁnition 7.2 Consistency. Some pairing 0' is consistent with some existing map-
ping Q if a has no conﬂicts with Q.

If the matching algorithm is restricted to the preservation of argument order, then the

second requirement of Deﬁnition 7.1 will never be applicable in the matching process.

7.5 Matching Algorithm

Given a matching subproblem, consisting of a pair of speciﬁcations or and ﬂ rep«
resented in ﬁrst-order logic, and an existing mapping Q0”, the matching algorithm
attempts to ﬁnd a new consistent mapping QM,” and returns it to the user. We assume
all variables of a and ,6 have been either skolemized or universally quantiﬁed. The
matching algorithm is given in Figure 7.2. This algorithm is based on the matching
Process approach presented in Section 7.4.
Several heuristics are exploited in Algorithm 9. This algorithm uses a top-down

8S-‘lleme to compare two input terms. The functor symbols of two input terms should

be matched before argument pairing tests are performed, hence a homomorphism

helll‘istic is partially incorporated. It is denoted as a partial homomorphism because

commutative argument pairings are allowed in this algorithm. If some term is a

Commutative operator, then several versions of the term with permuted arguments

are Created to generate the or-branch subproblems, which is case 3 in Algorithm 9.

M

96

 

Algorithm 9 M atch( a, 13. Qozd, Qnm)

Input: Two terms 01,5, and a current partial mapping Q0“.
Output: A new partial mapping an.

Procedure:
begin
switch(a, ﬂ)
case 1: one of a and ﬂ is a constant or variable
if consistent((a, ,6),Qou) then
Qnew ‘- Qold U {(013)};
8188 Qnew "" Wold;
return;
we 2: a = f(21332i Hutu) and )6 = 9(yliy2i "'9 ya):
either f or g is not commutative % and-branch subproblems
if consistent((f,g),Qou) then
4’0 *- 2014 U {(faglli
for all i do
.1!atch(x;,y,-,Q,--1,Q,-);
end_for;
Qnew '- Q7”.
else Qnew "" ﬁeld;
return;
case 3: a = f(ri.22) and ﬂ = gnaw),
both I and g are commutative % or-branch subproblems
if consistent“ f, g),Qo(g) then
Q0 "" Qold U «[99»;
Q0, ._ permutation(x1,xg)
‘Ilg .— permutation(y1, ya)
‘I! .— argumentmappingOI’a, \Ilp)
for all [a.argJist,-,B.arg.list,-] 6 ‘I’ do in parallel
for all x), 6 a.arg..list,- and y], 6 ﬂ-arg.list; E ‘I' do
Match($k,yk.Qk—1,Qk);

end.for;
Qi ‘- Qn
endJ'or;

QM,” — evaluate(f21 , 92, ..., 060,4”);
else Qnew ‘- Qoldi
return;
case 4: a = f(z19 329 ..., 3m) and ﬂ = 9(3/19 3’2, "-9 ya):
m 7‘5 n % diﬂ'erent number of arguments
if ((ahﬁl) = transform(a,ﬁ)) then
MatCh(al i 31 i Qold: Qnew);
return;
end.

Figure 7.2. Matching Algorithm

 

97

Otherwise, and-branch subproblems for case 2 are generated for each pair of permuted
arguments of the input terms.

The deﬁnition of predicate consistency is given in Deﬁnition 7.2. Therefore, im-
plicitly, we exclude the possibility of mismatched analogy that allows a mapping that
is not one-to-one, even though it may be a useful analogy in the real world. If mis-
matches are allowed in the analogy system, then a considerably large amount for
domain knowledge would need to be encoded in the system’s knowledge base.

Case 4 in Algorithm 9 deals with the condition when the arguments of two input
terms have different sizes (m # n). In this case, we need some transformation rules to
“rephrase” the input terms to make their arguments have the same cardinality. The
transformation rules require domain knowledge. For example, suppose we want to
match {fa- and 5, since the functions square-root and division have different numbers

of arguments, they need to be transformed. If the system knows the rule J- = 3?,

then a match can be easily found:

((/./, ll, ll),(\/5a0:l1l: [1]),(1,d, {21321)}-

If the system only knows the rule ﬂ = ﬂ x 1, then we have the following match:

(U, x, l], U), (x/ic, l1]. lll),(1,da [2]. (21)).

our algorithm provides a framework for a domain-independent matching process but
the domain knowledge is tailorable to more speciﬁc types of information.

The complexity of this algorithm increased when each operator is commutative.

S uppose P is a predicate, then we deﬁne

maxJevel(P) = Tedrga:(P){level(P, T)}

where closure(P) and level ( P, T) have been deﬁned in Chapter 5. From the top-down

matching algorithm, two terms A, A E closure(a), and B, B inclosurew), are to be

 

M

98

matched if and only if

maxJevel(a) - level(a, A) = maxJevel(ﬂ) — level(ﬂ, B)

For each pair of commutative operators, the matching process generates two subprob-
lems. Therefore, it is trivial to determine the number of the levels in the search tree
since it is bounded from above by min{max..level(a),maxJevel(B)}. If length(P)

represents the number of symbols in the predicate P, then this algorithm’s upper

bound is

min{max.level(a),maxJevel(B)} x 2m”{"’°""”°’(°'l'm’ch’ml.

The function permutation() generates all possible permuted argument lists for a
commutative operator. In Algorithm 9, permutation() always generates two sets of
permuted arguments because, currently, the operators that are able to generate or-
branch subproblems are commutative and they consist of only two arguments. Once
associativity is incorporated into the case of generating an or-branch, then the number
of the sets of permuted arguments becomes exponential with respect to the lengths
Of input terms and the matching process encounters the possibility of combinatorial

eJ'tplosion.

The function argument-mapping() creates bindings between a pair of selected ar-
gument lists that are the output of the function permutation(). In our approach,
the corresponding arguments of two terms should be deﬁned over the same semantic
Class, i.e., equivalence class (semantic type) heuristic.

The function evaluate() chooses the best mapping from a set of mappings that are

output from a set of or-branch subproblems. Certainly, evaluate() needs some assess-

ment scheme that determines the degree of reusability for some analogical mapping.

 

99

The assessment scheme should be able to select the most promising mapping from a
set of candidate mappings to be reused, and it needs much programming and domain
knowledge. The assessment scheme is not formally addressed in this dissertation, but

a simple assessment scheme is given in the matching example of Section 7.6.

7 .6 Matching Example

A simple example given in Figure 7.3 is used to illustrate the matching algorithm
presented in the previous section. The initial problem with input terms is formulated
as:

{ x Z max(a,array(b)), y S min(queue(c),d) }.

And we use the procedure call

Match(x Z max(a,array(b)),y _<_ min(queue(c),d),{ }, QM”)

to initiate the matching process.

Figure 7.3 shows the processing tree of the matching procedure. Each node rep-
resents a matching problem and the root is the input problem being considered. We
deﬁne the and-node (or-node) to be a node generating and-branch (or-branch) sub-
Pl‘Oblems. A mapping-node is a node that generates a pair of matched terms, i.e.,
match. Every mapping-node has only one child node. For example, matching the op-

eraters ‘2’ and ‘3’ creates and-branch subproblems since the comparison operators
az- e not commutative. In contrast, the match of the operators max() and min() pro-
duCes or-branch subproblems because max and min are commutative. The matching
a‘lgorithm is recursively applied to the subproblems until either an empty set is gen-

erated or an inconsistency occurs. In this example, the only or-node generates the

following two subproblems:

{ {(0, queue(e)), (array(b). (1)}, {(0.61), (Queue(C),arrat/(b))} }.

100

 

A
[(x > max(s. crayon). y < min(queue(c). 4)”

n(m/O\c

((13)) ((mutamwnnuwctdm
[ (Is) (:8 I“)
1) mm». «mom»

/<-\

( (Md). (m0). d)) [ (ad). (we). 8181(5))!

{(I. We») Unsaid)! Had)! ((Mc). m0»)

(till-ltd) l (mm 1 (I1)”(l MI
I

l) l) ((c.b)l

..., l

t)

Figure 7.3. Simple example of the matching algorithm.

Where one of them is chosen to be returned to the parent node based on some assess-
ment scheme. Currently, we apply a simple assessment scheme where we assign the
weights 1, 2, and 3 to constants, variables and operators, respectively. The weight of
an and-node is the sum of the absolute values of its children’s weights. The weight of
a11 or—node is the minimum of the weights of its children. The weight of a mapping-
nOde is the difference of the matched terms’ weights plus the weight of its child. The
empty node has weight 0. The or-node chooses the set of matched pairs from the
children with minimum weight and passes them to its parent node. In this example,
the weights of nodes D and E are -1 and 1, respectively, so the weight of node B is
2° The weight of node C is 0 since both its children have weight 0. Therefore, node

A will return the matched pairs from node C to its parent node.

 

101

As shown in Figure 7.3, there is no inconsistency in this example. Moreover, we
assume that the arguments are deﬁned over the same semantic domain. Since an
or-branch is created, we should have two sets of matched matches returned from this

process. From the processing tree, we can easily recognize two sets of matches:

((2, S) ll: ll), (1’, yalllv [1]),(max. min, [2], [2]),
(a: queue(c),[2,l],[2,1]),(array(b), d, [2, 2]: l2, 2]) l

and

< (2.5,ll,ll).($,y,lll,[1]),(matamintl2lvlzl),
(array, queue, [2, 2], [2, 1]), (a, d,[2,1],[2, 2]), (b, c, [2, 2,1],[2,1,1]) );

where the latter set of matches is chosen according to our assessment scheme. A
more advanced assessment scheme is necessary to evaluate the matches and to give
the match information to the users. The assessment scheme can be an explanation-
based or tutoring system that contains a large amount of programming knowledge.
The ﬁnal “best” match is located in the set of ﬁnal partial mappings an.

The rest of this section presents another example to show the applicability of
the matching algorithm. An existing component, the Larch speciﬁcation of the
Stack class is given in Figure 7.4. This component deﬁnes three methods and
all of them are implemented in the public segment of a C++ class. The methods
Stack and ~Stack are the constructor and destructor of class Stack, respectively.

The method Stack: :push (Stack: :pop) adds (deletes) an Object element to (from)
St ack. Stack: :top returns the topmost element of Stack. The C++ implementations
for the methods of Stack class are given in Appendix E.3.

A query speciﬁcation, the speciﬁcation of DoubleList class, is given in Figure 7.5.

In addition to constructor and destructor, this component deﬁnes four methods:
DoubleList: :addAtHead, DoubleLi st : :addAtTail, DoubleList : :detachAtHead,

and DoubleList: :detachAtTail. For the purpose of conciseness, we only consider

102

 

component Stack
abstract type Stack;
public
void Stack(Stack 8stack) {
modiﬁes 8stack;
ensures (*stack)’ 8 Nulljtack;
}
void ~Stack(Stack *stack) {
modiﬁes *stack;
ensures trashed(8stack) ;

}

void Stack: :push(Stack 8stack, Objectt newElement) {
requires “fu11(*stack) ;

modiﬁes *stack;
ensures top(*stack’ ,nevElenent) A

size(stack’) 88 size(stack) + 1;
}

bool Stack: :detach(8tack tstack, Objectt topElenent) {
requires nenpty(*stack) ;
modiﬁes 8stack;
ensures top(*stack, topElenent) A
size(stack’) 88 size(stack) - 1;
}

Objectt Stack::topElenent() {
requires qemptyot'stack) ;
ensures result 8 topElenent A top(8stack,topElement) ;

}

Figure 7.4. Larch speciﬁcation of Stack class.

 

103

 

component DoubleList
abstract type DoubleList;
public
void DoubleList(DoubleList *dbllist) {
modiﬁes lt'dbllist;
ensures (*dbllist)’ 8 NULLDoubleList;
}
void ~DoubleList(DoubleList 8dbllist) {
modiﬁes *dbllist;
ensures trashed(8dbllist) ;

}

void DoubleList::addAtHead(Doub1eList *dbllist, Objects newElement) {
modiﬁes *dbllist;
ensures head(tdbllist’ ,neuElement) A
length(dbllist’) 88 1ength(dbllist) + 1;
}

void DoubleList::addAtTail(DoubleList *dbllist, Objecta newElement) {
modiﬁes 8db11ist;
ensures tail(*dbllist’ ,nevElement) A
1ength(dbllist’) 88 1ength(dbllist) + 1;
}

void DoubleList::detachAtHeadCDoubleList 8db11ist, Objectt elelent) {
requires head (*dbllist , element)
modiﬁes *dbllist;
ensures nhead(*dbllist’,element) A
(element ¢ *dbllist’) A
1ength(dbllist’) 88 length(dbllist) - 1;
}
void DoubleList: :detachAtTailCDoubleList Ii'dbllist, Objectt element) {
requires tail (*dbllist , element)
modiﬁes I'Idblhst;
ensures ntail(*dbllist’,element) A
(element ¢ *dbllist’) A
length(dbllist’) 88 1ength(dbllist) - 1;

}

Figure 7.5. Larch speciﬁcation of DoubleList class.

104

the relationship between two methods - Stack: :push and DoubleList: :addAtTail.
In order to ﬁnd an analogous existing component based on query speciﬁcation
DoubleList, we apply our matching algorithm to the methods of DoubleList. Fig-
ure 7.6 shows the results of the application of the matching algorithm to the method
DoubleList: :addAtTail and the method Stack: :push.

A prototype system for facilitating software reuse has been implemented in the
Quintus ProWindows language: a dialect of Prolog that supports the object-oriented
organization of graphical elements. Our system provides the functions of constructing
the hierarchical library [42], retrieving the existing components that have a generality
relationship with the query component [44], and assisting users in the modiﬁcation of
more general and analogous existing components to satisfy the query speciﬁcation [66].
The left part of Figure 7.6 displays the two-tiered hierarchy of a group of components

described by formal speciﬁcations. The Candidate Analogies window displays the
matches that are the results of the matching algorithm. The matches are helpful in
terms of modifying the existing components for reuse because the users may discover
inherent similarities between two components that have no logical relationships that
can be found by automated reasoning. Given these candidate matches, the user can
reuse or redesign the query component. In this example, the system suggests several
matches that may be useful in the modiﬁcation process. The result suggests that, in
order to satisfy the query speciﬁcation, the input object should be changed from stack
t"O dbllist and the new element should be added at the tail of dbllist instead of
the top of stack. The C++ implementation for the method DoubleList: :addAtTail
and other methods of DoubleList class is shown in Appendix EA.
The results from the matching process potentially facilitate the modiﬁcation pro-
cIess by providing the necessary information of speciﬁcation changes as described in

the above examples. A modiﬁcation example based on analogy will be illustrated in

 

 

" A product of Quintus Computer Systems, Inc.

105

 

   

 

 

 

mmmﬂmmﬁmm

c.9001

 

C... '001’

 

Come“ nucleon.

sow no." Component: moo-Noun
low Gooey "bod: MT.“

low “at." Gonna-at: 6.“.
Selected Win “but: noel

 

 

c-uun- one”...

  

  

nzﬁthst ¢-> 00'...th
use QIHst t-D stock

 

  

tut c-a tee

 

Figure 7.6. An implementation for matching process.

 

106

detail in next section. Another challenge is to provide the candidates for the match
process, i.e., how to solve the base ﬁltering problem mentioned in Section 7.1. Based
upon the similarity deﬁned in the construction and retrieval processes, the system

returns a set of similar components (to the query speciﬁcation) to the users as the

candidate speciﬁcations for the matching process.

7 .7 Modiﬁcation Example

Program modiﬁcation is a combination of analogy, transformation, synthesis, and
veriﬁcation. In this section, we give an example of program modiﬁcation based on
analogy. We can see how the matching process plays an important role in the modi-

ﬁcation process. Consider the following speciﬁcation of a square root program:

{011 020Ae>0Ar€Real}
{Riilﬁ—rke}

where Q1 and R1 are the pre- and postconditions of the square root program, re-
Spectively. We are given two numbers a and e and the desired result is an ap-
proximation r, which is a real number, to the square root of a with a tolerance

value c. Assume we have an existing program that performs real division as follows:

{Qg:0_<_c<dAe>0}
begin
8:: 1; q:=0;
whiles>edo
s:=s/2;
ifd*(q+s)$cthenq:=q+sﬁ
od.
end.

{Rz=IC/d-q|<e}
Then we apply the matching process to the pair of postconditions of these two

Programs, i.e., R1 and R2: { (I ﬁ— r |< e), (I c/d— q K e) ]. If the system has the

transformation rule: \/€—l = 5?, then a match can be found as follows:

107

<(/, /’ U, U), (ﬂ, C, [1'' 1’ 1]? l1, 1’ 1]), (d2 1" [1’ 1’ 2i, [1’ 1’ 2])(1', q? [172]? [112]), (e? e, [2], [2]))

The match is then applied to the real division program and it becomes

begin
3 := l; r:= 0;
while 3 > e do
3 := s/2;
if1*(r+s) S ﬁthenr:=r+s ﬁ
od.
end.

However, 1 :0: (r + s) S J; contains \fd which is to be implemented so we need to

rewrite the statement

l*(r+s)S\/ci

by

(r+s)2Sa

that preserves the semantics but eliminates J5 from the program. Squaring, addition,
- and comparison are regarded as elementary operations. We obtain the following
program:

{Q1:a20Ae>0ArEReal}

begin
3 := 1; r:= 0;
while 3 > e do
8 := s/2;
if(r-l-s)2 Sathenrz=r+s ﬁ
od.
end.

{Ri=l\/5-7‘|<€}
In this algorithm, the result r falls in the range [0,s), i.e., 0 S r < s. However,
0 S r S {EU/2)" < 27°(1/2)" =1 so 0 S r <1. Once a > (1+ e)”, this program
will never ﬁnd the desired answer. This problem can be solved by replacing the

initialization command 3 := l by s := a + 1 because the square root of a is bounded

108

from above by a + 1. Consequently, the desired program becomes

{Q1:a20Ae>0Ar€Real}
begin
3 := a+1; r:= 0;
whiles>edo
s:=s/2;
if(r+s)2Sathenr:=r+sﬁ

od.

end.

{Rlzlﬁ—rke}

Despite the simplicity of this example, the potential beneﬁts of program by mod-
iﬁcation is apparent. In the example of this section, the programmer can save some
programming effort by reusing the modiﬁed program instead of having to program
everything from scratch. Once the analogical match is found, the programmer has
to develop only those parts of the program that cannot be reused from the old one,

which hopefully is much less than to generate the entire program.

7 .8 Summary

This chapter presents the framework and an approach to applying analogical matching
process to reusing software components that are described by formal speciﬁcations.
Section 7.5 presents a top-down matching algorithm that is consistent, equivalence-
class based, and partial homomorphism-preserving. This algorithm allows the com-
mutative operator to be matched with another commutative operator, thus increasing
the power of the matching process, i.e., the possibility of ﬁnding more analogies. In
Section 7.7, an example shows the applicability of the matching process to program

modiﬁcation that assists the user in developing programs that make use of existing

Software.

The speciﬁcation of the query and candidate speciﬁcations may be given in a

form that obscures any analogical matching. Thus, we would like to express the spec-

109

iﬁcation in some equivalent form that makes their similarity more pronounced, for
example, the transformational process trans f orm() changes the form of some speci-
ﬁcation but preserves its meaning. This problem is in general difﬁcult to solve. Two
functions in the matching algorithm that are not presented in detail are evaluation()
and argument-mapping. The latter can be implemented by sorting arguments and
binding the arguments with same or similar semantic types. The challenge is how to
allow associativity to be incorporated into the matching algorithm and to avoid the
combinatory explosion. The evaluation function is more complex in the sense of a
large amount of domain and programming knowledge required to make a reasonable

assessment and return the best mapping to users.

CHAPTER 8

Case Study

8.1 Introduction

This chapter describes the application of our reuse framework on a large-scale exam-
ple. More speciﬁcally, the problem domain is the construction of graphical user inter-
faces based on existing graphical software components. The Larch speciﬁcation lan-
guage is used to specify the X window’s widgets, where we include a discussion of the
issues that we encountered in the process of specifying widgets. We also include a dis-
cussion of the strengths and weaknesses of our reuse processes [42, 43, 44, 60, 67, 68].

Several toolkits such as Xt [69], Motif [70, 71] and XView [72] have been built to
enhance the development of graphical user interface software. However, these toolkits
can only be applied at the implementation level and not at the speciﬁcation level.
Therefore, some common problems in software engineering, such as completeness and
Consistency, may arise during the life cycle of a software system that includes the
graphical user interface developed by these toolkits [73].

We begin with a brief description of X window’s widget set/library and Larch
in Section 8.2. In Section 8.3, we present a sketch of the speciﬁcation for window
Widgets. The full speciﬁcation is in Appendix C.2 and C.3. Section 8.4 gives an

example of specifying a scrolled text editor with popup menus. Section 8.5 shows the

110

111

application of the reuse process to a set of formally speciﬁed window widgets. Finally,

Section 8.6 summarizes this chapter.

8.2 Widgets and Larch

This section gives a brief introduction to Motif widgets and their Larch speciﬁca-
tions. Detailed descriptions of Motif widgets can be found in Motif programming
manuals [70, 71, 74]. Complete Larch speciﬁcation languages (LSL and Larch inter-
face languages) can be found in [37, 40, 45, 75, 76].

The Motif widget set contains many components, including scrollbars, menus,
buttons, etc., and they can be combined to create user interfaces. The Motif widget
set is designed to encourage the programmer to follow the Motif Style Guide, which
is based on the behavior of Microsoft’s Presentation Manager [70]. We can divide
the widget classes provided by Motif into several categories based on the general
functionality they offer. For example, some widgets display information, while others
allow the user to select from a set of choices. Still others allow other widgets to be
grouped together in various combinations. Appendix C.1 lists the developed Motif
widgets of X window systems.

As mentioned in Chapter 3, the Larch Shared Language (LSL) describes state-
independent properties of a program. For example, the Manager Window trait (Fig-
ure 8.1) introduces the sort Manager Win and the operators move-win and resize-win;
three equations constrain the meanings of move_win and resize-win, respectively. Each
trait name starts with “Lt” represents “Larch Trait”.

We write the module speciﬁcation in a Larch interface language to describe state—
dependent effects of a program. A requires clause states each procedure’s precon-
dition; a modiﬁes clause lists those objects whose values may possibly change; an

ensures clause, given the postcondition. The assertion language for the pre- and post-

112

 

LtManager Window( Manager Win): trait
includes LtComposite Window

ManagerWin tuple of pos: Coord,
size: Size,
label: String
introduces
move-win: ManagerWin _. ManagerWin
resize-win: ManagerWin —> ManagerWin

asserts
V win: ManagerWin

move-win( win ) pos == coord..map( win. pos )
move-win(win).size == win.size
move-win(win).label == win.label
resize-win(win).label == coord.map(win.pos)
resize-win (win). label == length-map(win.size)
resize_win(win).label == win.label

Figure 8.1. The ManagerWindow trait.

 

conditions is drawn from LSL traits. Through based on clauses, a Larch interface
links to LSL traits by specifying a correspondence between programming language-
speciﬁc types and LSL sorts. An object has a type and a value that ranges over terms
of the corresponding sort. Part of the interface speciﬁcation for the Scrollable Text
widget (Figure 8.2) deﬁnes the type Scrolled Text, which is based on the Scrollable sort
and Text sort, introduced in the ScrollableState and TextState traits, respectively. The
scroll.win procedure’s precondition requires that the window selected is exactly the
window that is to be scrolled, where window.to-O(win) is a coercion operator. The
postcondition states that the display value of the window is updated (as deﬁned by
the set-display operator whose meaning is obtained from ScrollableState) and that all
windows are unselected. In a postcondition, an undecorated formal e, stands for the
initial value of the object; e’ stands for the final value. The modiﬁes clause states

that scroll.win may change only the display of the selected window.

 

113

 

type Scrolled Text based on Scrollable from Scrollable Window
Text from Text Window

operation scroll.win(w: Window, view: View)
requires e.selected_win = {window.to-0(win)}
modiﬁes (e.g.,)

ensures win’ = set.scrolled.display(win,view) A e’.selected.win = {}

Figure 8.2. Part of the Scrolled Text speciﬁcation.

 

8.3 The Speciﬁcation for Widgets

Several software systems have been formally speciﬁed by the Larch speciﬁcation lan-
guage. For example, one of the earliest case studies of formally specifying “real”
systems with the language Larch involves Avalon/C++ objects [77]. Also Larch has
been used to specify visual editor Miré [78, 79, 80]. The behavior of concurrent sys-
tems are formally described by a Larch interface language GCIL [81, 41]. A copying
collector for garbage collection has also been speciﬁed in Larch [82].

In this section, we specify the Motif widgets in Larch. The speciﬁcation itself
is composed of two parts, one in the Larch Shared Language (LSL) which is used
to specify general Motif widgets, and one in the Larch/ C (LCL) which uses the LSL
traits to specify the speciﬁc C implementation of the widgets. Figure 8.3 illustrates
the hierarchical structure of LSL traits, where a trait inherits all the properties of its

parent trait. Each oval corresponds to a trait, and an arrow indicates that one trait

i ncludes another.

 

114

 

 

Om-

“mudd-

Figure 8.3. The dependencies of the Window’s traits.

 

8.3.1 Primitive Widget

Figure 8.4 shows a trait that deﬁnes a window as an object composed of content and
clipping regions, foreground and background colors. and a window identiﬁer. The
member symbol ‘6’ is qualiﬁed by a signature in the last line of the trait because it
is overloaded, and it is necessary to indicate ‘6’ is converted. Figure 8.5 shows the
trait Primitive that deﬁnes a primitive widget used in the deﬁnitions of most widgets.
The Set trait is deﬁned in the Larch Handbook; the renaming of sort identiﬁer in the
ﬁrst Set trait gives the sort WidgetSet for sets of items of sort Widget and all other
Operators. The operator that generates a widget is create-win.
We deﬁne each of the remaining operators in the trait with equations in the
asserts clause. The operator create(w) creates: the widget w, and quit(w) destroys

the widget w. The operator open(w) opens current widget w, and close(w) closes

115

 

LtBasic Window: trait
assumes LtCoordinate
includes LtRegion, LtView, Displayable( Window)
asserts Window tuple of pos: Coord,
size: Size,
cont, clip: Region,
front, back: View,
id: WId
forall w: Window, cd: Coord
cd 6 w == cd 6 w.clip
w/cd] == if at 6 w.cont w. front else w.back
implies converts ./-h E: Coord, Window —-> Bool

Figure 8.4. The Basic Window trait.

 

current widget w. The operator move(w,6) moves the widget w in the screen by 6.
The operator resize(w,7) resizes the widget w to the size 7. The observer operator
size(w) reports the size of the widget w and pos(w) the position of the widget w.
The operator overlap(w1, wg) determines if widgets wl and w; overlap. The operator
id(w) returns the identiﬁer of w. The operator parent( w) returns the parent widget of
w, and child( w) the child widgets of w. The observer operator is_child(w1, 102) decides
if ml is a child widget of 102. The function coor.map is a function that transforms
some widget’s coordinates from its original position to the destination. Similarly, the
function lengtthap is a function that transforms some widget’s geometry and size.

A new sort Widget is introduced and will be inherited by all widget traits that are

deﬁned in Appendix C.2.

8.3.2 Object and SetResource

The Object trait deﬁnes two new sorts that are useful in manipulating objects in a

trait regardless of what kind of widgets they are; the SetResource trait introduces

116

 

‘LtPrimitive: trait
includes LtBasic Window, LtView, Set( Widget, WidgetSet)
Widget tuple of pos: Coord
size: Size
id: Real
introduces
create: —+ Widget
destroy: -8 Widget
open: Widget —* View
quit: Widget -+ View
selected: Widget —. 8001
move: Widget, Coord —v Widget
size: Widget -> Size % observers
set-size: Widget, Size —> Widget
pos: Widget -i Coord
set_pos: Widget, Coord —-v Widget
overlap: Widget, Widget —* Bool
id: Widget —* Integer
parent: Widget -> Widget
child: Widget, Real —. WidgetSet
is-child: Widget, Widget -> Bool

asserts
V w: Widget

move(w).pos == coord.map(w.pos)
move(w).size == w.size
move(w).id == w.id
resize(w).id == coord..map( w. pos)
resize (w).id == length-map(w.size)
resize(w).id == w.id

w generated by create
w partitioned by id

Figure 8.5. The Primitive trait.

 

 

117

sorts and operators to change an arbitrary resource’s attribute in a widget. Since
many of the widget operators are essentially the same, we would like to operate on
objects or a set of objects rather than having separate operators for different widgets
in window systems. For example, selecting a displayable widget does not depend on
what kind of widget is selected, so would be appropriate to have a single operator
to select a widget. The Object trait (Figure 8.6) introduces the new sorts Obj, a
union of all widget types, and ObjSet, a set of objects [80]. The union of shorthand
provides coercion operators between the union set and its component sorts. So the

union declaration:

Obj union of wgt: Widget

produces operators with the following signatures:
WGT.O: WidgetSet —+ Obj
_. WGT.TYPE: Obj -» WidgetSet
TAG: Obj -+ Obj.tag
The operator WGT.O coerces a set of widgets to an object, .WG T coerces an
object back into a set of widgets, and TAG is used to determine what kind of widgets
the object contains. The Object trait also introduces operators to manipulate sets
of objects. The operator objects returns the set of all objects in a window; widgets
extracts the set of widgets from a set of objects. The operator toggle_in adds the
speciﬁed object to a set of objects if it is not in it, otherwise it deletes the object.
Moreover, the coercion function is deﬁned recursively, that is, a set of objects can
also be coerced into an object.
The SetResource trait (Figure 8.7) contains the speciﬁcation for the set..attr oper-
ator, which takes an object Object, ﬁeld name, and a value, and returns a new object

that is the same as Object except that it has a new value for its ﬁeld Attr.

 

118

 

LtObject( Obj): trait
includes LtBasicWindow, LtPrimitive Widget, Set( Obj, ObjSet)

Obj union of widget: Widget

introduces
objects: Window -+ ObjSet
widgets: ObjSet —v WidgetSet

asserts
V w: Widget, ws: WidgetSet, a: Attr,
as: AttrSet, obj: Obj, as: ObjSet, win: Window

objects(create-window == {}
objects( insert.widget( win, w) ) ==
insert( objects( win), { w} . wgtJype ({ w} ))

widgetsm) == {}
widgets(insert(os,obj)) ==
if TAG(0bj) = win-typc(obj)
t hen insert( widgets ( os ), obj. wgt-type (obj) )
else widgets(os)

implies converts objects, widgets: ObjSet -> WidgetSet

Figure 8.6. The Object trait.

 

 

LtSetResoumeMttr, Val): trait
includes Set(Attr for E, AttrSet for C), LtObject( Ob for Obj)
introduces
valid.attr: Ob, Attr -’ Bool
mliivalue: Attr, Val —> Boot
get.value: Ob, Attr -> Vol
set-attr: Ob, Attr, Val -> Ob

asserts
get..value( set_attr (Ob, Attr, Val), Attr’) ==
if equal(Attr,Attr’)
then Val

else get-value( Ob, Attr’)

Figure 8.7. The SetResource trait.

119

8.3.3 Well-Formed Window

Before introducing well-formedness, we deﬁne some relationships between widgets.
For two widgets to, and 1.02, if wl contains w, then to] is the parent of w; and 102
is a child widget of wl. We deﬁne w; to be a descendant of w if either (l) w; is
a child of ml or, (2) there is another widget 1123, where 112;, is a child of ml and w;
is a descendant of 1.03. The relationship ancestor is deﬁned similarly to descendant.
With LtPrimitive trait, we have introduced the widget sort, Widget, and the basic
operators. One well-formedness condition for widgets is that for any two widgets,
say w, and 102, if mg is a descendant of 10,, then to] must not be a descendant of
1.02. Another constraint is that the geometry, size, and location of a descendant is
determined by its parent widget. If a widget is destroyed then all of its child widgets
are destroyed, too. The widgets with the above constraints are called well—formed
widgets. If all widgets of a window are well-formed, then the window is a well-formed
window.

The well-formed Widget trait in Figure 8.8 introduces operators that deﬁne well-
formedness properties and the new well-formed version of the operators that create
and modify a widget. In many cases, the result of a well-formed operator differs
from the result of its non-well-formed counterpart. For example, deleting just a wid-
get may violate well-formedness, since it may result in “dangling” widgets that have
no parent. Hence, delete-wf_widget must delete all child widgets before deleting the
widget. Thus, we introduce an additional operator, delete-children to handle the
deletion of a set of child widgets of a. widget. The operator delete-objs returns a
widget that is the result of deleting a set of objects in a well-formed manner. The
Operator extract-wf returns a widget w that is a maximally well-formed subset of a

Set of objects, i.e., no ancestor of w is well-formed. The result of extract_wf(os) is a

widget that contains all the objects of 03 except the dangling widgets. The mean-

120

ing of the operators child, parent. descendant, and ancestor are straight forward.
The operator is-well.formed(w) checks if w is a well-formed widget. The result of
managed.by(w1, 102) will show whether the resources of ml is managed by 102. The

formal assertion of well-formedness is deﬁned in Figure 8.8.

 

Lt WF Widget( WF Widget ) trait
includes LtPrimitive, Set( Widget, WidgetSet), LtSetResource, Object
introduces
ancestor: Widget, Widget — Bool
descendant: Widget, Widget —» Bool
is-well.formed: Widget —. Bool
managed.by: Widget, Widget - Bool
eztract-wf: ObjSet —+ WidgetSet
delete_wf.widget: Widget, Widget—v Widget
delete-objs: Widget, ObjSet -+ Widget
delete_children: Widget .. Widget
asserts for all w, w’: WFWidget
is-well.formed(w) ==
(managed.by(w, parent(w)) A
(for all w’::descendant(w’,w) => - ancestor(w’, w)))
parent(w,w’) == child(w’,w)
ancestor(w,w’) == descendant(w’, w)
for all w’ E extract..wf(w) => is-well.formed(w’)

Figure 8.8. The WFWidget trait.

 

8.4 Speciﬁcation of Scrolled Text Editor with

Popup Menus

Given the Motif Widget Model, we now build the formal speciﬁcation for a text
editor with scroll bars and popup menus, which is shown in Figure 8.9. We begin

by establishing a model of the text editor state at the trait level. Since the text

121

editor widget with the popup menu has child widgets ScrolledText and PopupMenu,
we also need to specify their traits in advance. The interface level speciﬁcation then
introduces the editor operations deﬁned in terms of changes to the state. Much of
the lower-level detail is assumed, e.g. mapping a mouse to keyboard actions and

how text interacts with words. Many of the details can be found in X window’s

manuals [83, 84].

 

 

Text Editor V3- belles, dlr; / ho me/golden/uZS/je ng/motlflchoz

Mncl ude (XI/th)
E

 

 

   

#1 ncl ude <xI/Push8. h>

  
 

Text Pane

Load File...
Vlew SB-ECrmei‘leiié

widget toplevsl . button: Edlt Store as New File...

XtAppContext app;
void Leas ushedO: Include File...
XnString l el; Empty Document

toplevel - Xtvanpplnttiallze(&a
aargc. argv, NULL. NULL):

label - XIStrlngCreate51Iple(' Push here to say hello“ );
button - XtVaCreateHanagedw1dgetC' pushes” .
anushButtonwidgetC ass. toplevel
XuNlabelString. label.

NULL);
XsStringFreeClabel):

 

  
 

sainCargc. argv)
Ehar *argvll:

    
   
    

 

 

 

 

 

 

 

Figure 8.9. An Example of Scrolled Text Editor with Popup Menus.

 

8.4.1 The X Model

The basic interface of this editor is straightforward. The main part of this editor

is the editing area, where the user actually edits the document. On the top of the

122

editing area, there are several popup menus that allow the user to specify operations,
e.g., load a new ﬁle. The user can also click in the editing area to popup the same
menu as those on the top of the editor. Along the right side of the editing window,
there is a scroll bar that allows the user to view the text vertically if the text is longer
than the display area.

The scrolled text editor with a popup menu contains a number of different man-
agers and primitive widgets, but appears to the user as a single, conceptually focused
user-interface object. The parent-child relationships among the widgets in this edi-
tor can be graphically illustrated using the tree-structure shown in Figure 8.10. A
widget contains a set of child widgets. for example, the MenuBar widget at the top
of the editor contains four PullDownHenu widgets with labels File, View, Edit, and
Find, respectively. Each PullDovnMenu widget contains a CompoundString widget
for labeling and a CascadeButton widget which also contains a set of PushButton
widgets. Due to limited space, not all widgets used in Figure 8.9 are shown, e.g., only

the child widgets of MenuIten File are displayed in Figure 8.10.

8.4.2 The LSL Level

The main purpose of formal speciﬁcation is to help the user grasp the concept of the
system easily instead of becoming immersed in implementation details. For example,
a user is not concerned with how a managed widget is mapped to the pixel level
(intensity and location) on the screen. For another example, in the domain of Motif
widgets, there is not a speciﬁc widget designed for a scrolled text editor. A user can
create a scrolled text editor by using the text editor widget (Xm’l'ext) and scrolled
window widget XnScrolledHindow). However, at the speciﬁcation level, we are free
to design such a speciﬁcation component in order to enhance the understandability of
the system. The speciﬁcation of the scrolled text editor with popup menus consists

of several widget traits. The interface speciﬁcations of those objects are described in

123

 

 

 

 

She’ll—J

ScrolledWindow

 

 

 

 

 

 

 

 

 

 

 

 

 

 

[DrawingAres] I MefuBsr j I TenPane I WenieslScrollBsn

 

 

 

 

1
[meaning] [ rainbow-Meg] [ PullDownMenu] [PullDownMenuJ lPullDownMcou ]

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

(View) ‘ (Edit) (Find) (Esau)

 

 

 

 

 

 

Cage-El

SE 1" .Z-l'... Il-

(mm ) (macaw-vi

 

 

 

 

 

 

 

(MM f) (man-cm; (amumm )

 

 

 

 

Figure 8.10. Parent-child relationships between widgets.

 

the following section. Before describing the trait of the scrolled text editor with a
popup menu, we specify ﬁve constituent traits: LtScrolled, LtTextField, LtText,
LtScrolled‘l‘ext, and LtMenu.

Figure 8.11 gives the trait LtScrolled that is the Larch Shared speciﬁcation of the
ScrolledWindow widget XmScrolledHindow. ScrolledWindow provides a scrollable
view of data that may not be visible all at once. ScrollBars allow a user to scroll the
visible part of the window through the large display. A ScrolledWindow widget can
be created so that it scrolls automatically without application intervention or so that
an application provides support for all scrolling functions. Our ScrolledWindow trait

LtScrolled only gives necessary operators for a ScrolledWindow, and other features

124

such as “scroll up by the size of the viewing window” are omitted.

 

LtScrolled: trait
includes LtView, LtManager, LtPrimitive
Scrolled tuple of type: {vertical,horizontal},
policy: { automatic, app-deﬁned} ,
introduces
create: —. Scrolled
destroy: Scrolled —.
scrolLup: Scrolled -+ Scrolled
scrolLdown: Scrolled —* Scrolled
scrollJeft: Scrolled —> Scrolled
scrolLright: Scrolled —. Scrolled

asserts
scroll.up(create).cont == Empty
scroll.down(create).cont == Empty

scroll.left(create).cont == Empty

scroll.right(create).cont == Empty
scmll.up(scroll_down).cont == scmll.down(scmll.up).cont
scmll.left(scroll.right).cont == scroll..right(scmll_left).cont

Figure 8.11. The LtScrolled trait.

 

A TextField widget provides a single-line text editor that has a subset of the
Text widget that is a text editor. We put the speciﬁcation of TextField ﬁrst, since it
is easier to specify a full text editor if we have ideas about how to specify a single-line
text editor. The line editor allows movement of a cursor on a line of text and provides

a means to change symbols on the line. A user of this line editor will be offered the

following facilities [85]:

1. Generating an empty line and placing the cursor at the initial position

2. Inserting a symbol at the cursor position, and moving the cursor to the right of
the new symbol.

3. Deleting a symbol to the right of the cursor position.

125

4. Moving the cursor one position to the left, provided that it is not already at
the beginning of the line.

5. Moving the cursor one position to the right, provided it is not already at the
end of the line.

6. Moving the cursor to the beginning of the line.
7. Moving the cursor to the end of the line.

8. Clearing the whole line.

The cursor will be placed between symbols, so it divides a line into two parts:
what is to the left of it and what is to the right of it. This observation proposes to
specify a line together with its cursor as a pair of strings, where the cursor is viewed as
taking a position between two strings, and the line is viewed to be the concatenation
of two strings. The following speciﬁcation given Figure 8.12 is based on this idea.

A Text widget (Xm'l'ext) provides a text editor that allows text to be inserted,
modiﬁed, deleted, and selected. The operations of Text subsume those of single-line
editor. We may generalize the operators used in Lt‘l‘extField to design the new
trait LtText for the Text widget. A document (ﬁle) is regarded as a stream in UNIX
system. Borrowing the concept of a ﬁle from UNIXt we introduce a new abstract data
type stream that is a list of symbols. Similarly, we specify a text together with its
cursor as a pair of streams, where the cursor is viewed as taking a position between
two streams, and the text is viewed as the concatenation of two streams. Figure 8.13
gives the Lt'l‘ext trait. The difference between a text and a line is that the former
has a two-dimensional layout and a. user can move the cursor around the screen in
both horizontal and vertical directions. Therefore, we introduce the operator line
to indicate at which line the cursor is currently located. Other parts of Lt‘I‘ext are
analogous to those of LtTextField.

Menus are basic interactive components used in the design of a graphical user

interface. A Menu consists of a set of items and allows the user to choose one of

 

‘UNIX is a trademark of AT& T.

126

 

Lt Tethield(TF): trait

includes Lt View, String, LtCharacter, LtPrimitive

introduces
cursor: String, String —+ TF
create: —-> TF
insert: TF -o TF
delete: TF —* TF
moveJeft: TF -> TF
move-right: TF —> TF
ﬁrst: TF —v Char
last: TF -* Char
clear: —+ TF

asserts forall c: Char; 3,31,32: String; tf: TF
insert(c,cursor(s1,s2)) == cursor(append(sl,c),s2)
delete(cursor(sI,Empty)) == cursor(sI,Empty)
delete(cursor(sl,append(c,s2))) == cursor(sl,s2)
move-left(cursor(Empty,8)) == cursor(Empty,s)
move-left(cursor(append(sl,c),82)) == cursor(sI,append(c,s2))
move-right(cursor(s,Empty)) == cursor(s,Empty)
move-right(sl,cursor(append(c,32))) == cursor(append(s1,c),32)
ﬁrst(cursor(s1,82)) == cursor(Empty,concat(s],82))
last(cursor(sl,32)) == cursor(concat(sI,s2),Empty)
clear(tf) == cursor(Empty,Empty)

Figure 8.12. The LtTextField trait.

 

the items to perform the desired event. In Figure 8.9, the menubar consists of four
items: File, View, Edit, and Find. The popup menu contains the same items plus
one more item, Extras. Each item of the menubar and popup menu is a pulldown
menu that displays another set of items to be selected. In the domain of Motif,
HenuBar, PopupMenu, and PulldownMenu are all instances of the RowColunn widget
with different types of XmNrowColumnType resource. However, at the speciﬁcation
level, we should not restrict the menu systems to one implementation such as Motif.
The speciﬁcation of menu should be abstract enough so that it can describe how a

menu behaves without regards to the underlying implementation details. Figure 8.14

127

 

LtTezt(Te1:t): trait
includes LtTethield, Lt View, LtLine (LN for Line_Number)

LtCharacter(CN for Chamcter.Number), LtPrimitive
introduces
cursor: LN, String, String —. Text
create: _. Tezt
insert: Test, Stream —. Test
delete: Test, Stream —. Test
moveJeft: Test -> Test
move-right: Test _. Test
move-up: Test —> Test
move-down: Test —> Test
first: Test -> LN
last: Text —v LN
line: —§ LN
contents: Test, LN —. Stream
clear: Text —*
asserts forall c: Char; cn:CN; s,s1,sZ: Stream: n, n1, n2: LN; t: Test

ﬁrst(cneate) == Empty
last(create) == Empty
ﬁrst(cursor(sl,s2)) == cursor(Empty,concat(s1,32))
last(cursor(s1,32) ) == cursor(concat(sl,52),Empty)
insert(c,cursor(s1,s2)) == cursor(append(sl,c),s2)
delete(cursor(s1,Empty)) == cursor(sI,Empty)
delete(cursor(s1,append(c,82))) == cursor(s1,32)
move-left(cursor(Empty,s)) == cursor(Empty,s)
move-left(cursor(append(sl,c),s2)) == cursor(s],append(c,32))
move-right(cursor(s,Empty)) == cursor-(s. Empty)
move_right(sl,cursor(append(c,s2))) == cursor(append(sl,c),s2)
line(move-up((cursor(sl,52))) ==

if equal( line ( cursor(s] ,32) ), 1)

then 1 else line(cursor(sl,s2)) - 1
line (move-down((cursor(s1, 82))) ==

if equal(line(cursor(sl,s2)),line.mar)

then linemaz else line(cursor(sl,s2)) + I
clear(t) == cursor(Empty,Empty)

Figure 8.13. The LtText trait.

 

128

 

LtMenu(Menu): trait
includes LtPrimitive, LtButton
Menu tuple of b1: Button,
b2: Button,

bu: Button
introduces
open: —' Menu
close: Menu -+
display: Menu -> Bool
submenu: Button -’ Menu
selected: Button -» Bool
associated: Button, Event -v 8001
ﬁre: Event -—> Bool
act: State. Event -» State
asserts forall b. b,-, b,- 6 ButtonSet; e 6 EventSet
if selected(bg) A i ¢ j then -1 selected(bj)
if selected(b) A associated(b,e) then fire(e)
if selected(b) A associated(b,DisplaySubmenu)
then display(submenu(b))

Figure 8.14. The LtMenu trait.

 

contains a speciﬁcation of LtMenu trait, where Menu is deﬁned as a new type con-
sisting of a set of buttons. The type Button is assumed to be predeﬁned. Only one
button is allowed to be selected at one time. Each button is associated with one
event. As long as some button is selected the associated event is ﬁred and the state
will be changed. Since only state independent properties are speciﬁed in traits, the
operator act will be described in the interface level. The submenu denotes the Motif ’s
Pulldouﬂenu which is a menu pane for all types of pulldown menu systems, including
menus of a menu bar, cascading submenus, and the menu associated with the op-
tion menu. A PulldownMenu is associated with a CascadeButton. A PulldownMenu
can contain PushButton, ToggleButton, and CascadeButton. Here, we represent the

event displaying all types of such buttons by the operator submenu that can popup

129

a new Henu object and the associated event is DisplaySubmenu. Various kinds of

events can be deﬁned by the users.

 

LtScrolledTezt(Sc th): trait

includes LtScrolled, LtTest

introduces
all.visible: Scth —i Bool
visible: Scth, View —> Boot
visible-content: Scth -> View

asserts
all.visible(visiblexontent(Sc th)) == true
visible(Scth, visible-content(ScTz-t) == true

Figure 8.15. The LtScrolledText trait.

 

The LtScrolled‘I‘ext trait can easily be speciﬁed by including predeﬁned traits
LtScrolled and LtText (Figure 8.15). The operator a11_visible checks if the entire
text is visible to the user. The operator visiblexontent will return the contents
of the visible text. Finally, the trait for the scrolled text editor with its menu is
formally described in the LtScrolledTextMenu trait (Figure 8.16). The speciﬁer may
include any primitive traits into the ﬁnal design, e.g., deciding that either popupbar

or menubar is needed in the text editor.

8.4.3 The Interface Level

An interface speciﬁcation language is not tailored to specifying the behavior of an
entire program; instead it is tailored to specifying the behavior of a part of a program
(a module) [86], e.g. the Xt/ Motif widgets. The LSL trait utilized in the interface
language describes the abstract values and some vocabulary that is used to manipulate

the abstract values. For example, Figure 8.17 shows part of the speciﬁcation for

 

130

 

LtScrolledTethenu(Scrolled Tethenu): trait

includes LtScrolledTezt, LtMenu

introduces
with.popupmenu: ScrolledTethenu -v Bool
with-menubar: ScrolledTethenu —» Bool
label: ScrolledTethenu —. String

asserts
label(set.attr(ScrolledTethenu. Label, Value)) == Value
with-popupmenu(set_attr(ScrolledTextMenu, Popup, -)) == true
with-menubar(set-attr(ScrolledTestMenu, MenuBar, .)) == true

Figure 8.16. The LtScrolledTextMenu trait.

 

primitive widgets based on the Larch generic interface language. Each method in
the speciﬁcation component Primitive corresponds to an operation associated with
this component. Select takes a displayable widget as a parameter and makes the
widget be in a state of having been selected. If the widget is already selected, then
it remains selected. Thus, the ensures clause states that the return value of the
.Boolean operator selected applied to the chosen widget is always true. Unselect is
very straightforward in that there are no preconditions, and the effect of the operation
is that the selected operator returns false. M ove(w, delta) moves each widget w
by delta. For each widget w, the value of w after Move is the result of setting
the position of the widget to its previous position plus delta. Once the Move is
performed, the widget is unselected. This action/ behavior is reflected in the second
clause of the ensures clause. The speciﬁcation of Resize is similar; it takes one
widget as a parameter. The ensures clause changes both the position and size of the
selected widget and unselects the widget. '

Figure 8.18 states the behavior of a Popup widget in a window. The operation
Push(p) makes a popup widget viewable at the topmost level among current viewable

widgets. On the other hand, Release( p) makes it disappear and become unselected.

131

 

component Primitive Widget based on Widget from LtPrimitive Widget

method Select(w: Widget)
requires (w e DisplayableSet)
modiﬁes (w.view)
ensures selected(w’) == true

method Unselect(w: Widget)
requires true
modiﬁes (w.view)
ensures selected(w’) == false

method Move(w: Widget, delta: Coord)
requires true
modiﬁes w

ensures selected(w) == true =
(pos(w’) == pos(w) + delta) A selected(w’) == false

method Resize(w: Widget, pos: Coord, size: Size)
requires selected(w)
modiﬁes w
ensures w’ == set.size(set.pos(w,pos),size) A selected(w’) == false

Figure 8.17. Part of the Interface Speciﬁcation of Primitive Widget.

 

The component Popupnenu inherits the operations from Popup and adds more oper-
ations, and we can say Popupnenu is a subclass of Popup. The interface speciﬁcation
can be reused by inheritance. A subclass inherits its parent class’s speciﬁcation; that
is, the data member declarations and member methods of parent classes are inherited
by the a derived class.

Speciﬁcation inheritance provides a simple flexible mechanism for specifying a
widget without respecifying existing widgets. In C++, only public functions can be
inherited by derived class. This constraint is not imposed on our widget speciﬁcations

for brevity purposes. Figure 8.19 is an example of a derived widget that inherits the

132

 

component Papup
inherits Primitive

method Push(p: Popup)
requires a displayable(p)
modiﬁes p.view
ensures displayable(p’) A selected(p) A top_level(p)

method Release(p: Popup Widget)
requires displayable(p)
modiﬁes p.view
ensures -i displayable(p’) A n selected(p)

Figure 8.18. The Larch Interface Speciﬁcation of Popup.

 

member functions of the widget speciﬁcation component Popup and introduces more
member functions belonging to itself only, i.e., Createltem, Deleteltem, Choose, and
Actor. The operation Choose selects an item from the popup menu and ﬁres the

corresponding event. Actor makes the state of the current system change to reﬂect

the ﬁred event.

8.5 Applying Reuse Processes to Speciﬁcation

Components of Xt / Motif Widgets

This section describes an example that applies the reuse processes (construction,
retrieval and modiﬁcation) to the speciﬁcation components of Motif widgets. More
speciﬁcally, this section provides the reader with a documented example of a moderate
size reusable component library and gives the reader some feel for the applicability
of the reuse processes for real-world problems. We have speciﬁed about 80 widgets

that have been implemented and described informally in the Motif Programming

133

 

component Popupmenu
inherits Popup
uses Menu from LtMenu

method CreateItem(p: Popupmenu. i: Item, e: Event)
requires i ¢ ItemSet(p)
modiﬁes p
ensures i E ItemSet(p) A associated(i,e)

method DeleteItem(p: Popupmenu. i: Item)
requires i E ItemSet(p)
modiﬁes p
ensures i ¢ ItemSet(p)

method Choose(p: Popupmenu, i: Item)
requires selected(p) A -: selected(i)
modiﬁes i
ensures (associated(i,e) => ﬁre(e) A selected(i))

A (i;£j => q selected(j))

method Actor(s: State, e: Event)
requires ﬁre(e)
modiﬁes s
ensures 8’ == act(s,e) A -. ﬁre(e)

Figure 8.19. The Larch Interface Speciﬁcation of Popupmenu.

 

 

134

Reference[70l

Figure 8.20 shows a set of unstructured widget components that are to be used
as an example. The system allows several operations on the components and their
methods. The user may add, delete, or view some component in the workspace.
Similarly, the user may also add, delete or edit some method of a component. The

constructed hierarchy can also be saved in the system library that contains a set of

reusable components.

8.5.1 Construction Process

Figure 8.21 gives a snapshot of the result of applying the subsumption test to the
above set of widget components, i.e., a low-level hierarchy is formed among the above
widget components. The parent-child relationship between two components reﬂects
their generality relationship that is described in Chapter 4. Then applying the clus-
tering test to the set of the most general components of the lower-level hierarchy
generates a uniﬁed two-tiered hierarchy of the widget components which is shown in
Figure 8.22, where the box represents a real speciﬁcation component of some wid-
get and the ellipse represents a meta component which is created by the clustering
algorithm. As for the construction of the two~tiered hierarchy of the above compo-
nents, it took about 10 minutes to construct the hierarchical structure of the widget
components. Actually, most of the computation time is due to the the calculation of
the subsumption relationship among components. The subsumption test algorithm
is applied to a set of “ﬂat” components so the time complexity is proportional to
the square of the number of the components. In Chapter 4, some algorithms are
provided to improve the efﬁciency of this process. However, in most cases, the time
complexity that it takes to construct the hierarchy is not critical since we are more
interested in the ability to search and retrieve a set of reusable components efficiently

and modifying them to satisfy the query speciﬁcations correctly.

135

 

 

 

.r 7.53.5;

 
 

 

 

 

 

 

 

 

 

 

 

 

_ 33.32 . . . :. .-Lrl. ..7 : : - : . ..
_ .- z- _7. a - mm . 75.2.: : w
_ __...........E_ _.._._..3... . . ...,: . . 3-; 3... : 35...... : 3 _

 

 

_ ..5..Lr..._.........._ L?...3..3: : :......3.....:3...2..3_

 

3:02:6230: _33e1:_3§s=sa.;

 

 

_ .......J_ 5.... : 3.5.3:55:3.__.._._.......3_ .3..

 

 

_ _7..........\: 3.3.. :...............=...._.........: : in; - .._

 

 

_ . 3:.. _ -2 :333..L_ _ﬁ __ _7....._....._

 

 

_mﬁ.__ .. :2 .. __ _7: :z .- L. _

 

 

 

73.53.;

 

 

mason-ash: __ :3ss__:L.-lb~_

]

 

 

 

 

 

.3292 nuanEoU sweaOm 32:39.2: A cucumm some: V Azzococof once—blokku Amctoumiu Buivhhiv ﬂame... :ozaEammsm v

  

 

a...” Louisa onan P

Figure 8.20. The unstructured widget components.

 

136

 

 

 

 

a!

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

   

:— rL
_ \ _ .
_ : _ t _
_Iﬁlnunnuugﬁt I9_~_OE:L:\K\ LL no a __ Z I _ Y1
_ . a __ __ _ Inch: nae-I3. Lael=_uo__:-n all: unit. uu-P‘c_.e..unu
:33020Luuu 301.. [1.2 I_&_u.:—L:: I— :a- a‘Imv—Ia :0 one»: / 4 .-
EOquIIVCVIIUI EOVUSI’BLLCI I—OI I- \_ . \ °_‘_OG°_U O I.- 1—9 .——lE.—65—:
_ala_-uI_IMc=Lu no.1. notwn so: ILA. 3903qu 31- o u: ..(a .34.. n u;l°n:.~¢..:lu
— :IIEEI-anéuolsozx l__\-o. v1: _ as... such: 53 _OLuu _ a I .n0>.~—.B_...=_
CE: 9!: L: I a. on; _ in... -CI: :0» .- =CJUA0.I_nu _
/.V J!
_ unI-aevuduan __ ...u. 0.3.9.6.. \# ..Iutltuzn :Iansaotx _3 uni—.40.. I sang: _ {Ia-ET:-
\ .v
£301....Lx _i. 5..-nu- : ..Icuunu : Lenin‘s-nu : sunk. : Eur-nth: _ :301_.:hvalhu _ ~ __l£u_u>ldnlba\_
]

 

 

 

 

 

Lucaom 32:320.:

llama—Uri 339500

 

 

cvcaom 3oz:

 

 

29.9.2: _u BoilozF

'aaaaaa§"aa@

a...” Lani-:9 on... 3-

mctmuuw; U _av_:v~ano_ I

 

EL

 

Figure 8.21. The lower-level hierarchy of widget components.

 

138

8.5.2 Retrieval Process

The time spent in the construction process is compensated by the time saved in the
retrieval process. Due to the hierarchical structure of the components, a user is able
to retrieve a set of components that exactly match the query speciﬁcation, or are
more general than the query specification. Figure 8.23 gives an example of retrieving
an exactly-matched component. Given the query speciﬁcation vinﬁcrolled‘l'ext
which is partially shown in the left side of the browser, the system returns the
exactly-matched component xScrolled‘I‘ext. Figure 8.24 gives another snapshot of
the result of retrieval process, where a group of more general components than the
query component winﬁcrolled‘rext is returned to the user. The speciﬁcation of
winjcrolled‘l‘ext is shown in Appendix C.3. The system takes only several seconds
to return the set of more general components (or exactly-matched component) to the

user and this example shows the applicability of the retrieval process in the framework

of a two-tiered hierarchy.

8.5.3 Modiﬁcation Process

As illustrated in Chapter 7, the matching process helps the user to recognize the
similarity (or difference) between two components’ methods and to modify an existing
component to ﬁt the query specification. Figure 8.25 gives an example of applying
the matching process to the method select of the query component win.l’opﬂenu
and the method push of an existing component xPullDownMenu. The Candidate
Analogies window displays the matchings found between the above two methods.
The result shows that these two methods almost have the same form except that the
term pulldovn in xPullDownMenu: :push should be replaced by the term popup in

win.l’opMenu: :select. The same process can be applied to other methods of these

two components to obtain similar results.

 

  

 

 

U-II-U-I- ul

..._ _ _ a _

     

:Ihue=e..uun

      

 

 

 

 

_ _ 753...

/\

O U I.) ‘I—nlhuu:

  

 

139

eev.le..0~

 

03: Iran

     

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

R —3~uu~uu.ome d33~>£n£uI=obnluomrl p3“
uZO..—..GZ°U IFmOL

 

 

 

 

 

 

 

ozw FMJ

 

2253:5020“...
.zoFEZOUwaa
O :- Iso- _
uxmkanoUmx 1J I|\\ His: iris .uxobuozcnumxupzn. "mg...
._.r 39:33... «956 _@L ..c. I AI _I.§sbuo._o..um.a Hw¢3b<29m Ea:
_ a _ D
25“.;
1 J 3ee:.3_.obuueo:.ﬁs. WU
:xw .
m concur—Lo v von So:
u U z. m P inhuozosumlﬁzF2u20a20u m
nucwcoQEOU 33:00 9.0:
uncoCOQEOU «.5535: uvau
3:05. :uaaom .do—SULGLEI 33:»; a 5
recs. .333 .32232: @L. musfoi 32> @

 

.r
\Ullnulir/

\iiaiﬂw/ \iﬂnll/I

 

 

 

 

 

 

30.2.2 Baa—too 5'953 32:89.2: sebum .32.:
is... i... 3.. '55» II- a- III ' as" 'I @w

.n 50959.3 Oman»-

 

 
 

 

 
 

Eucwcoi “.32.? 02F 053330 3323321

 

 

Figure 8.23. An example of retrieving an exact-matched component.

 

140

 

 

 

ﬂE

 

      

:0: II.— ...-

_r di'h UI=OLUHN —

— cu—

   

I_IG_U~U_ ..-

  

  

 

 

sci—.939.

  

 

 

I>:_ Eth-

~u|~IQUII.Ou

 

W J,

 
      

 

 

 

aw

 

E

$55.55;...

 

_7 A _

. 0

 

 

 

 

 

 

.
3351.62 Pagumx

 

62w

na—Byuwauwsmn “A—3v>a_nw.ul:o..unluoull p3.
"2°..P—DZOUIFMOB

2 Fiﬁ—.0330:
”20-h.DZOUIw~.—n

.125. 125.. .uxohuo:obmx33. EEC.

uxmkumpFDLUmx

 

J

 

..J

_NLOCOU OLD:

 

 

BL

 

 

J.

 

 

:xw

 

 

mucuconEoU Stvmnm 9.02

 

muCOCOQEOU _wgwcoU $.02

 

 

mvC mCOQEOU u... DLUUN z uUmxm

 

 

:wa. :uLuww tau—£055.32.

 

a]:

 

0!. ..ULlow Evin-L652: BL.

itallhl

 

 

/”

Al HIJXO.FU$:°LUWXH "mKDP<ZU-m
2.0wm

Bout—3:93; "DO-Cb:

 

Haﬁz-23.55153.PZwZOnEOU

‘hA—I

ECPI

3:131].

UULHEHE'

 

m

 

 

 

F

n-
aa

£5522 32>

 

.Mf

 

\Uﬁ/

\llilusj

\lliﬁ

 

 

 

 

 

 

 

A .3355“. 339:00 .

 
 

:3me 32:35.21

A Luiwom ..moc: .
9.03.0 uzum

 

3.95.5. I u 0.5:.

 

I02;

9.“ LUMBOLU Dun-OK

 

aa
5..

 

Figure 8.24. An example of retrieving a set 'of more general components.

 

141

The implementation of the existing component xPullDownMenu is shown in Ap-
pendix ES. The output of this implementation is displayed in Figure 8.26 that is
a window containing a pulldown menu. Based on the matching information, the
user can easily modify the implementation of xPullDovnHenu to support popup
menu type. The only thing we need to do is to specify a set of new parameters
that are required in a window containing popup menus. Motif deﬁnes two con-
venient functions XmCreatePopupMenu and XmCreatePulldownMenu to initiate the
popup menu and pulldown menu respectively. Therefore, we exploit the function
XmCreatePopupHenuO to obtain a popup menu. The main routine was modiﬁed
slightly to move the menu out of the menubar and into the DrawingArea widget as
a popup menu. The menu item declarations from the program of xPullDownMenu
may still be used. The ﬁnal implementation of the query component win.l’opﬂenu is

shown in Appendix E.6. The output of this program is given in Figure 8.27.

8.6 Summary

This chapter provides a case study of applying formal methods to a real world example
GUI programming, speciﬁcally in terms of X window’s widgets. The speciﬁcation
process formally describes the Motif widget’s components and helps the users to
understand more about the behavior of the window’s widgets. The construction
process built a two-tiered hierarchy of widget components and can assist the users
in the browsing, the retrieving, and the modifying of existing components. The
retrieval process has shown the ability to ﬁnd a set of existing components that are
more general or more speciﬁc than the query component. Finally, we gave an example
of modifying an existing widget component to satisfy the query speciﬁcation based ~
upon the matching information. This chapter demonstrates how formal methods can

be used in the speciﬁcation of the software components, construction of the software

142

 

 

 

 

 

 

 

 

 

 

 

 

->.:I...L-

    

:ualovacao:

  

an:o.3c.omo_:x no.1.auaﬂuuo-L

 

_ ulnEEaU: _
V
. . . In I x I... no n Utah:
_ 3:02.32...» n uni-.3. _: u u u . e..- h . .e

\‘1... l. . V

 

 

 

 

 

 

. ._ l 3.. Ill 5 .3... E
L‘I‘ i‘
Fo>opinou Au. pogvlmow unwra- lnrlfi- Eh
3 Q

l

 

 

 

 

 

 

 

  

 

 

 

 

 

 

 

 

:coa crow—3n Any new... ill
:coalcsouF—Jn Auv newslaanon
05235;; Avv ornmxmpnmzu .l. mu IL L L .I.l.. . IL L L
ma-no_ﬂc< Qua—33:60 @L. 23:69: 2 . it; €::€:~ . . . .1.
. \ "ZOFEZOUIhmOa
2::05InzaonvOEEn-znnzutu
"20.....GZOUwaa
n::UEn:aoax.::aElasaoau "wa>h
£31.55:qu 9.322.”. Guano—om E L L L L wdDP<ZU.m
.. L: nl . .. L3“ "
. 1.. h: i L L m:_um_xu vausom I“ 2. mm
0
woo—om "vegans. Pas-.0 towns—ow V nus—om coxhws.
u .y
:L L LI: 3 l L e. >L030 vanBm
_ ’1 3_.USQIADAI:_3"PZNZOQSOU
”u a‘ D
a a l a a a ..
mo_ao.¢:< Ou::EcU @ V F Lea—um 2030 BL. 11

 

 

 

 

 

 

 

 
 
 
 

H.822; cunnEOUU A condom _wu.:u._mco.zv Aconmom .59.... v ABUSE? us..o....l03.r v 9.7.33.0 32:953.... amp... :o_unE:mnsm

9n LOHBOLD 03.08 @L

Figure 8.25. An example of computing matching between two methods.

 

143

 

    
  
    
    
 

mako_pulldown

Linea

Linn Lblght h
Line Color D
Mm 331: D

 

rum

 

[alias
Plasma

gnu

 

 

 

 

 

 

   

Figure 8.26. The output of the pulldown menu implementation.

 

 

@ simple_ponun

 

Yello-

 

mm.
0m. Ctrl-C

 

 

 

 

 

 

Figure 8.27. The output of the popup menu implementation.

 

 

 

 

comp<

retrie‘

144

component hierarchy, retrieval of the reusable components and modiﬁcation of the

retrieved reusable components.

CHAPTER 9

Related Work

This chapter describes systems that have been built to perform software component
reuse, including description of the speciﬁc methods used by each system. The related
work discussed is categorized according to software reuse, uses of analogy, and speciﬁ-

cations of graphical user interface. While this survey is comprehensive, it is certainly

not exhaustive.

9.1 Related Work for Reuse

Caldiera and Basili

Caldiera and Basili [25, 22] propose a reuse framework that deﬁnes a project or-
ganization and an experience factory. The project organization performs activities
specific to the implementation of the system to which it is dedicated. The project
organization engineers may request a list of reusable components from the experience
factory. The experience factory’s process model is twofold: it satisﬁes requests for
components from the project organization, but it also prepares the pool of choices
for the requests. The work in this paper is similar to the latter task in that we also
prepare a component library, however, our library is hierarchically organized. We

also use a formal representation for the software components in order to facilitate an

145

146

automated approach for determining software reuse, whereas their approach requires
domain experts to select a set of reusable components. They produce software com-
ponents by developing a component production plan - the experience factory extracts
reusable components from existing systems or generalizes components previously pro-
duced on request from the project organization. They need domain analysts to study
each component in order to determine the service it can provide. Then components
are stored in the repository with all information that has been obtained about them.
However, we prefer our approach for reasons of cost and extensibility. First, many
domains cannot be easily described by domain analysts and domain analysis is very
difﬁcult and expensive. Our library can be automatically constructed without human
intervention. Our library can be built for any domain and can easily be modiﬁed.
GURU

The GURU project [14, 15] automatically assembles large components by using in-
formation retrieval techniques. The construction of the library consists of two steps.
First, attributes are automatically extracted from natural language documentation
by using an indexing scheme. Then a hierarchy is automatically generated using a
clustering technique similar to our hierarchical clustering algorithm. The indexing
scheme is based on the analysis of natural language documents obtained from man-
ual pages or comments. The assumption is that natural language documentation is
a rich source of conceptual information. However, natural language is not a rigor-
ous language to specify the behavior of software components. One concept may be
expressed in different ways or styles between two natural language documentations.
In contrast, a formal speciﬁcation language can serve as a contract, and a means of
communication among a client, a speciﬁer, and an implementer [19] with applicable
formal veriﬁcation techniques. Because of their mathematical basis, formal speci-
ﬁcations are more precise and more concise than natural language documentation.

Another presumption of their indexing scheme is that 98 percent of lexical relations

147

relate words that are separated by at most ﬁve words within a single sentence. There-
fore, they extract pairs of words by sliding a window over the target-text.

MAPS

Nishida and others [17] developed a method of semi-automatic speciﬁcation reﬁne-
ment and program generation using library modules. Users write their speciﬁcations,
modify and rearrange them so that. the speciﬁcation can be reﬁned with the aid of
the library modules. When a speciﬁcation is given, a reﬁnement system called MAPS
searches for library modules applicable to the given speciﬁcation, replaces the speciﬁ-
cation with a more detailed descriptionwritten in the operation part of the modules,
and convert the reﬁned speciﬁcation into a programming language designated by the
user. Case-like expressions or pseudo-natural language expressions are used for de-
scribing user’s speciﬁcation and speciﬁcations for library modules. MAPS exploits
the uniﬁcation capability to search through reusable modules in library. However,
their library is not hierarchically organized, thus the search space could become very
large once the number of software modules in the library increases.

Draco

The Draco project [87] focuses on the use of domain engineering of software reuse [26].
The goal of this project is to increase the productivity of software engineers in the
construction of similar systems by organizing reusable software components by do-
main analysis. Draco was among the ﬁrst systems to promote the reuse of products
from all phases of software life cycle. from analysis, design to implementation (com-
ponents). The most important aspect of Draco is the domain language that describes
objects and operations of a particular domain and hence represents analysis informa-
tion about that domain. The object and operation also describes the design informa-
tion. There is a reusable component associated with each domain language object or
operation. Since there is a potentially large number of components within a domain,

Draco uses a classiﬁcation scheme for the components called faceted classiﬁcation to

148

aid in organizing and retrieving the components [88, 89, 23].

Using faceted classiﬁcation, each component is described by a set of attributes.
The attributes are chosen to best characterize the components of a particular domain.
Each attribute is ﬁlled with a term from a well-deﬁned vocabulary. A thesaurus is
provided to determine the proper term to apply. A query is a tuple with selected terms
used as keys to search the database. In general, a query session begins with with the
most speciﬁc query, that is, all attributes are ﬁlled in. If the results of the query are
unsatisfactory, the user may generalize the query by inserting a wildcard character (*)
for attribute values. Draco maintains a measure of conceptual closeness for the term
lists of each attribute as a weighted, acyclic, directed graph. This way, an unsuccessful
search can be tried again using an alternative but similar term in one of the attributes.
The advantage of using faceted classiﬁcation is that it is conceptually simple for users
and relatively easy to implement. However, the disadvantage to faceted classiﬁcation
is that it is not suitable for unconstrained domains. Also, even with a conceptual
closeness measure, semantically similar components may be overlooked, especially
components from different domains.

The Reusable Software Library

The Reusable Software Library is a system designed to make software reuse an in-
tegral part of the software development process [90]. The components of RSL are
stored in the database with attributes that provides a basis for retrieval. RSL has
two methods to search for components: standard multi-attribute search and natural
language. The former method provides a menu-driven interface in which the user
selects the attributes with which to perform the search. Alternatively, the user may
express a query in the form of natural language. The system parses the input, ex-
tracts keywords from the query and uses those keywords as attributes to perform the
search. The natural language front end is apparently easy to use, however, it is also

expected to be inaccurate and slow because of the nature of the natural language

1:19

parser.
The Programmer’s Apprentice

The goal of the Programmer’s Apprentice is to exploit artiﬁcial intelligence techniques
in an effort to automate the programming process [91, 18, 92]. It is designed to pro-
vide intelligent assistance in all phases of a programming task. The designer thinks
of the Apprentice as an agent in the process rather than as a tool. A reusable com-
ponent in the Programmer’s Apprentice is called a cliche’ that represents a commonly
used combination of elements. A formalism called a Plan Calculus has been devel-
oped to represent a cliche’. The Program’s Apprentice automates cliche’ recognition
as a means to understand existing programs and facilitate program optimization. It
uses graph parsing to recognize cliche’ in program in order to recognize a program’s
design. A maintenance tool called Recognizer automatically ﬁnds all occurrences of
a given set of cliche’s in a program and builds a hierarchical description of the pro-
gram in terms of cliche’s found. Since plan is essentially a directed graph, the system
uses graph-parsing to identify sub-graphs that are then replaced with more abstract
operations. However, it is not efficient to search desired cliche’s via purely structural
exhaustive strategy graph parsing. Some heuristics are expected to be added in order
to enhance the system’s performance.

Steigerwald

Steigerwald [27] describes a tool used in Computer Aided Prototyping System
(CAPS), developed at the Naval Postgraduate School, which retrieves reusable com-
ponents from a software base using a formal speciﬁcation as a search key. The query
speciﬁcation that represents a design requirement is compared to formal speciﬁca-
tions of Ada reusable components stored in an object—oriented database management
system. A syntactic search compares speciﬁcation interfaces, identifying reusable
candidates based on types of parameters. The semantic search rank orders a set of

candidates based on semantic similarity to the query. The method, query by consis-

150

tency, compares terms that are reduced in the axioms of each speciﬁcation. Spec-
iﬁcations are normalized to facilitate the matching between query speciﬁcation and
reusable component speciﬁcations in the retrieval. A formal proof veriﬁes that query
consistency can retrieve components to meet speciﬁed requirements. The framework

of this tool is similar to ours, however, there is no classiﬁcation scheme exploited in

it to enhance retrieval.

Zaremski and Wing

Zaremski and Wing [93] propose signature matching as a method for achieving soft-
ware reuse by using signature information easily derived from a reusable component.
They consider two kinds of software components, functions and modules, and hence
two kinds of matching, function matching and module matching. The signature of
a function is simply its type; the signature of a module is a multiset of user-deﬁned
types and a multiset of function signatures. They consider both the exact match and

relaxed match. In their work, the pre- and postconditions are not used as a key to

ﬁnd a set of reusable components.

9.2 Related Work for Analogy

Several researchers have applied analogical reasoning to real-world problems e.g. au-
tomated theorem proving. education, artiﬁcial intelligence, and software engineering.

This section provides a brief overview of some analogical applications in the AI and

software engineering ﬁelds.

9.2.1 Analogy in AI

Much of the work done in Al not use analogies in problem solving, and therefore turns
out to be less interesting to our concerns in the application of analogy in solving reuse

problems. This section is presented for the purpose of giving a brief history of analogy

in the AI community.

Evans

One of the earliest computational accounts of analogy is due to Evans [94].. Evans
wrote a program that attempted to solve geometric problems of the kind that occur in
intelligence tests. These problems test the ability to perceive relations and analogies,
or rather the ability to perceive the same analogy as the person who deﬁned the
problem. The analogies are not used for any external purposes. Evans’ early work
on analogy is an important part of the development of analogy with AI. Evans was
the ﬁrst person to articulate analogies as symbolic correspondences between formal
representations. The notion of what an analogy exists implicitly in the work of both
Hesse [65] and Polya [95], but is not developed to the stage of being made explicitly.
Kling

Kling [96] developed an analogical reasoning system for use with an existing resolution
theorem proving system. It was the ﬁrst attempt to automate the use of analogies to
solve problems. That is, Kling was the ﬁrst to introduce the paradigm for reasoning by
analogy. Kling was mainly concerned with the analogies that exist between different
branches of abstract algebra, particularly group theory and ring theory.
Carbonnell

Carbonnell [97] has discussed how a problem solving system can be augmented with
analogy components. He proposes various operators that may be applied to a faulty
plan in a hope of transforming it into a valid target solution. The branching rate
at the solution transformation level is much greater than that at the original state
space level, and the solution level may lie at greater depth than the object level; thus,
without some account of the control regime at the transformation level, the use of the
analogy is likely to make the target problem harder to solve. Carbonnell proposed
a new model for reasoning by analogy within an automated problem solver [98],

where he coined the phrase derivational analogy. Instead of looking for analogies

between problem representations alone, analogies are sought between initial segments
of problem solving activity between the target and the base. That is, the target
problem is attacked by the problem solver (presumably with some general purpose
search technique); the entire trace of the problem solver’s search at any stage is
retained, including all failed branches and intermediate states; matching takes place
between this structure and the corresponding initial trace for candidate base problems;
once an adequate match is found, the later stages of the process proceed in a way
similar to Carbonnell’s earlier model.

Indurkhya

Indurkhya [99] describes a knowledge representation scheme, very similar to ﬁrst-order
logic, and deﬁnes a notion of analogy within it. While Indurkhya does consider the
problem of reasoning with analogies, he requires an analogy to be an isomorphism
between formulas, where predicates and objects may be associated with different
predicates and objects, but only in a strictly consistent manner. There can be no

permutation of arguments between associated terms and formulas, and there can be

.no unmatched arguments.

Gentner et al

Gentner and colleagues perform investigation with analogy from both a psychological
and computational standpoint [100, 101]. They have developed a model for analogical
reasoning called structural mapping theory. In the mapping stage, a symbolic match
is formed between the base and target descriptions. Such a mapping is then used
to make predictions about the target domain from known properties of the base
domain. Gentner’s work has focused on modeling famous scientiﬁc analogies such as
Rutherford’s solar system model of the atom and the ﬂuid ﬂow model of electricity,
and also analogies used in cognitive experiments. Gentner uses what is known as a
restrictive notion of analogy match; matches must be isomorphisms between parts

of the base and target descriptions, with relations being matched only with identical

153

relations. Gentner has also considered the problem of base ﬁletering: that is the
determination of a small set of plausible base problems / descriptions from a potentially
vast base of descriptions.

Kedar-Cabelli

Kedar-Cabelli [102] introduces purpose as an additional constraint on a structure-
mapping mechanism. Their motivation is that the similar purpose for two problems
often lead to similar solutions. With the constraint purpose, the analogical reasoning
system needs to generate causal structures of both base and target problems for the
mapping process. Purpose—directed analogy satisﬁes the constraint of mapping only
those causal relations that share similar purposes.

Owen

Owen [103] addresses the use of analogy to guide the search in theorem proving
systems. The goal has been to develop mechanisms for constructing and exploiting
simple analogies between mathematical problems. As with others work on genuine
problem solving analogies, a ﬂexible notion of an analogy match is necessary. Much
of his effort has been towards a more ﬂexible, powerful, and unstandable matching

algorithm than those of Munyer and Kling.

9.2.2 Analogy in Software Engineering

Dershowitz

Dershowitz [104] suggested the formulation of program of analogies as a basic tool
in program abstraction. An analogy is ﬁrst sought between the speciﬁcation of the
given programs; this process yields an abstract speciﬁcation that may be instantiated
to any of the given concrete speciﬁcations. The analogy is then used as a basis
for transforming the existing program into abstract schemas helping to verify and
complete the analogy. A given concrete speciﬁcation of a new problem may then be

compared with the abstract speciﬁcation of the schema to suggest an instantiation of

 

 

 

 

tr
dc
co:

8.111

154

the schema that yields a correct program.

Goldberg

Goldberg [105] explores how an implementation of a modiﬁed speciﬁcation can be
realized by replaying the transformational derivation of the original implementation
and modifying it as required by changes made to the speciﬁcation. They structure
derivations using the notion of tactics and record derivation histories as an execution
trace of the application of tactics. One key idea is that tactics are compositional:
higher level tactics are constructed from more rudimentary ones using deﬁned control
primitives. Given such a derivation history and a modiﬁed speciﬁcation, the corre-
spondence between program parts of the original and modiﬁed program is established.
Their approach uses a combination of name association, structural properties, and
associating components to one another by intensional descriptions of objects deﬁned
in the transformations themselves.

Lubars, et al

The ROSE-2 project [106] is based on the knowledge-based reﬁnement paradigm,
which is a software development process in which user-supplied requirements are
used to select and customize a high-level design. The paradigm is supported by a
knowledge base of high-level design abstractions called design schemas and reﬁnement
rules. The schemas and rules are used to customize the user’s designs to satisfy the
user’s requirements and design decisions.

Bhansali

Bhansali [107] describes the derivation of a concrete program from a semi-formal
speciﬁcation of a problem. He used a transformational approach based on a set of
transformational rules that produce a top-down decomposition of a problem statement
down to the level of target language primitives. The top-down decomposition process
combines ideas from AI research in planning to generate programs efﬁciently. The

author emphasizes the reuse of domain speciﬁc knowledge. APU is a system which

155

uses the proposed paradigm to synthesize UNIX programs (shell scripts) from semi-
formal speciﬁcations of programs. He proposes an analogy approach to automate
software derivation.- .A- knowledge base. contains formal and informal speciﬁcations
from past experience. The formal speciﬁcation is described in pre- and postconditions.
n order to describe a target problem, its informal speciﬁcation is provided in the form
of a systematic representation of information about the domain: objects, attributes,
relations between objects, and problem descriptions. The informal speciﬁcation is
analyzed and compared to extract analogies. An analogy algorithm is proposed to
detect the analogies from the knowledge base, which is then used to derive the formal
speciﬁcation for the target problem.

Maiden et al

Maiden and Sutcliffe [108] investigate the potential of speciﬁcation reuse by analogy
and its possible beneﬁts for requirements analysis. They have developed two non-
simple examples to examine the potential for speciﬁcation reuse by analogy. The
ﬁrst example illustrates an analogy between air-trafﬁc controller (ATC) and a ﬂexible
manufacturing system (FMS). The second example shows analogy between the ATC
and a classroom administration system (CAS), and the FMS and the CAS. They
propose a software engineering analogy model based upon three types of knowledge:
solution knowledge, domain knowledge, and goal knowledge.

Lung et al

Lung and Urban [109] have proposed an analogy model for software reuse. In addition
to the constraints proposed by by Maiden and Sutcliffe, more constraints are added
for software analogy analysis due to the complexity of the software system. They
have proposed an analogy-based domain analysis method that can support a high
level reuse across domains [110]. The purpose is to help users better understand
a domain and support potential future reuse in a different domain. Their method

generalizes that of Maiden and Sutcliffe.

156

Coen-Porisini et al.

Alberto C oen-Porisini, et al. [111] developed a technique and an environment-
supporting specialized software components. The technique—is .based on symbolic
execution. It allows one to transform a more general software component into a more
speciﬁc and more efﬁcient component. Specialization is motivated as a technique to
facilitate software reuse. They assume that a library of generalized components ex-
ists and the environment supports a designer in customizing a generalized component

when the need arises for it under more specialized conditions.

9.3 Related Work of Specifying GUI

There are several Larch Shared Language available, e.g. Larch Handbook and the
extended example of simple windowing system in [76]. Larch also has been used
to specify properties of objects in transaction-based distributed systems [77, 41].
Many formal speciﬁcations have been proposed to describe computer graphics sys-
tems [112, 113, 114, 115, 116, 117, 118, 119, 120] however none of them specify,
iimplementation-independent window systems. Mallgren [121, 122] introduces a lower-
level speciﬁcation for the computer graphics languages, e.g. geometry and color trans-
formations. but the speciﬁcation of higher-level GUI interface like window widgets is
not provided. The similar work closely related to [122, 121] is that by Guttag and
Homing [123]. They have designed a hierarchy of types to serve as the basis for dis-
play device interface and speciﬁed them algebraically. Sufrin [124] presents the formal
speciﬁcation of a simple, display-oriented text editor. It is an early effort to incor-
porate formal speciﬁcation in the design stage. Although their editor speciﬁcation is
tedious, the experience can be used for future attempts of specifying GUI in a formal
means. Narayana and Dharap [125] give a formal speciﬁcation of the look manager

of a dialog system. The look manager deals with the presentation of visual aspects

157

of objects and the editing of those visual objects. They provide a formal model for
specifying the look of the objects. Jacob [126, 127] describes the speciﬁcation of the
user interface module for the family of message systems and provides the surveys of

formal speciﬁcation techniques that can be applied to human-computer interfaces.

CHAPTER 10

Concluding Remarks

In this chapter, we summarize the contributions of this work and discuss potential

future investigations.

10.1 Summary

This dissertation contributes to the ﬁeld of software engineering and component reuse

along the following dimensions:

1. Construction of component hierarchy.
Incorporation of formal methods to software reuse.

Retrieving reusable components from the component hierarchy.

Modifying more general components.

seesaw

Modifying analogous components.

Each of these contributions are discussed in further detail.

10.1.1 Construction of Component Hierarchy

We have proposed classiﬁcation schemes and algorithms for automatically construct-

ing a hierarchy of software components that provide a means for representing, storing,

158

159

browsing, retrieving and modifying reusable components. The hierarchical relation-
ships of the software components based on a generality relationship and similarities
between software components. The similarities are calculated with respect to a par-
tition of operators into equivalence classes. The resulting library structure consists
of lower-level and higher-level hierarchies. The lower-level hierarchy is created by a
subsumption test algorithm that determines whether one component is more general
than another. Based on the generality relationship, the most general components are
placed at the top of the hierarchy and the more detailed or restrictive components at
the bottom. All the generated components undergo another grouping scheme to yield
the higher-level hierarchy. The higher-level hierarchy is generated by a classical hier-
archical clustering algorithm that groups the most syntactically similar components
together. The end result is a connected hierarchy of software components organized

from the most general to the most speciﬁc.

10.1.2 Incorporation of Formal Methods to Software Reuse

Using formal speciﬁcations to represent software components facilitates determina-
tion of reusability because the formal representation more precisely characterizes the
functionality of the software. In addition, the well-deﬁned syntax of formal speciﬁca-
tion languages makes processing amenable to automation. In Chapter 8, a case study
involving the speciﬁcation of X window widgets is described to demonstrate how to

perform reuse for solving a real-world problem.

10.1.3 Retrieving Reusable Components from the Hierar-
chy

Based on the two-tiered hierarchy of reusable components and formal description of

each component, we can easily locate the components that have the generality rela-

160

tionship with a user’s query that is also represented by a formal speciﬁcation. This
approach is in contrasts to another another popular method used today, classiﬁcation
schemes. The classiﬁcation scheme approach attempts to store and retrieve compo-
nents based on attributes whose values are selected from a ﬁnite set of keywords.
Retrieving components, hence, needs some organizational knowledge of the software
structure and the knowledge of the keyword set. However, query by formal speciﬁca-

tion requires no knowledge of the software component hierarchy from the user.

10.1.4 Modifying More General Components

From the framework of a two-tiered hierarchy of reusable software components, the
candidate components that are more general than the query speciﬁcation will be re-
trieved from the hierarchy. A more general retrieved component is compared to the
query speciﬁcation to determine what changes need to be applied to the corresponding
program component in order to make it satisfy the query speciﬁcation. The weakest
preconditions of the retrieved speciﬁcations with respect to the corresponding pro-

. gram component are obtained and returned to the users to provide information to

guide the program modiﬁcation.

10.1.5 Modifying Analogous Components

If there are no components in the hierarchy exhibiting the generality relationships
with the query speciﬁcation, then the candidate components that are analogous to
the query speciﬁcation are retrieved from the hierarchy. A set of analogical matches
between an analogous retrieved component and query speciﬁcation are computed and
returned to the user. Based on the information, the user can determine what part of
the program needs to be modiﬁed to satisfy the query speciﬁcation. This dissertation

has described a new approach to modifying the analogous components, the relevant

161

issues about this approach, and several examples to demonstrate the usefulness of

this approach.

10.2 Future Work

Based upon the framework that has been developed thus far, the approach of construc-
tion and retrieval processes will be extended in several aspects. First, new techniques
are being developed that may be more efficient than the currently used approach for
determining functional similarity between two software components. The abstraction
scheme for forming meta nodes of software components will also be further investi-
gated. The formal representation for the inheritance relationship and the genericity
of software components will be further investigated in order to better exploit the
properties of object-oriented development.

Regarding program modiﬁcation, several issues need to be addressed in the fu-
ture. As mentioned in Chapter 7, program modiﬁcation is a combination of ana-
logical reasoning, veriﬁcation, transformation, and synthesis. These features should
be supported by an integrated system that contains programming tools, veriﬁcation
assistant, and a domain-speciﬁc knowledge base. Some guidelines for programming
by modiﬁcation need to be provided to the programmers in order to support the
development of correct programs. Some rules such as wp semantics are required to
assist the programmers verify that the “modiﬁed” program based on the results of
the matching process. Another set of rules may be needed to assist the programmers
update the unexecutable statements, i.e., change the non-primitive operators into
primitive operators.

As for the matching process, we recognize some problems should be solved be-
fore the potential beneﬁts of the matching process can be fully exploited. Support-

ing the understanding of existing programs is an important factor to the successful

 

162

speciﬁcation-level reuse. An ARMP needs didactic support for comprehension of
candidate speciﬁcations, which requires an explanation facility to help the software
developer understand the query domain and the candidate speciﬁcations. Since an
automated ARMP is unlikely to achieve a perfect match, explanation will also be
necessary for evaluating the appropriate query speciﬁcations.

Regarding the relationship of the matching process and the two—tiered hierarchy

used in the construction and retrieval processes, we have the following problems need

to be solved:

0 Apply the matching process to the abstraction of two real nodes into a meta
node. Currently, the meta node is just an aggregation of its child nodes. If the
abstraction is possible, then we can develop a middle layer in the hierarchy and

the nodes of this layer represent the program schemata of their child nodes.

0 Select a set of methods that are similar to the query speciﬁcation and supply
them to the matching process as candidate speciﬁcations. We now use the

notion of similarity as a guidance, but we wish to have a selection scheme that

is beyond the syntax-based approach.

Regarding the reuse system, we are investigating the integration of the reuse
framework into a software development environment comprising tools for formal spec-
iﬁcation editing [29, 128], program visualization from formal speciﬁcations [129], and

a tool that abstracts formal speciﬁcations from program code [130, 131].

APPENDICES

APPENDIX A

Analogical Reasoning

In this appendix, we describe both normative and cognitive accounts of analogical
reasoning [103]. By normative accounts, we mean those that attempt to provide an
analytical justiﬁcation for reasoning by analogy. In contrast, cognitive accounts refer
to those that attempt to model human analogical reasoning and hence gain insight
into how people are able to beneﬁt from analogy.

Philosophers since Aristotle have tried to understand analogy by considering what
reason we have for believing an analogy is useful. Hesse’s theory of analogy [65]
addresses the use of analogical models to make predictions in science. When scientists
are investigating some new and unfamiliar system (the target system), with only a
few properties of the system known to them, an analogy with a familiar system (the
base system) may be applied in order to predict new properties of the unfamiliar
system. The most famous examples are Newton’s particle theory and Rutherford’s
solar system model of the atom. Typically, the base problem is well understood and
there is a successful theory with the observed properties. The model is used to predict
the target system because the corresponding properties are known to be true of the
base.

Recently, cognitive psychologists have attempted to model human analogical rea-

soning [100]. Early models tended to concentrate on isolated analogy like the IQ test.

163

164

More recent models explain analogy as part of a wider human reasoning. For exam-
ple, Gentner’s structure mapping theory (SMT) has presented a complete analogical
reasoning model from the computational point of view. As illustrated in Figure A.l,
the analogy retriever looks for potential analogies for the current state of working
memory and past experiences stored in long term memory. Potential analogies are
then passed to the analogy engine. The engine performs full matching on the potential

analogies and makes candidate inferences about the target situation and structural

evaluation of the strength of the analogy.

 

 

 

 

 

 

 

 

.------------------’

 

 

 

Figure A.1. Gentner’s Model of Analogy

 

Gentner’s model construes analogy matching and analogical inference as being
isolated syntactic operations, i.e., it does not allow semantic or pragmatic factors to
directly inﬂuence the analogy process. The PI model of Hoyoak and Thagard [132]

takes a different approach to analogy. PI is a spreading activation model in which

165

currently active concepts, e.g. those involved in the current goals, spread activation
to other concepts with which they are associated. As shown in Figure A.2, analogy
is tightly coupled with normal problem solving. The loop in the ﬁgure represents the
principal cycle of rule retrieval and application in P1. If the current goal stored in
working memory has not been solved yet, then production rules that are relevant to
the problem may be retrieved and matched. The use of analogy suggests plausible

rules to ﬁre for the current state, thus distinguishing PI from SMT [100].

 

 

 

 

 

 

 

 

 

 

 

Figure A.2. PI Model

 

As described in Section 7.1, the goal of ARMP is to enable a program synthesizing
system to reuse existing software components similar to a query speciﬁcation. Thus

ARMP may be considered to be a type of machine learning. A popular machine learn-

 

 

 

ing system
solving be
tion. fat!
that sub'
metheI
systenn
new pr<
knowle
domah

experil

166

ing system is the generalization system. That is, given a set of examples of problem
solving behavior, it forms a generalization (schema) of both the problem and the solu-
tion. Faced with a new problem, the generalization system looks for a generalization
that subsumes the new problem. When one is found, the system tries to instanti-
ate the generalization solution to solve the given problem. In contrast, the analogy
system makes no generalization, but, instead, attempts to relate old problems with
new problems by matching. The generalization systems rely on sophisticated domain
knowledge. When such knowledge is lacking or insufﬁcient, for example when a new
domain is to be explored, the analogy system could provide a means for reusing past

experience. The relationships of analogy and generalization are shown in Figure A.3.

 

  
  
  

generalization strict matching

Old Problem New Problem

Figure A.3. Analogy and Generalization

 
 

analogy
ﬂexible matching

  
 

D

   

 

APPENDIX B

Bottom-Up Matching Approach

The algorithm presented in Section 7.5 is a top-down matching algorithm that em-
phasizes the higher-level predicate or function symbols. However, for the matching
problem, bottom-up matching is an alternative approach. It is believed that speciﬁca-
tions can be represented as trees and the symbolic computation, analogical matching
in our case, can be described as tree replacement. In the papers [133, 134, 135], the
authors’ approach to automatic proving of equational theorems is to treat a set of
equational axioms as replacements rule and transform one side of the equation to be
proved into the other by a sequence of tree replacements. Knuth and Bendix [55]
discuss some of the cases in which tree replacements yield efﬁcient theorem provers.
We present formally the tree-matching problem, but the detailed bottom-up tree-
patterning algorithms will not be discussed in this report. Suppose we are given a
ﬁnite ranked alphabet E of function symbols, including constants as nullary functions.

3 denotes the set of E-terms, formally deﬁned as follows.

Deﬁnition 3.1 2 terms:

(I) For all I) in E of rank 0, b is a E-term.

(2) If a is a symbol of rank q in 2, then a(t1, ..., tq) is a E-term provided each of the
t,- is.

(3) Nothing else is a Z-term.

167

168

A nullary symbol :1 is used as a placeholder for a E-tree. We deﬁne the set E U {V}

just as E-terms, denoted by 5”.

Deﬁnition 3.2 A tree pattern is any term in S”. If b(t1, ..., bq) is a term, then deﬁne

child;(t1,...,tq) to be t,- for l S i S q.

Deﬁnition B.3 A pattern p in 5,, with k occurrences of the symbol V matches a
subject tree t in S at node n if there exist 5.3-trees t1, ...,tk in S such that the E-tree
p’, obtained from p by substituting t,- for the ith occurrence of V in p, is equal to the

subtree of t rooted at n.

Deﬁnition BA The Tree Matching Problem.
A tree matching problem consists of a ﬁnite set of patterns p1, ..., pk in 3,, and a subject

tree in S. A solution to a matching problem is a list of all pairs (n,i), when n is a

node in t and p,- matches at n.

All suggested matched methods for tree matching should be compared to the naive
algorithm (based on the simple form of uniﬁcation), which merely tries every pattern
at every position in the subject tree. The main objectives of the bottom-up matching
algorithm is to ﬁnd, at each point in the subject tree, all patterns and all parts of
patterns that match at this point. Let n be a node in the subject tree labeled with
the q-ary symbol b, and suppose we wish to compute the set M of all those pattern
subtrees other than 11. Suppose we have already computed such sets for each of the
children of n, and call these sets, from left to right, M1, ..., M,. Then M contains 1!
plus exactly those pattern subtrees b(t1, ...,t,) such that t,- is in M5, for 1 S i _<_ q.
Therefore, we could compute M by forming trees b(t1, ..., tq) for all combinations of
(t1, ..., t,), where t.- are chosen from Mg, and then asking whether each candidate for
membership in M is a subpattern. Once we have assigned these sets to each node in

the subject tree, then the tree matching problem is solved.

APPENDIX C

The Speciﬁcations for Motif
Widgets

0.1 The Functionalities of Motif Widgets

This appendix contains the functionalities of Motif window widgets and formally
describes a set of Motif window widgets, which include their LSL traits and several
examples of the LIL speciﬁcations for application widgets.

Display Widgets

Motif provides many widgets whose primary purpose is to display information or

interact with the user. These widgets include:

XmArrowButton XmDrawButton
XmList XmPushButton
XmSeparator XmText
XmScale XmCreateButton
XmLabel XmToggleButton
XmScrollBar XmTextField

Container Widgets

Motif provides many widgets that can be used to combine other widgets into compos-

ite panels. Composite widgets allow endless combinations of buttons, scrollbars, text

169

170

panes, and so on, to be grouped together in an application. These widgets include:

XmDrawingArea XmFrame
XmRowColumn XmForm
XmPanedWindow XmBulletinBoard
XmMainWindow XmScrollWindow

Menu Widgets

Motif provides many widgets that allow applications to create popup, pulldown and
option menu. A menu pane consists of a popup Shell widget that manages an Xm-
RowColumn widget containing buttons, labels, and other types of widgets. The
buttons are selectable entries in the menu. By combining different types of widgets,
the programmer can create different types of menus: pulldown, popups, cascading
pulldowns, cascading popups, option menus, and so on. The following widgets can

be used to construct menus:

XmRowColumn XmToggleButton
XmPushButton XmCascadeButton
XmLabel XmSeparator

Dialog Widgets

The Motif widget set builds on the underlying popup facilities provided by the Xt
Intrinsics to provide a versatile set of dialog widgets. Motif uses a subclass of the
Shell widget class, the XmDialogShell widget class directly because Motif provides
convenience functions that create different types of dialogs and popups. Most Motif
widgets know they are the children of the XmDislogShell widget class and automat-
ically popup up and down the parent shell when managed and unmanaged. The

following are the Motif widgets classes designed to be used as dialog widgets.

XmBulletineBoard XmFileSelectionBox
XmCommand XmForm
XmMessageBox XmSelectionBox

By setting appropriate resources, many types of dialogs can be created from these

171

basic widget classes. Motif widgets provide a large set of convenience routines to make

dialogs easier to create and use. These convenience functions create the following

types and dialogs:

BulletineBoardDialog ErrorDialog
InformationDialog MessageDialog
SelectionDialog WarningDIalog
FormDIalog QuestionDialog
F ileSelectionDialog WorkingDialog
PromptDialog

These dialogs are not really new widget classes, but are combinations of an XmDi-
alogShell widget and one of the manager widgets described earlier in this section.
Gadgets
In addition to widgets, Motif provides user interface components known as gadgets.
Gadgets are nearly the same as widgets, except that they have no windows of their
own. A gadget must display text or graphics in the window provided by its parent,
and also must rely on its parent for input. The Motif gadget classes include:
XmArrowButtonGadget XmLabelGadget

XmSeparatorGadget XmToggleButtonGadget
XmPushButtonWidget XmCascadeButtonGadget

Nearly all widgets can be customed by the user or the programmer. Widgets are
reuse and customized by specifying values for various resources by the widget. Based

on the set of Motif widgets, the programmer can create any reusable customized GUI

components with desired behavior.

C.2 The LSL Traits of Motif Widgets

This appendix describes the Larch traits of the widgets and gadgets available in Motif
toolkits. Since trait only describes state—independent properties of target objects, the

state dependent speciﬁcations are described in next section. Because of limited space,

not all developed Motif widgets will be formally described here. The preﬁx of the

Larch traits is “Lt” which stands for “Larch Trait”.

1. Basic Window

LtBasic Window: trait
assumes LtCoordinate
includes LtRegion, LtView, Displayable(Window)
asserts Window tuple of pos: Coord,
size: Size,
cont, clip: Region,
front, back: View,
id: WId
forall w: Window, ed: Coord
cd 6 w == ed 6 w.clip
w[cd] == if cd 6 w.cont w. front else w.baclc
implies converts -[.], 6: Coord, Window -o Bool

2. Color

LtColor( C): trait
includes Set, LtPixelMap
introduces
colon: -v C
colorg: -t C

color": -> C

equal: C, C -v 3001
- -: C, C ~ C
-e-: C, C —’ C

3. Compound String

LtCompoundString( CS ): trait
includes LtString( CS, CSSet)
introduces
conver.c-string: String —. CS
deconvert-c-string: CS —o String

4. Coordinate

173

LtCoonl: trait
introduces
origin —» Coord
---: Coord, Coord -+ Coord
asserts forall ed: Coord
Cd - Cd == origin

5. Displayable

LtDisplayable( T): trait
assumes LtCoond
includes LtRegion( T)

introduces
-[_]: T, Coord -> Color

6. label

LtLabel(L): trait
includes LtCompoundString(CompString)
introduces
set-label: L, CompString —. L
getJabel: L -> CompString
deleteJabel: L, CompString —+ L

7. Line

LtLine(Line): trait
includes Set(Line for E, LineSet for C), LtCoord(Coord)
introduces
create Coord, Coord —> Line
delete Line ->
meet Line, Line --> Coord
parallel Line, Line —* Bool

8. Manager W'indow

LtManager( M): trait
includes Lt View
assumes LtPrimitive
M tuple of pos: Coord,

174

size: Size,

introduces

create: —. M

realize: —v M

size: M —. Size % observers

pos: M -v Coord

id: M -+ Integer

child: M, Real -+ Widget

child.size: M -+ Size

child.pos: M —’ Coord

realize-child: Widget -> View

derealize-child: Widget -+ View
asserts

M generated by create

M partitioned by id

9. Pixel Map

LtPixelMap: trait
includes LtRegion(R)
asserts
Pixel tuple of int: Intensity,
pos: Coord
PixelMap enumeration of Pixel

10. Primitive Window

LtPrimitive: trait
includes LtBasicWindow, LtView, Set( Widget, WidgetSet)
Widget tuple of pas: Coord
size: Size
id: Real
introduces
create: -v Widget
destroy: —+ Widget
open: Widget -> View
quit: Widget —’ View
selected: Widget -+ Bool
move: Widget, Coord —-> Widget
size: Widget —. Size % observers
set-size: Widget, Size —* Widget
pos: Widget —’ Coord
set.pos: Widget, Coord -» Widget
overlap: Widget, Widget -’ Bool
id: Widget -v Integer
parent: Widget —> Widget

175

child: Widget, Real _. WidgetSet
is_child: Widget. Widget —' Bool

asserts
V w: Widget

move(w).pos == coord.map(w.pos)
move(w).size == w.size
move(w).id == w.id
resize(w).id == coord.map( w. pas )
resize (w). id == length-map( w. size)
resize(w).id == w.id

w generated by create
w partitioned by id

11. Region

LtRegion( R): trait

assumes LtCoord

introduces
-€-: Coord, R -—> Bool
nullregion: -» R
point: Coord -+ R
lineseg: Coord, Coord -+ R
intersec: R, R — R
union: R, R —- R
diﬂ'erence: R, R _. R
contains: R, R -~ Bool
equal: R, R -> Bool
empty: R -> Bool

12. Scrolled Text Window

LtScrolled Text( S c Txt). trait

includes LtScrolled, LtText

introduces
all.visible: Scht - Bool
visible: Scht, View —. Bool
visible-content: ScTrt —+ View

asserts
all.visible(visible.mntent(ScTxt)) == true
visible(Scht, visible-content(Scht)) == true

13. Scrolled Text Editor with Menu

LtScrolledTextMenu(Scmlled TextMenu): trait

176

includes LtScrolledText, LtMenu

introduces
with-popupmenu: ScrolledTextMenu -> 8001

with-menubar: ScrolledTextMenu -+ Bool
label: ScrolledTextMenu —> String

asserts
label(set.attr(ScrolledTextMenu, Label, Value)) == Value
with.popupmenu(set_attr(ScrolledTextMenu, Popup, ..)) == true
with-menubar(set.attr(ScrolledTextMenu, MenuBar, ..)) == true

14. Scrolled Window

LtScrolled: trait
includes Lt View, LtManager, LtPrimitive
Scrolled tuple of type: {vertical,horizontal},
policy: { automatic, app-deﬁned} ,
introduces
create: -+ Scrolled
destroy: Scrolled —-¢
scmll.up: Scrolled -» Scrolled
scroll.down: Scrolled -v Scrolled
scrollJeft: Scrolled -o Scrolled
scrolLright: Scrolled —. Scrolled

asserts
scroll.up(create).cont == Empty
scroll.down(create).cont == Empty

scrollJeft(create).cont === Empty

scmll..right(create).cont == Empty
scroll..up(scmll.down).cont == scroll.down(scroll.up).cont
scmllJeft(scmll.right).cont == scroll.right(scmll.left).cont

15. SetResource

LtSetResource(Attr, Val): trait
includes Set(Attr for E, AttrSet for C), LtObject(0b for Obj)
introduces
valid.attr: 0b, Attr -> Boot
valid.value: Attr, Val -9 Bool
get-value: 0b, Attr —-> Val
set-attr: 0b, Attr, Val -+ 0b

asserts
get-value( set-attr (0b, Attr, Val), Attr’ ==
if equal(Attr,Attr’)
then Val

else get-value(0b, Attr’)

16. Size

177

LtSize( S) trait

includes Real
introduces

create.size Real, Real --» S
null.size —-> S
equal.size S, S -* Boot

17. Text Window (Text Editor)

LtText( Text): trait

includes LtTextField, Lt View, LtLine (LN for Line_Number)

LtCharacter(CN for Character.Number), LtPrimitive

introduces

cursor: LN, String, String —> Text
create: —> Text

insert: Text, Stream -v Text
delete: Text, Stream -+ Text
moveJeft: Text -> Text
move-right: Text -> Text
move-up: Text -> Text
move-down: Text -+ Text
ﬁrst: Text —. LN

last: Text —+ LN

line: —v LN

contents: Text, LN —, Stream
clear: Text —>

asserts forall c: Char; cn:CN; 3,31,52: Stream; n, n1, n2: LN; t: Text

ﬁrst(create) == Empty
last(create) == Empty
ﬁrst(cursor(sl,s2)) == cursor(Empty,concat(s1,s2))
last(cursor(sl,32)) == cursor(concat(sl,s2),Empty)
insert(c,cursor(sl,82)) == cursor(append(sl,c),32)
delete(cursor(sI,Empty)) == cursor(sI,Empty)
delete(cursor(s1,append(c,32))) == cursor(sl,32)
move-leﬂ(cursor(Empty,s)) == cursor(Empty,s)
moveJeft(cursor(append(sl,c),s2)) == cursor(sl,append(c,32))
move-right(cursor(s, Empty) ) == cursor(s, Empty)
move-right(sl,cursor(append(c,s2))) == cursor(append(s1,c),s2)
line (move-up((cursor(sl ,s2)) ==

if equal(line(cursor(sl,s2)), I)

then 1 else line(cursor(s1,32)) - I
line(move-down((cursor(s1,52))) ==

if equal(line(cursor(s],s.2)),line.max)

178

then line.max else line(cursor(sl,s2)) + I
clear(t) == cursor(Empty,Empty)

18. Text Field (Line Editor)

Lt TextField(TF): trait
includes Lt View, String, LtCharacter, LtPrimitive

introduces

cursor: String, String —+ TF
create: -» TF

insert: TF - TF

delete: TF —~ TF

moveJeft: TF _. TF
move-right: TF —+ TF
ﬁrst: TF -¢ Char

last: TF —» Char

clear: —. TF

asserts forall c: Char; 3,81,52: String; tf: TF

19. View

insert(c,cursor(sl,s2)) == cursor(append(sl,c),s2)
delete(cursor(sI,Empty)) == cursor(sI,Empty)
delete(cursor(sl,append(c,s2))) == cursor(sl,s2)
move-left(cursor(Empty,3)) == cursor(Empty,s)
move-left(cursor(append(s1,c),32) == cursor(sl,append(c,s.2))
move_right(cursor(s,Empty)) == cursor(s,Empty)
move-right(sl,cursor(append(c,s2))) == cursor(append(sl,c),s2)
ﬁrst(cursor(sl,s2)) == cursor(Empty,concat(s],82))
last(cursor(s1,32)) == cursor(concat(sl,82),Empty)

clear(tf) == cursor(Empty, Empty)

LtView( View): trait
includes LtCoord, LtSize, Lt Widget, PixelMap
introduces

create: —’ View

quit: View -+

size: View -> Size

pos: View —» Coord
parent: View—i Widget
empty: View -s Bool
intensity: View -i Real
coord.map: Coord -> Coord
length-map: Size —-> Size
display: View -» PixelMap

179

'20. Well Formed Widgets

LtWF Widget( WF Widget): trait
includes LtPrimitive, Set( Widget, WidgetSet), LtSetResource, Object
introduces
ancestor: Widget, Widget —. Bool
descendant: Widget, Widget —+ Boot
is-well_formed: Widget -* Bool
managed.by: Widget, Widget -» Boot
extract.wf: ObjSet —> WidgetSet
delete-wf.widget: Widget, Widget—o Widget
delete-objs: Widget, ObjSet —» Widget
delete-children: Widget —’ Widget
asserts for all w, w’: WFWidget
is-well.formed(w) ==
(managed-by(w, parent(w)) A
(for all w’::descendant(w’,w) => -1 ancestor(w’, w)))
parent(w,w’) == child(w’,w)
ancestor( w, w ’) == descendant( w ’, w)
for all w’ E extract-wf( w) => is-well.formed( w’)

C.3 The LIL Speciﬁcations of Widgets

This appendix describes the Larch interface speciﬁcations of the widgets and gadgets

available in Motif toolkits. They are high-level speciﬁcations written using the Larch

Generic Interface Language (GIL). Mice or other indicators are not explicitly men-

tioned. Note that a ’ (‘prime’) is the new-value operator. That is, is x is a variable,

then x’ represents its value after the application of a transition. An unprimed x rep-

resents the old-value of x. We restrict operations to be pure functions, that is, each

operation returns a value that depends only on the parameters of the call, and there

are no side effects, either on the parameters or any other part of the system state.

1. Primitive

component Primitive Widget
based on Widget from

180

LtPrimitive Widget

method Select( w: Widget)
requires (w E DisplayableSet)
modiﬁes (w.view)
ensures selected(w’) == true

method Unselect(w: Widget)
requires true
modiﬁes (w.view)
ensures selected(w’) == false

method Move(w: Widget, delta: Coord)
requires true
modiﬁes w

ensures selected(w) == true =>
(pos(w’) == pos(w) + delta) A selected(w’) == false

method Resize( w: Widget, pos: Coord, size: Size)
requires selected(w)
modiﬁes w
ensures w’ == set.size(set.pos(w,pos),size) A selected(w’) == false

method ChangeAttr(w: Widget, attr: Attr, val: Val)
requires valid.attr(w, attr) A valid.value(attr,val)
modiﬁes w.sttr
ensures w’ = set.attr(w,attr,val)

method Open(w: Widget)
requires selected(w) A displayable(w) == false
modiﬁes w.view
ensures displayable(w’) == true

method Close (to: Widget)
requires selected(w) A displayable(w) == true
modiﬁes w. view
ensures displayable(w’) == false

method Delete( w: Widget)
requires selected(w)
modiﬁes w
ensures selected(w’) == undeﬁned

2. Popup

181

component Popup
inherits Primitive

method Push(p: Popup)
requires - displayable(p)
modiﬁes p.view
ensures displayable(p’) A selected( p) A top_level(p)

method Release(p: PopupWidget)
requires displayable(p)
modiﬁes p.view
ensures w displayable(p’) A -w selected(p)

. Popupmenu

component Popupmenu
inherits Popup
uses Menu from LtMenu

method CreateItem(p: Popupmenu, i: Item, e: Event)
requires i ¢ ItemSet(p)
modiﬁes p
ensures i E ItemSet(p) A associated(i,e)

method DeleteItem(p: Popupmenu, i: Item)
requires i 6 ItemSet(p)
modiﬁes p
ensures i ¢ ItemSet(p)

method Choose(p: Popupmenu, i: Item)
requires selected(p) A -: selected(i)
modiﬁes i
ensures (associated(i,e) => ﬁre(e) A selected(i))
A (#j => -1 selected(j))

method Actor(s: State, e: Event)
requires ﬁre(e)

modiﬁes s
ensures 8’ == act(s,e) A -1 ﬁre(e)

. List

component List based on List from LtList

method Choose(w: List, i: Item, e: Event)

182

requires «selected(i) A selected(p)
ensures associated(i,e) => ﬁre(e)

method CreateItem(w: List, i: Item, e: Event)
requires i é p
modiﬁes it:
ensures i 6 w A associated(i,e)

method DeleteItem(w: List, i: Item)
requires i e w
modiﬁes w
ensures i ¢ to

method ChangeAttr(w: List, attr: xxx, val: xxx)
requires valid.attr(w,attr) A valid.value(attr, val)
modiﬁes w
ensures w’ == set.attr(w,attr, val)

method Actor(w: List, e:Event, s:State)
requires ﬁre(e)
modiﬁes s
ensures 8’ == act(s,e) A wﬁm(e)

APPENDIX D

Algorithm to ﬁnd LGCs

As described in Chapter 5, given a query speciﬁcation, the process of searching a set
of candidate components is performed in the lower-level hierarchy that is basically
a partial ordering set The components called real nodes in the lower-level hierarchy
obey the subsumption relationship (2”,) that is a partial ordering. Since reusing a
more speciﬁc component by abstraction is believed to be diﬂicult, we are interested in
reusing a more general component. In order to minimize the effort needed to modify a
reusable component, a least general component (LGC) is only sought from the lower-
level hierarchy. The deﬁnition of LGC will be given shortly. In this appendix, we
attempt to explore the complexity of ﬁnding a LGC from a partial ordering set, which
can be regarded as the upper bound of our retrieval algorithm. Some basic deﬁnitions

are presented that are necessary for the following proof.

Deﬁnition D.1 Least General Component (LGC). Assume L is a set with partial
ordering 2”,. Given a component x ¢ L, if there exists a component y E L such
that (V y’ 6 L \ {y}: (y’ 2%, x) => -v(y 2”,”, y’)) then y is called a Least General
Component (LGC) for x in the set L.

Our LGC problem is speciﬁed as follows:

183

C01

118

1’8'

184

IN STAN CE: A partial ordering set L and a given component x.
QUESTION: Find a set of LGCs for x in L.

The LGC problem is just a restricted version of our problem of retrieving a set of
more general components from a lower-level hierarchy given some query speciﬁcation
component. In practice, we do not really need to ﬁnd a set of “least” general compo
nents. However, studying this LGC problem helps us to understand how diﬁcult the
retrieval problem would be if the search space is not limited explicitly.

The lower-level hierarchy can be a directed graph G = (V, E), where each com- .
ponent is represented by a node in V and every edge (u, v) 6 E represents the sub-

sumption relationship v 2“,", u. The algorithm of ﬁnding a set of LGC is ducribed

in Figure D.l.

 

Algorithm 10 LGC
Input: G = (V,E) with partial ordering Q and query component x.
Output: A set of LGCs for x in G.

I. Coloring
Given a query component x, we color every node v e V by comparing v and

x. If v 2 x then v is colored as a black node, otherwise it is colored as a white
node.

2. Connecting
For every pair of nodes (u,v), if u 2" v then add an arc (v,u) to the set E,

where 2" is the transitive closure of 2.

3. Collecting
For every black node u, if no other white node v exists such that (v, u) 6 E then

LGC .- LGC u {u}.

Figure DJ. The algorithm to ﬁnd a set of LGCs.

 

185

Figure D.2 is an example of ﬁnding a set of LGCs in a partial ordering set. The
black nodes C,F,I,H,J are the nodes that subsume the input query speciﬁcation. The
solid lines represent the subsumption relationships. The dotted lines represents the
transitive closure among the nodes. Assume there are n nodes in G, indegree,,m
denotes the maximum number of incident arcs for the nodes in G and heightmu.
denotes the maximum height of L. The ﬁnal LGC set contains two nodes C and H.
The complexity of the coloring step is 0(n), the connecting step is 0(n x heightmu)
and the collecting step is 0(n x indegreemu). Hence, the overall complexity is 0(n x

max(indegreemu,heightmu». Therefore, we know that LGC is in the complexity

class P.

 

 

 

Figure D.2. Coloring, joining, and collecting a partial ordering set.

 

APPENDIX E

Program Listing

This appendix contains several programs that are mentioned in previous chapters.
The programs in the ﬁrst four section are written in C++. The programs in the last

two sections are written in C and Motif.

E.1 C++ Implementation for List

.// List.cc ..........................
l/
// List::add
// List::detach
// List::isA
//
// END ------------------------------
L1st::'List()
{
eh11e( head !- O )
{
Listﬁlenent ttenp ' head;
head I head->next;
delete temp;
}

}

void List::add( Objectt toAdd )
{

186

}

187

ListElenent IneeElensnt I new ListElensnt( ttoAdd );
newElelent-)next I head;

head I newElement;

iteIsInContainsr++;

void List::detach( coast Objects toDetach. int deletsOhjeetToo )

{

ListElensnt Icursor I head;

it ( I(head->data) II toDetach )
{

head I head->nert;

}

else

{
ListEleuent Itrailer I head;

while ( cursor !I O )

{

cursor I cursor->nert;
it ( I(cursor->data) II toDetach )

{

trailer->next I cursor->next;
break;

else

trailer I trailer->nsxt;

}

it( cursor !I 0 )

{

iteusInContainer--;
it ( deleteObjectToo )
{

it( cursor->data ) delete cursor->data;
cursor->data I O;

1

else

{

cursor->data I O;

}

delete cursor;

 

188

List List::isA() const

{
return listCIass;
}
char IList::nsns0£() coast
{
return "List";
}

E.2 C++ Implementation for Array

// Array.cc -----------
//
ll Array::addEnd
// Array::addAt
// Array::clear
// Array::isA
//
// End ---------------
Array::‘Array()
{
}
‘1“’TYP° Array::isA() const
{
return arrayClass;
}

void Array::addEnd( Object! toAdd )

{
vhi1e( theArray[ shereToAdd ] !I ZERO It vhereToAdd (I upperbound )

{
vhereToAdd++;

}

it( shereToAdd > upperbound )

{
reallocate( vhereToAdd - loverbound + 1 );

}

 

189

theArray[ shereToAdd - loverbound ] I !toAdd;

ehereToAdd++;
itensInContainer+*;
}
void Array::addlt( Object! toAdd, int atIndex )
{
it( atIndex > upperbound )
I
reallocatef atIndex - loverbound + 1 )z
}
it ( theArray[ atIndex ] !I ZERO )
{
delete theArray[ atIndex ];
itensInContainer--;
}
theArray[ atIndex - loverboundJ I !toAdd;
itelsInContainer++;
}

B.3 C++ Implementation for Stack

// stack.cc ----------------------
II Stack::push

// Stack::pop

// Stach::top

// End ----------------------------

void Stack::push( Object! toPush )

I
theStack.add( toPush ):

itensInContainer++;

1

Object! Stack::pop()

{
Object! temp I theStack.peehHead();

irt temp 2- answer )

{
theStack.detach( temp );

——Ii=——————==——..

190

itemsInContainer--;

}

return temp;
}
Object! Stack::top() const
{

return theStack.peekHead();
}
int Stack::isEmpty() const
{

return theStack.isEmpty();
}
classType Stack::isA() const
{

return stackClass;
}
char *Stack::name0f() const
{

return "Stack";
}

E.4 C++ Implementation for DoubleList

// dbllist.cc -------------------------
// DoubleList::addAtHead

// DoubleList::addAtTail

// DoubleList::detachFromHead

// DoubleList::detachFromTail

// End ----------------------------------

DoubleList::"DoubleList()

{
shileC head != 0 )
{
DoubleListElement *temp = head;
head a head->next;
delete temp;
}

 

191

void DoubleListzzaddAtHeadC Object& toAdd )
{

DoubleListElement *newElement = new DoubleListElement( &toAdd );

if ( head )

{
head->previous I nevElement;
newElement->next I head;
head = newElement;

}

else

{

tail = head = newElement;
}

itemsInContainer++;

}

void DoubleList::addAtTai1( 0bject& toAdd )
{

DoubleListElement *neeElement = new DoubleListElement( ttoAdd );

if ( tail )

{
tail->next = newElement;
newElement->previous = tail;
tail I newElement;

}

else

{

head = tail = newElement;
}
itemsInContainer++;

}

void DoubleList::detachFromHead( const Object! toDetach, int deleteToo )
{

if( head )

{

DoubleListElement *cursor = head;

if ( *(head->data) == toDetach )
{
if( head->next == 0 )
{
tail = 0;
head = head->next;
head->previous I 0;

 

else
{
while ( cursor != O )
{
if ( I(cursor->data) II toDetach )
{
cursor->previous->next I cursor->next;
cursor->next->previous I cursor->previous;
break;
}
else
{
cursor I cursor->next;
}
}
}
1
if( cursor != O )
{
itemsInContainer--;
if ( deleteToo )
{
it( cursor->data ) delete cursor->data;
cursor->data I O;
}
else
{
cursor->data = 0;
}
delete cursor;
cursor I O;
}

}

void DoubleList::detachFromTail( const Object! toDetach, int deleteToo )
{

it( tail )

{

DoubleListElement *cursor I tail;

if ( *(tai1->data) == toDetach )
{
if( tail->previous II 0 )
{
head I 0;
tail I tail->previous;
tail->next I 0;

 

ifC

E.5

193

}
else
{
while ( cursor !I O )
{
it ( I(cursor->data) II toDetach )
{
cursor->previous->next I cursor->next;
cursor->noxt->previous I cursor->previous;
break;
}
else
{
cursor I cursor->previous;
}
}
}
cursor != 0 )

itemsInContainer--:
if ( deleteToo )

{
ifC cursor->data ) delete cursor->data;
cursor->data I O;

}

else

{
cursor->data I O;

1

delete cursor;
cursor I O;

A Window with Pulldown Menu (Existing

Component)

tinclude <Xm/RowColumn.h>
tinclude (Km/Mainﬂ.h>
tinclude (Xm/DrawingA.h>

tinclude <Xm/CascadeBG.h>
tinclude <Xm/PushB.h>
tinclude <Xm/PushBG.h>

typedef struct -menu-item {

char Ilabel; /*
HidgetClass Iclass; /*
char mnemonic; /*
char Iaccelerator; II
char Iaccel-text; /"l

void (*callback)(); /*
XtPointer callback-data; /*

194

the label for the item I/

pushbutton, label, separator... I/
mnemonic; NULL if none I/
accelerator; NULL if none I/

to be converted to compound string */
routine to call; NULL if none I/
client-data for callback() I/

struct -menu-item Isubitems; lt pullright menu items, if not NULL */

} Nenultem;

Widget

BuildPulldownHenu(parent, menu_title, menu-mnemonic, items)

Widget parent;
char Imenu-title, menu-mnemonic;
NenuItem Iitems;

{

widget PullDown, cascade, widget;

int i;
XmString str;

PullDown I XmCreatePulldownHenu(parent, "-pulldown", NULL, 0);

str I XmStringCreateSimple(menu-title);
cascade I XtVaCreateHanagedHidget(menu-title,
meascadeButtonGadgetClass, parent,

XmNsubMenuId, PullDown,
XleabelString, str,

XmNmnemonic. menu-mnemonic,

NULL);
XmStringFree(str);

for (i I 0; items[i].1abel !I NULL; i++) {

it (items[i].subitems)

widget I BuildPulldownHenu(PullDown,
itemsfi].label, itemsEi].mnemonic, itemsfi].subitems);

else

widget I XtVaCreateHanagedHidget(items[i].label,
*itemsEi].class, PullDown,

NULL);

if (items[i].mnemonic)

XtVaSetValues(widget. XmNmnemonic, itemsfi].mnemonic, NULL);

195

if (items[i].accelerator) {
str = XmStringCreateSimple(items[i].accel_text);
XtVaSetValues(widget,
XmNaccelerator, itemsEi].accelerator,
XmNacceleratorText, str,
NULL);
XmStringFree(str);

}

if (items[i].ca11back)
XtAddCallback(widget, XmNactivateCallback,
items[i1.callback, items[i1.callback_data);
}

return cascade;

void

set-color(widget, color)
widget widget;

char *color;

{

printf("Setting color to Zs\n“, color);

}

NenuItem color-menu[] = {

{ "Cyan", tmeushButtonGadgetClass, ’C’, "Heta<Key>C", "Meta+C",
set_color, "cyan", (HenuItem *)NULL },

{ "Yellow", tmeushButtonGadgetClass, ’Y’, "Heta<Key>Y", "Meta+Y",
set_color, "yellow", (HenuItem *)NULL },

{ "Magenta", &meushButtonGadgetClass, ’M’, "Heta<Key>N", "Meta+H",
set_color, "magenta”, (MenuItem *)NULL },

{ "Black", lrmPushButtonGadgetClass, ’B’, "Meta<Key>B", ”Meta+B",
set-color, "black", (MenuItem *)NULL },

NULL,
};

HenuItem drawing_menus[] I {
{ "Line Height", tmeascadeButtonGadgetClass, ’W’, NULL, NULL,
0, 0, NULL }.
{ "Line Color", tmeascadeButtonGadgetClass, ’C’, NULL, NULL,
0, 0, color_menu },
{ "Line Style", &meascadeButtonGadgetC1ass, ’5’, NULL, NULL,
0, 0, NULL }.
NULL,
};

main(argc, argv)
int argc;

char *arng];

{

196

Widget toplevel, main_w, menubar, drawing_a;
XtAppContext app;

toplevel I XtVaAppInitialize<tapp, "Demos", NULL, 0,
targc, argv, NULL, NULL);

main-w I XtVaCreateHanagedWidget("main_w",
meainWindowWidgetClass, toplevel,

XmNscrollingPolicy,
NULL);

XmAUTOMATIC,

menubar I XmCreateNenuBar(main-w, "menubar", NULL, 0);
BuildPulldownNenu(menubar, "Lines", ’L’, drawing-menus);
XtHanageChi1d(menubar);

drawing-a I XtVaCreateHanagedWidget("drawing-a”,
merawingAreaWidgetClass, main-w,

XtRealizeWidget(teplevel);
XtAppMainLoopCapp);

Xmeidth, 500,
XmNheight, 500,
NULL);

E.6 A Window with Popup Menu (Modiﬁed

tinclude
tinclude
tinclude
tinclude
iinclude
tinclude

Component)

<Xm/RowColumn.h>
(Km/NainW.h>
<Xm/DrawingA.h>
<Xm/CascadeBG.h>
<Xm/PushB.h>
<Xm/PushBG.h>

typedef struct -menu-item {

char

*label;

WidgetClass *class;

/* the label for the item */
/* pushbutton, label, separator... */

char
char
char
void

197

mnemonic; /* mnemonic; NULL if none */
*accelerator; /* accelerator; NULL if none */
*accel_text; /* to be converted to compound string */

(Icallback)(); /* routine to call; NULL if none */

XtPointer callback-data; /* client-data for callbackC) */

struct -menu_item *subitems; /* pullright menu items, if not NULL #/
} HenuItem;
Widget

BuildPopupHenu(parent, menu-title, menu_mnemonic, items)
Widget parent;

char *menu_tit1e, menu-mnemonic;

HenuItem titems;

{
Widg
int

et PopUp, cascade, widget;
i;

XmString str;

PopUp I XmCreatePopupMenu(parent, "_popup”, NULL, 0);

for

}

(i = 0; itemsEi].1abel 2: NULL; i++) {

if (items[i].subitems)
widget I BuildPopupHenuCPopUp,
items[i].label, itemsEi].mnemonic, itemsEi].subitems);
else
widget I XtVaCreateHanagedWidget(items[i].label,
titemsEi].class, PopUp,
NULL);

if (items[i].mnemonic)
XtVaSetValues(widget, XmNmnemonic, items[i].mnemonic, NULL);

if (items[i].accelerator) {
str I XmStringCreateSimple(items[i].accel_text);
XtVaSetValues(widget,
XmNaccelerator, items[i].acce1erator,
XmNacceleratorText, str,
NULL);
XmStringFree(str);
}

if (items[i].ca11back)
XtAddCallback(widget, XmNactivateCallback,
itemsEi].callback, items[i].callback_data);

return cascade;

198

void

set_color(widget, color)
Widget widget;

char *color;

{

printr("Setting color to Zs\n", color);

}

HenuItem color_menu[] I {
{ "Cyan", tmeushButtonGadgetClass, ’C’, NULL, NULL,

set-color, "cyan", (HenuItem *)NULL },

{ "Yellow", lmeushButtonGadgetCIass, ’Y’, NULL, NULL,
set_color, "yellow", (MenuItem *)NULL },

{ ”Magenta", trmPushButtonGadgetClass, ’N’, NULL, NULL,
set_color, "magenta", (HenuItem *)NULL },

{ “Black“, trmPushButtonGadgetClass, ’8’, NULL, NULL,
set_color, "black", (MenuItem *)NULL },

XmVaSEPARATOR,

{ ”Quit", tmeushButtonGadgetClass, ’Q’, ”Ctrl<Key>c”, ”Ctrl-C",
NULL, NULL, (NenuItem *)NULL },

NULL.

};

main(argc , argv)
int argc;
char IarngJ;

{

Widget menu, toplevel, main_w, menubar, drawing_a;
void popup-cb(). inputC);
XtAppContext app;

toplevel I XtVaAppInitialize(&app, "Demos”, NULL, 0,
targc, argv, NULL, NULL);

main_w I XtVaCreateNanagedWidget("main_w",
meainWindowWidgetClass, toplevel,
XmNscrollingPolicy, XmAUTOMATIC,
NULL);

drawing_a I XtVaCreateHanagedWidget(”drawing_a”,
merawingAreaWidgetClass, main_w,
Xmeidth, 500,
XmNheight, 500,
NULL);

menu I BuildPopupMenu(drawing_a, ”Lines”, ’L’, color_menus);
XtAddCallback(drawing-a, XmNinputCallback, input, menu);

199

XtRealizeWidget(toplevel);
XtAppHainLoop(app);

void
input(widget, popup. cbs)
Widget widget;
Widget popup; /* popup menu associated with drawing area */
XmDrawingAreaCallbackStruct #cbs;
{
if (cbs->event->xany.type !I ButtonPress ll
cbs->event->xbutton.button != 3)
return;

/* Position the menu where the event occurred */
XmHenuPosition(popup, (XButtonPressedEvent *)(cbs->event));
XtHanageChild(popup);

BIBLIOGRAPHY

BIBLIOGRAPHY

[1] Ted .1. Biggerstaff. An Assessment and Analysis of Software Reuse. In Mar-
shall C. Yovits, editor, Advances in Computers, volume 34, pages 1-57. 1992.

[2] Ted J. Biggerstaff, editor. Software Reusability Vol. 1: Concepts and Models.
ACM Press, New York, 1989.

[3] Ted .1. Biggerstaff, editor. Software Reusability Vol. 2. Applications and Expe-
rience. ACM Press, New York, 1989.

[4] Charles W. Krueger. Software Reuse. ACM Computing Surveys, 24(2):131-183,
June 1992.

[5] P.A.V. Hall, editor. Software Reuse and Reverse Engineering in Practice. UNI-
COM Applied Information Technology, Chapman & Hall, 1992.

[6] E. Horowitz and J .B. Munson. An Expansive View of Reusable Software. IEEE
Transaction on Software Engineering, 10(5):477—487, September 1984.

[7] M. Sitaraman. Mechanisms and Methods for Performance Tuning of Reusable
Software Components. PhD thesis, Ohio State University, Columbus, Ohio,

1990.

[8] W. Tracz. Where does reuse start? ACM SIGSOFT Software Engineering
Notes, 15(2):42-46, April 1990.

[9] B.W. Weide, W.F. Ogden, and SH. Zweben. Reusable Software Components.
Advances in Computers, 33:1-65, 1991.

[10] Ted .1. Biggerstaﬁ. Design Recovery for Maintenance and Reuse. Technical
Report STP-378-88, MCC, November 1988.

[11] MD. Mcllroy. Mass Produced Software Components. In Proceedings of NATO
Conference on Software Engineering, pages 88—98, New York, 1969. Petro-
celli/Charter.

200

1121

1131

[141

1151

1161

1171

118]

1191

120]

1211

1221

[231

‘201

B.J. Cox. Object-Oriented Programming - An Evolutionary Approach. Addison-
Wesley, Reading, Massachussets, 1991.

James W. Hooper and Rowena 0. Chester. Software Reuse: Guidelines and

Methods. Plenum Press, New York and London, 1991.

Y.S. Maarek, D.M. Berry, and GE. Kaiser. An Information Retrieval Ap-
proach for Automatic Constructing Software Libraries. IEEE' Trans. Software
Engineering, l7(8):800—813, August 1991.

R. Helm and Y.S. Maarek. Integrating Information Retrieval and Domain Spe—
ciﬁc Approaches for Browsing and Retrieval in Ob ject-Oriented Class Libraries.

In Proceedings of OOPSLA ’91, pages 47—61, 1991.

R.L. London. Specifying Reusable Components Using Z: Realistic Sets and
Dictionaries. ACA'I SICSOFT Software Engineering Notes, 111(3):120—-127, May
1989.

F. Nishida, S. Takamatsu, Y. Fujita, and T. Tani. Semi-Automatic Program
Construction from Specification Using Library Modules. IEEE Transaction on
Software Engineering, l7(9):853—870, 1991.

Charles Rich and Richard C. Waters. Formalizing Reusable Software Com-
ponents. In Proceedings of Workshop on Reusability in Programming, pages

152-158, Newport, RI, September 1983.

Jeannette M. Wing. A Speciﬁer’s Introduction to Formal Methods. IEEE'
Computer, 23(9):8-24, September 1990.

D. R. Smith. KIDS: A semiautomatic program development system. 16(9):1024—
1043, September 1990.

Betty Hsiao-Chih Cheng. Synthesis of Procedural and Data Abstractions.
PhD thesis, University of Illinois at Urbana-Champaign, 1990. Tech Report

UIUCDCS-R-90-1631.

G. Caldiera and V. Basili. Identifying and Qualifying Reusable Software Com-
ponents. [EEE Computer, 24(2):61—-70, Febrary 1991.

R. Prieto’Diaz and P. Freeman. Classifying Software for Reusability. IEEE
Software, 4(1):6-16, January 1987.

1241

1251

1261

[271

1281

[29]

1301

1311

1321

1331

[341

[351

302

CW. Krueger. Models of Reuse in Software Engineering. Technical Report
CMU-CS-89-188, Carnegie Mellon University, December 1989.

Victor R. Basili Gianluigi Caldiera and Giovanni Cantone. A Reference Archi-
tecture for the Component Factory. ACM Transaction on Software Engineering
and Methodology, 1(1):53—80, January 1992.

Rubén Prieto-Diaz. Domain analysis: An introduction. ACM SIGSOFT Soft-
ware Engineering Notes. 13(2):47—54, 1990.

Robert Allen Steigerwald. Reusable Software Component Retrieval via Normal-
ized Algebraic Speciﬁcations. PhD thesis, Naval Postgraduage School, Monterey,

California, 1991.

Tom Danieli and Betty H.C. Cheng. Construction of Formal Speciﬁcations from
an Object-Oriented Decomposition of Informal Problem Descriptions. Techni-
cal Report MSU-CPS—Sl‘l-OS. Department of Computer Science, Michigan State

University, 1992.

Robert H. Bourdeau and Betty H.C. Cheng. An object-oriented toolkit for
constructing specification editors. In Proceedings of COMPSAC’92: Computer

Software and Applications Conference, pages 239-244, September 1992.

Barbara Liskov and John Guttag. Abstraction and Speciﬁcation in Program
Development. MIT Press and McGraw-Hill, Cambridge, 1986.

Rebecca Wirfs-Brock. Brian Wilkerson, and Lauren Wiener. Designing Object-
Oriented Software. Prentice Hall, Englewood, New Jersey, 1990.

James Rumbaugh, Michael Blaha, William Premerlani, Frederick Eddy, and
William Lorensen. Object-Oriented Modeling and Design. Prentice Hall, Engle-
wood Cliffs, New Jersey, 1991.

Peter Coad and Edward Yourdon. Object-Oriented Analysis. Yourdon Press,
Prentice Hall, Englewood. New Jersey, 1990.

Bertrand Meyer. Object-Oriented Software Construction. Prentice Hall, Engle-
wood, New Jersey, 1988.

Keith E. Gorlen, Sanford .\I. Orlow, and Perry 8. Plexico. Data Abstraction
and Object-oriented Programming in C++. John Wiley & Sons, 1990.

203

[36] Ann L. Winblad, Samuel D. Edwards, and David R. King. Object-Oriented
Software. Addison-Wesley, Publishing Company Inc., 1990.

[37] Kevin D. Jones. LM3: A Larch Interface Language for Modula-3 - A Deﬁnition
and Introduction Version 1.0. Technical Report 72, Digital Systems Research

Center, June 1991.

[38] Gary T. Leavens and Yoonsik Cheon. Preliminay Design of Larch/C++. Tech-
nical Report 92-l6a, Department of Computer Science, Iowa State University,

May 1992.

[39] Jeannette M. Wing. Using Larch to Specify Avalon/C++ Objects. IEEE Trans-
action on Software engineering, 16(9):1076-1088, September 1990.

[40] J .V. Guttag, J. J. Horning, and J. M. Wing. Larch in Five Easy Pieces. Tech-
nical Report 5, Digital Equipment Corporation Systems Research Center, Palo
Alto, California, July 24 1985.

[41] Richard Allen Lerner. Specifying Objects of Current Systems. PhD thesis,
Caregie Mellon University, May 1991.

[42] Jun-Jang Jeng and Betty H.C. Cheng. Using formal methods to construct
a software component library. In Proceedings of the Fourth European Soft-
ware Engineering Conference, pages 397-417, Garmisch-Partenkirchen, Ger-

many, September 13—17, 1993.

[43] Betty H.C. Cheng and Jun-Jang Jeng. Formal methods applied to reuse. In
Proceedings of the Fifth Workshop in Software Reuse, 1992.

[44] Jun-Jang Jeng and Betty H.C. Cheng. Using Automated Reasoning to De-
termine Software Reuse. International Journal of Software Engineering and
Knowledge Engineering, 2(4):523-546, 1992.

[45] John V. Guttag and James J. Horning. Introduction to LCL, a Larch/C In-
terface Language. Technical Report 74, Digital Systems Research Center, July

1991.

[46] Gary T. Leavens. Modular Veriﬁcation of Object-Oriented Programs with Sub-
types. Technical Report 92-09, Department of Computer Science, Iowa State

University, July 1990.

[47] Gary Leavens. Modular Speciﬁcation and Veriﬁcation of Objected-Oriented
Programs. IEEE Software, 8(4):?2—80, July 1991.

204

[48] Mike Laux. lb: A Larch Browser. Technical report, Dept. of Computer Science,
MSU, 1992. in preparation.

[49] Chin-Liang Chang and Richard Char-Tung Lee. Symbolic Logic and Mechanical
Theorem Proving. Academic Press, 1973.

[50] J .A. Robinson. A Machine Oriented Logic Based on Resolution Principle. Jour-
nal of ACM, 12(1):227-234, 1965.

[51] K.H. Blasius and H.J. Burchert, editors. Deduction Systems in Artiﬁcial Intel-
ligence. Ellis Horwood Series in Artificial Intelligence, 1989.

[52] S. Miyamoto. Fuzzy Sets in Informational Retrieval and Cluster Analysis.
Kluwer Academai Publishers, 1990.

[53] DR. Ballard and CM. Brown. Computer Vision. Prentice-Hall, Englewood
Cliffs, New Jersey, 1982.

[54] J.B. Kruskal. On the shortest spanning subtree of a graph and the traveling
salesman problem. In Proc. of the American Mathematical Society, 1956.

[55] D. Knuth and P. Bendix. Simple word problems in universal algebras. In
J. Leech, editor, Computational Problems in Abstract Algebra, pages 263-297.

Pergamon Press, Oxford, 1970.
[56] D. Gries. The Science of Programming. SpringeroVerlag, New-York, 1981.

[57] Edsger W. Dijkstra. A Discipline of Programming. Prentice Hall, Englewood
Cliffs, NJ, 1976.

[58] Ted Faison. Borland C++ 3.1 Object-Oriented Programming. SAMS, 1992.

[59] Greg Nelson. A Generalization of Dijkstra's Calculus. ACM Transactions on
Programming Languages and Systems, 11(4):571—561, October 1989.

[60] Jun-Jang Jeng and Betty H. C. Cheng. Using Analogy to Determine Program
Modiﬁcation Based on Speciﬁcation Changes. In Proceedings of 5th Interna-
tional Conference on Tools with Artiﬁcial Intelligence, 1993, pages 113-116.
IEEE Computer Society, November 8-11, 1993.

[61] K.E. Gorlen. NIH Class Library. National Institute of Health, Computer System
Laboratory, Division of Computer Research and Technology, Bethesda, MD.

Public Domain Software.

205

[62] Roger P. Hall. Computational Approaches to Analogical Reasoning: A Com-
parative Analysis. Artiﬁcial Intelligence, 39:39—120, 1989.

[63] Dedre Gentner. Mechanisms of Analogical Reasoning. In Stella Vosniadou and
Andrew Ortony, editors, Similarity and Analogical Reasoning, pages 199—241.
Cambridge University Press, 1990.

[64] Nachun Dershowitz. The Evolution of Programs. Birkhauser, Boston, MA,
1983.

[65] MB. Hesse. Models and Analogies in Science. Notre-Dame, 1963.

[66] J un-J ang Jeng and Betty H. C. Cheng. Program Modifications Based on Speciﬁ-
cation Changes Part 1: More General Components. Technical report, Michigan

State University, 1993.

[67] Jun-Jang Jeng and Betty H.C. Cheng. Using Automated Reasoning to De-
termine Software Reuse. Technical Report MSU-CPS-ACS-64, Department of
Computer Science, Michigan State University, East Lansing, Michigan, June
1992. accepted for publication in IJSEKE.

[68] Jun-Jang Jeng and Betty H.C. Cheng. Using Formal Methods to Construct
a Software Component Library. Technical Report MSU-CPS-92—ll, Michigan
State University, Department of Computer Science, 1992.

[69] Adrian Nye and Tim O’Reilly. X Toolkit Intrinsics Programming Manual.
O’Reilly & Associate, Inc., 1990.

[70] Dan Heller. Motif Programming Manual. O’Reilly & Associate, Inc., 1991.

[71] Douglas A. Young. Object-Oriented Programming with C++ and OSF/Motif.
Prentice Hall Englewood Cliffs, 1992.

[72] Dan Heller. X View Programming Manual. O’Reilly & Associate, Inc., 1990.

[73] Michael Mehlich and Weishi Zhang. Specifying Interactive Components for
Conﬁguring Graphical User Interfaces, May 1993. personal communication.

[74] Donald L. McMinds. Mastering OSF/Motif widgets. Reading, Mass. : Addison-
Wesley, 1993.

[75] John V. Guttag, James J. Horning, and Andres Modet. Report on the Larch
Shared Language: Version 2.3. Technical Report 58, Digital Systems Research

Center, April 1990.

206

[76] Stephen J. Garland, John V. Guttag, and James J Horning. Debugging Larch
Shared Language Specifications. IEEE Transactions on Software Engineering,

16(9):1044-1057, September 1990.

[77] Jeannette M. Wing. Using Larch to Specify Avalon/C++ Objects. IEEE Trans-
actions on Software Engineering, 16(9):1076-1088, September 1990.

[78] Allan Heydon, Mark W. Maimone, J.D. Tygar, Jeannette M. Wing, and
Amy Moormann Zaremski. Miro: Visual Speciﬁcation of Security. IEEE Trans-
actions on Software Engineering, 16(10):1185-1197, October 1990.

[79] Jeannette M. Wing and Amy Moormann Zaremski. A Formal Speciﬁcation of a
Visual Language Editor. Technical Report CMU-CS-91-112, February 25 1991.

[80] Amy Moormann Zaremski. A Larch Specification of Miro Editor. Technical
Report CMU-CS-91-lll, February 25 1991.

[81] Jeannette M. Wing and Chun Gong. A Library of Concurrent Objects and
Their Proofs of Correctness. Technical Report CMU-CS-90-151, July 1990.

[82] Scott Nettles. A Larch Speciﬁcation of Copying Garbage Collection. Tech-
nical Report CMU-CS-92-219, School of Computer Science, Carnegie Mellon

University, December 1992.
[83] Tim O’Reilly. Xlib Programming Manusl. O’Reilly & Associate, Inc., 1990.
[84] Tim O’Reilly. Xlib Reference Manusl. O’Reilly 8!. Associate, Inc., 1990.

[85] Hartmut Ehrig. Fundamentals of Algebraic Specification Vol 1 and 2. Springer-
Verlag, Berlin, New York, 1985.

[86] Yoonsik Cheon and Gary T. Leavens. A Quick Overview of Larch/C++. Tech-
nical Report 92-18, Department of Computer Science, Iowa State University,

June 1993.

[87] J .M. Neighbors. The draco approach to constructing software from reusable
components. IEEE Transaction on Software Engineering, 10(5):564-573, 1984.

[88] Rubén Prieto-Diaz. Implementing Faceted Classiﬁcation for Software Reuse.
Communication of ACM, 34(5), May 1991.

[89] Rubén Prieto-Diaz. Implementation Faceted Classiﬁcation for Software Reuse.
In IEEE International Conference on Software Engineering, pages 300—304,
1990.

207

[90] Bruce A. Burton, Rhonda Wienk, et al. The Reusable Software Library. IEEE
Software, 4:25—33, July 1987.

[91] Charles Rich and Richard C. Waters. The Programmer’s Apprentice. ACM
Press, 1990.

[92] Richard C. Waters. The programmer’s apprentice : A session with KBE—
macs. IEEE Transactions on Software Engineering, 11(11):1296-l320, Novem-

ber 1981.

[93] Amy Moormann Zaremski and Jeannette M. Wing. Signature Matching: A
Key to Reuse. In Proceedings of the ACM SIGSOF T ’93 Symposium on the

Foundations of Software Engineering, pages 182-190, December 1993.

[94] T. G. Evans. A program for solution of geometric analogy intelligence test

questions. 1968.

[95] G. Polya. Induction and Analogy in Mathematics. Princeton University Press,
1954.

[96] Robert E. Kling. A Paradigm for Reasoning by Analogy. Artiﬁcial Intelligence,
2:147—178, 1971.

[97] J. Carbonnell. Learning by Analogy: formulating and generalizing plans from
past experience. In Machine Learning. Tioga, 1983.

[98] J. Carbonnell. Derivational analogy. In AAAI-83. 1983.

[99] B. Indurkhya. A computational theory of metaphor comprehension and analog-
ical reasoning. PhD thesis, Boston University, 1985.

[100] Brian Falkenhainer, Kenneth D. Forbus, and Dedre Gentner. The Structure-
Mapping Engine: Algorithm and Examples. Artiﬁcial Intelligence, 41:1—63,

1989.

[101] Dendre Gentner. Structural Mapping: a theoretical framework for analogy.
Cognitive Science, 7, 1983.

[102] S. Kedar-Cabelli. Analogy - from a Uniﬁed Perspective. In D.H. Helman, editor,
Analogical Reasoning, pages 65-104. Klumer Academic Publishers, 1988.

[103] Stephen Owen. Analogy for Automated Reasoning. Academic Press, 1990.

208

[104] Nachum Dershowitz. Program Abstraction and Instantiation. ACM Transac-
tions on Programming Languages and Systems, 7(3):446—477, July 1985.

[105] A. Goldberg. Reusing Software Development. In Proceedings of the 4th
ACM SIGSOF T Symp. on Software Development Environment, pages 107-119,
Irvine, California, December 1990.

[106] Mitchell D. Lubars. The ROSE-2 Strategies for Supporting High Level Soft-
ware Design Reuse. In Michael R. Lowry and Robert D. McCartney, editors,

Automating Software Design. MIT Press, 1991.

[107] Sanjay Bhansali. Domain-Based Program Synthesis Using Planning and Deriva-
tional Analogy. Technical Report UIUCDCS-R-91-1701, Department of Com-
puter Science, University of Illinois at Urbana-Champaign, May 1991.

[108] Neil A. Maiden and Alistair G. Sutcliffe. Exploiting Reusable Speciﬁcations
Through Analogy. Communications of the ACM, 35(4):55-64, August 1992.

[109] Chung-Horng Lung and Joseph E. Urban. Analogical Approach for Software
Reuse. In Proceedings of Golden West International Systems, 1992, Reno,

Nevada, June 1-3, 1992.

[110] Chung-Horng Lung and Joseph E. Urban. Integration of Domain Analysis and
Analogical Approach for Software Reuse, 1993.

[111] Alberto Coen-Porisini, F lavio De Paoli, Carlo Ghezzi, and Dino Mandrioli. Soft-
ware Specialization Via Symbolic Execution. IEEE Transactions on Software

Engineering, 17(9):884-899, September 1991.

[112] D.B. Arnold, D.A. Duce, and G.J. Reynolds. An approach to the formal speci-
ﬁcation of conﬁgurable models of graphics systems. In Eurographics ’87, pages

439—463, 1987.

[113] George S. Carson. The speciﬁcation of computer graphics systems. IEEE Com-
puter Graphics and Applications, pages 27—41, September 1983.

[114] George S. Carson. An approach to the formal speciﬁcation of computer graphics
systems. Computers and Graphics, 8(1):51—57, 1984.

[115] D.A. Duce. GKS, structures and formal speciﬁcation. In Eurographics ’89,
pages 271-287, 1989.

209

[116] D.A. Duce and E.V.C. Fielding. Towards a formal speciﬁcation of the GKS
output primitives. In Eurographics ’86, pages 307-323. 1986.

[117] D.A. Duce, E.V.C. Fielding, and L.S. Marshall. Formal speciﬁcation of a small
example based on GKS. ACM Transactions on Graphics, 7(3):180—197, July

1988.

[118] Eugene F iume. Toward realistic formal speciﬁcations for non-trivial graphical
objects. In Eurographics ’89, pages 289—300, 1989.

[119] Rupert Gnatz. Approaching a formal framework for graphics software stan-
dards. Computers and Graphics, 8(1):39—-50, 1984.

[120] Joseph A. Goguen. Modular Algebraic Speciﬁcation of Some Basic Geometrical
Constraints. Technical Report CLSI-87—87, March 1987.

[121] William R. Mallgren. ACM Distinguished Dissertation Series, The MIT Press,
Cambridge, Massachusetts, London, England, 1982.

[122] William R. Mallgren. Formal speciﬁcation of graphic data types. ACM Trans-
actions on Programming Languages and Systems, 4(4):687—710, October 1982.

[123] John Guttag and Jim J. Horning. Formal Speciﬁcation as a Design Tool. In Sev-
enth Annual ACM Symposium on Principles of Programming Language, pages
251—261, 1980.

[124] Bernard Sufrin. Formal Speciﬁcation of a Display-Oroented Text Editor. Sci-
ence of Computer Programming, 1:157-202, 1982.

[125] K. T. Narayana and Sanjeev Dharap. Formal Speciﬁcation of a Look Manager.
IEEE Transactions on Software Engineering, 16(9):1089—1103, September 1990.

[126] Robert J.K. Jacob. Using Formal Speciﬁcation in the Design of a Human-
Computer Interface. Communication of ACM, 26(4):259-264, 1983.

[127] Robert J.K. Jacob. A Speciﬁcation Language for Direct-Manipulation User
Interfaces. ACM Transactions on Graphics, 5(4):283-317, October 1986.

[128] Michael R. Laux, Robert H. Bourdeau, and Betty H.C. Cheng. An integrated
development environment for formal speciﬁcations. In Proc. of IEEE Inter-
national Conference on Software Engineering and Knowledge Engineering, San
Francisco, California, July 1993.

[129]

[130]

[131]

[132]

[133]

[134]

- [135]

210

M. V. LaPolla, J. L. Sharnowski, B. H. C. Cheng, and K. Anderson. Data
parallel program visualizations from formal speciﬁcations. Journal of Parallel
and Distributed Computing, May 1993.

Betty H.C. Cheng and Gerald G. Gannod. Constructing formal speciﬁcations
from program code. In Proc. of Third International Conference on Tools in
Artiﬁcial Intelligence, pages 125-128, November 1991.

Gerald G. Gannod and Betty H.C. Cheng. A two-phase approach to reverse en-
gineering using formal methods. In to appear in the Proc. of Formal. Methods in
Programming and Their Applications Conference, June 1993. The proceedings
will appear as part of the series of Lecture Notes in Computer Science (LN CS)

published by Springer-Verlag.

K.J. Holyoak and RR. Thagard. A Computational Model of Analogical Prob-
lem Solving. In A. Ortony and S. Vosniadou, editors, Similarity and Analogic
Learning, pages 242—266. Cambridge University Press, 1990.

Kuo—Chung Tai. The Tree-to-Tree Correction Problem. JACM, 26(3):422-433,
July 1979.

Christopher M. Hoffmann and Michael J. O’Donnell. Pattern Matching in Tree.
JACM, 29(1):68-95, January 1982.

Kaizhong Zhang, Dennis Shasha, and Jason Tsong-Li Wong. Approximate
Tree Matching in the Presence of Variable Length Don’t Cares. Technical
Report CIS—9l-21, Department of Computer Science, New Jersey Institute of
Technology, Newark, New Jersey, 1991.

CHIGRN STATE UNIV.

1111111[11|91|l913 91111191 9999999..