Introduction to Inferencing
(Also known as a Reading Guide on Knowledge-based Legal Applications)
Russell Allen and Graham Greenleaf
Last updated 3 March 2001.
= Required reading. All other links indicate optional reading.
1. Introduction
1.1. Objectives
The aim of this document is to help you obtain an understanding of the possibilities and limitations of knowledge-based technologies when applied to law.
Particular areas of coverage include:
- The history of Artificial Intelligence and 'expert systems', and the main models of computerised inferencing (drawing inferences) that have emerged.
- The relationship between legal theory (jurisprudence) and computerisation of legal tasks.
- Rule-based reasoning and its applications to statute-based law.
- Case-based reasoning and the problems of applying it (or neural networks or anything else) to case law.
- The relationships between inferencing and the other technologies of hypertext and text retrieval.
- The special issues in web-based legal inferencing, and the opportunities for greater integration with hypertext and text retrieval.
- Automated legal document generation systems - why so little progress?
1.2. Web resources
The most substantial collections of research papers available for free access can be found at:
The Foundation for Legal Knowledge Systems in the Netherlands; this site contains 100+ papers from the JURIX Conferences since 1989. Many of the papers are highly technical (particularly those concerning various types of logics), but many are also accessible to those new to the field.
Computerisation of Law Resources (CompLRes) database on AustLII - contains papers from AustLII's 'Law via Internet' Conferences, as well as other papers. Includes quite a few papers on inferencing systems.
Links to other research papers, home pages of researchers etc can be found at:
WorldLII Catalog:Computerisation of Law:Inferencing - includes many of the resources which will be used in this Study Guide.
Michael Aikenhead's Artificial Intelligence and Law Resources (Durham Centre for Law and Computing) - one of the most extensive resources available
It is also worth checking the Other Indexes listed in the WorldLII Catalog.
1.3. Print resources
Here are the main starting points:
- Proceedings of the (biennial) International Conference on Artificial Intelligence and Law (ICAIL) since 1987 are the single most substantial resource in the field - The
web version of the Conference Proceedings is hosted by the ACM Virtual Library, but not all of the papers are here, and all are password protected and only accessible to ACM members. The programmes for each of the Conferences - and therefore the tables of contents for the print Proceedings are at the ICAIL home page.
The journal Artificial Intelligence and Law (Kluwer) is the main refereed journal in the fieldsee The articles are not on the web.The journal's home page contains the tables of contents of previous issues.
John Zeleznikow and Dan Hunter, Building Intelligent Legal Information Systems: Representation and Reasoning in Law Kluwer, 1994 is the most approachable textbook to give an overview of the field, and is recommended to those commencing study of this area.
See the selected 'Computers and Law' monographs in the UNSW Library for further print resources. Most monographs in this field are very specialised and technical.
2. Background and history of 'AI & law'
For a brief introduction to the history of AI and law, and the components of a 'legal expert system', see Graham Greenleaf
`Legal expert systems - robot lawyers?' (1989) Australian Legal Convention. . It's rather old, but still useful, particularly on the components of a 'legal expert system'.
2.1. More detailed introductions / overviews
3. Legislation-based inferencing systems
3.1. Can rule-based systems be used in law?
A controversial starting point for arguments about the extent to which rule-based reasoning systems are useful in law is Robert Moles
'Logic Programming - An Assessment of Its Potential for Artificial Intelligence Applications In Law' Journal of Law and Information Science (1991) vol 2 no 2 (now on UniServe Law) Moles' argument is explicitly a critique of the body of work known as `logic programming in law', but is a more general attack on (at least) all attempts to create rule-based expert systems in law. In effect, he concludes that the value of any such research is an open question.
I have listed below some of Moles' principal arguments (but without doing them justice in a brief description), followed by a brief comment on how such an argument could be answered.
- Isomorphism
- Moles gives a damning paraphrase of Bench-Capon's description of how isomorphism is maintained in the Alvey Project, and rightly concludes that it shows considerable insensitivity to the nature of legal interpretation. Answer?: Even if Bench-Capon's is a poor example of isomorphism, how is this a criticism of isomorphism as an ideal? Can it be done better? (see the next section of this guide for more extensive discussion)
- Statutes are interdependent
, so it is useless to represent in isolation; if we represented all possibly relevant rules, inferencing engines couldn't cope. Answer?: A good point, there is a practical limitation to the comprehensiveness of any expert system. But the same criticism applies to a textbook, but doesn't stop them being written. Is this a double standard? Isn't it a question of expert judgment as to what other rules must be added;? Also, doesn't the user still need to use discretion or an over-ride capacity ? (This is the `its only a tool' argument.)
- Statutes and the common law are interdependent
, so its no use just representing statutes. Moles gives superb example concerning the British National Act (the subject of a famous early legal expert system by Sergot et al) - to understand the Act it is necessary to understand a statue 250 years old and the case law on it at the time. Answer?: There is no way around this but to rely on expert judgement to add some rules (or opportunities for users to make decisions?) reflecting these interpretative issues, as there is no possibility of comprehensiveness!
- Consequentialism
- Courts [always?] apply consequentialist reasoning (ie consider the consequences of reasoning in deciding whether the reasoning is adequate), so it is impossible to separate the knowledge representation from the inferencing engine. He give an excellent example of the DIY shop case. Answer?: Perhaps there isn't any, unless you hypothesise meta-rules like the one he quotes; also, although he (via MacCormick) claims that such reasoning is encountered `at every turn', it must be something of an empirical question as to how much this threatens the viability of legal expert systems.
- Words are ambiguous
- He correctly chides the logic programmers for silly statements about choosing unambiguous interpretations of legal sources; and for conducting surveys to determine meanings of words. Answers?: He wrongly conflates Susskind's misuse of the `survey' approach with what the logic programmers do, and doesn't seem to appreciate Susskind's `stop here' approach to ambiguity. The answer to this can only be that the ambiguity of language can only be dealt with by leaving some interpretative decisions in the operation of an expert system to the user (while attempting to assist the user to find the correct interpretation), and by attempting to make it explicit where interpretations have been built in to the knowledge-base (part of the idea of isomorphic representation).
- Rules [or groups of them] are not atomistic entities
, so isomorphism's claimed advantages in terms of maintainability are fictitious. Answer?: There is a major error of understanding in Moles' thinking that to say that to change one rule is to change the meaning of other rules is somehow a criticism of isomorphism - this is precisely what isomorphism aims to achieve!
- The view that there is no need for legal expertise is obvious rubbish
- Moles suggest that if the logic programmers (in the British Nationality Act example) had obtained expert advice it would have destroyed their project, and makes the acerbic comment that `They have used logic programming to model the logical consequences of their own untutored assumptions as to how a statute, dealing with an area about which they have no experience, would be read and used by someone about whom they have no understanding'. No answer needed, Moles is correct that legal expertise is needed.
Moles concludes that there has been no progress by anyone in the previous 20 years! He sees the need to approach with an `open mind, the basic question of the suitability of the legal domain for he development of expert systems'
It is worth asking 'what is Moles really criticising?' - Is his model of 'logic programming' a straw man? If he is criticising the quest for artificial intelligence in the strict sense, then his pessimism is justified, but not if he is purporting to generalise this into a more overall critique of inferencing systems as useful tools for lawyers. If he is, then the relevant comparison may be a textbook, not a human mind. Perhaps this is a mole-hill not a mountain.
3.2. Isomorphism and knowledge representation
'Isomorphism' in legal knowledge representation, used in the sense of 'creating a well defined correspondence between source documents and the representation of the information they contain [that is] used in the system' [Bench-Capon and Coenen, 1992], has been shown to provide many advantages for the operation of legal inferencing systems (despite dissents such as that of Moles).
Arguments for the use of an isomorphic approach and a 'quasi natural language' representation are discussed in
Greenleaf, Mowbray and van Dijk
5.5. Isomorphism and the Value Of 'english-Like' Representations (in 'Representing and using legal knowledge in integrated decision support systems: DataLex Workstations' [1995] COL 1).
The approach taken in this paper is very similar to that taken by SoftLaw (see following section), who have applied an isomorphic and English-like representation to their development of large commercial inferencing systems.
Additional reading
3.3. Other papers and resources on rule-based inferencing in law
4. Case study of legislation-based inferencing systems: Softlaw
Softlaw is an Australian company which is one of the world leaders in the field of large-scale legal inferencing systems. SoftLaw was founded in 1989, based on expert systems work by Mead and Johnson from the mid-1980s, and has grown to over 50 staff. Its
clients include some of the largest public sector agencies in Australia, and some overseas. SoftLaw has won many national and international awards.
Browse the SoftLaw pages, and read the following to obtain an introductory idea of SoftLaw's approach:
The Softlaw approach
Softlaw and Government and the links from that page.
What is Statute Expert?
There are also some
demonstration versions of Statute Expert which are available from the site, which you might want to look at. Go to the sample facts, then do the sample applicationi
4.1. Papers concerning SoftLaw's work
Philip Kellow (Director, Centre for Legal Process, Sydney)
Legal expert systems via the Internet [1997] CompLRes 9 (Paper presented at AustLII's "Law Via The Internet '97" conference). This project (no longer proceeding) applied Softlaw's technology to development of a free access application via the Internet: "The first eLAPS module prepares an application for review to the Commonwealth Administrative Appeals Tribunal (AAT). This Tribunal reviews decisions made by Government agencies under a range of Commonwealth statutes. "Most of SoftLaw's research papers are available from their website - see the list of Conference and Seminar Papers available on request.
The following older papers concerning SoftLaw are in the printed volume of Developing Computer Applications to Law (B) subject materials Vol # 5 - Case Study: SoftLaw available at the UNSW Law Library Reserve Desk.
- Peter Johnson and David Mead `Legislative knowledge base systems for public administration - Some practical issues' Proc. 3rd International Conference on Artificial Intelligence and Law, ACM Press, 1991
- Surendra Dayal, Peter Johnson & David Mead `Natural language - An appropriate knowledge representation scheme for the administrative domain' SoftLaw Corporation, 1993
- Surendra Dayal et al `Beyond knowledge representation - Commercial uses for legal knowledge bases' Proceedings 4th International Conference on Artificial Intelligence & Law, Amsterdam, 1993, ACM Press
- Softlaw Corporation - `STATUTE Corporate - Component Technologies Guide' (Chapter on STATUTE Expert) Softlaw Corporation, Canberra, 1991
5. Integration of legal inferencing systems with other technologies
5.1. The DataLex Project
A principal aim of the DataLex Project (1985-1994) was to explore the integration of legal inferencing technologies with hypertext and text retrieval.
- Greenleaf, Mowbray and van Dijk (1995)
'Representing and using legal knowledge in integrated decision support systems: DataLex Workstations' [1995] COL 1; This article gives the pre-internet background to AustLII's inferencing research.
The following Parts set out the arguments in favour of integration of technologies:
Part 5.
ntegrating inferencing with hypertext and text retrieval
Part 6. Multi-modal inferencing and integration
One of the earliest papers on the DataLex project was Tyree, Mowbray and Greenleaf The DataLex Project, presented at the first AI & Law Conference in Boston, 1987 (it describes the earliest DataLex 'shells', much earlier than those described in the above paper).
5.2. AustLII and legal inferencing via the web
AustLII is a proponent of legal inferencing systems via the internet, partly because of the potential it creates for legal inferencing systems to be integrated with large online collections of primary legal materials (like AustLII) which are being updated constantly.
Russell Allen et al
'With a wysh and a prayer: An experiment in cooperative development of legal knowledgebases' 2000 (2) The Journal of Information, Law and Technology (JILT)
Part 5. New Legal Services via the Web - AustLII's Research on Legal Inferencing (in Greenleaf Mowbray and King (1997) The AustLII Papers - This paper discusses these additional advantages.
AustLII's Legal Inferencing Project Home Page
5.3. Further reading
6. Case-based reasoning systems in law
There are many attempts to develop case-based reasoning systems in law, but little information seems to be available on the web. The following notes only discuss one example of a case-based reasoning system applied to law.
6.1. Tyree's FINDER
These notes deal with various implementations of FINDER, Alan Tyree's implementation of case-based reasoning using 'precedent analysis by nearest-neighbour decision analysis' (PANNDA). FINDER is actually the particular example application using the 'finder's cases', whereas the general approach is more properly called PANNDA. However, the general approach is also referred to below as 'FINDER'.
Alan L Tyree
'FINDER: an expert system' (1987??) - The original paper on the FINDER case-based reasoning system, but unfortunately it does not contain a detailed explanation. However, it should be read as an introduction to FINDER.

FINDER (The Finder's Cases) , is an implementation of FINDER using the wysh software, with a rule added about trespassers .
Some notes on Tyree's FINDER
These notes are derived from Alan Tyree Expert Systems in Law Prentice Hall 1989 Ch 7, pgs 137-44, and from discussions with Alan Tyree. They are, however, my interpretations (GG).
`Similarity' is the basis of FINDER - `The concept [of similarity] is at the heart of any well-defined notion of precedent.' (eg the maxim `like cases should be decided alike'); PANNDA attempts to model similarity in a `direct' fashion, by a mathematical measure of the presence/absence of key attributes of each case.
The key assumptions in FINDER are:
- that a group of cases can be identified as relevant to deciding a legal issue;
- that it is impossible to deduce any rule(s) of general application from the cases;
- that it is, however, possible to identify attributes of those cases which are relevant to the Courts' decisions [although exactly how they are relevant can't be reduced to a rule]; for PANNDA, these attributes must also be reduced to Yes / NO form]
- `that we have exercised care and skill in representing the attributes and values of each of the cases at an appropriate level of generality' (Tyree p138) -- ie the application of expertise.
This approach could be relevant where any statute or case simply lists factors which are to be taken into account or `weighed' in reaching a conclusion.
A summary of the method used in FINDER (in the 'finder's cases' example):
- each of 10 factors identified by expert given a 1/0 ie present/absent in case;
- low variance factors = where there are a preponderance of 1s or 0s in the 8 cases; high variance = where a 4/4 or 3/5 split; they ranged from 7/1 (low) for A4 and A9 to 4/4 (high) for A2; the low variance factors (7/1 etc) are considered to be more significant than the high variance ones ; therefore they are given a lower weighting (VAR) (eg .109), whereas high variance ones have higher weighting (eg .250);
- for each Q, each factor in the test case is compared with the case; if the factors are the same, the score is 0; if different, the score is 1/VAR; the total distance between each case and the test case is the sum of these 10 scores ; so, the more zeros, the lower the scores; and the more difference on low variant factors, the higher the scores.
- the case with the lowest score is regarded as the closest to the test case; those with higher scores progressively further away. ie not simply a matter of which one has the highest no of 0s and 1s the same as the test case.
- The assumption behind the measure of variance (which determines the weight to be given to attributes) is that factors which almost all cases share, but 1 or 2 don't (ie 7/1/ or 6/2) are likely to be the most significant factors determining the outcome of a set of cases. Therefore, if a case differs from the test case in respect of one of these, this should mean it is `further away' than if it merely differed on one factor which is spread uniformly b/w all the cases (ie 4/4/ or 3/5).
- The measure of variance is not correlated with the outcomes of the cases; ie if only two cases in a set of 10 have a Y value for variable A, A is regarded as more important than B which has a 50/50 Y/N split -- but the system doesn't know if there is a significant correlation b/w a Y answer for variable A and a particular outcome for the precedent. Tyree explains (verbally) that there are usually too few cases to make such a correlation in a way that will be statistically significant [otherwise, wouldn't we just represent such a correlation as a rule??]. The justification for his approach is that the cases, and the factors described within them, are not drawn at random, but are selected by the expert because of their significance. Therefore, variables with 4/4 or 5/3 splits actually separate the cases into groups more effectively, so `why would an expert include a 7/1 variable unless it is particularly significant'. He agrees this is contentious.
- The outcome of a FINDER consultation is that a report is generated with identifies the 'nearest' case (to be followed) and the nearest case with the opposite outcome (to be distinguished), and the similar and distinguishing facts for both nearest case and nearest opposite case. It is a simulation of a legal argument from a statistical process. There is an attempt at recognition of hard cases, when it identifies that there are two 'nearest' cases (ie the same distance away from the test case) with opposite outcomes.
Some imitations of FINDER:
- It is limited to true / false values (the wysh version also includes 'uncertain')
- It makes no distinction between the hierarchical precedential value of cases. It is questionable whether this is a criticism -- perhaps it should only be used where there are a `flat' set of cases, where no particular case is regarded as more authoritative than others.
- It does not emulate human legal reasoning in reaching its conclusions [as far as we know]. Tyree attempts to show that it does produce a similar result, however.
Other papers about FINDER
6..2 'Split-up' and related systems (Stranieri and Zeleznikow)
One of the best-known current Australian research projects in this area is in the domain of property settlements in family law, legal decision support system built by the Database Research Laboratory at Melbourne's LaTrobe University Computing Institute.
'Family Law by Computer' (a discussion of 'Split-Up'), transcript of the Law Report, ABC Radio National, 16 July 1996 (Susanna Lobez with Andrew Stranieri and John Zeleznikow )
6.3. Additional reading
For papers generally concerning case-based reasoning in law see:
WorldLII Catalog:Computerisation of Law:Inferencing:Case-Based Reasoning
Argumentation and legal reasoning (an index of articles and web pages) on CBR-Web ( the Case-Based Reasoning homepage at the University of Kaiserslautern)
7. Automated document generation
There is relatively little available on automated document generation in law, despite it seeming to be one of the most obvious commercial implementations of inferencing technologies to law.
- Greenleaf Mowbray and van Dijk
'6.1. Document Generation And The Inference Engine' (in 'Representing and using legal knowledge in integrated decision support systems: DataLex Workstations' [1995] COL 1) - discusses the varieties of automated document generation systems in law, and the implementation of document generation in the DataLex Workstation Software.
See SoftLaw for details of its extensive document generation systems for government administrative systems.
Tunde Meikle (University of Ballarat) A decision support architecture in the domain of refugee law [1999] CompLRes 39 (Paper presented at AustLII's "Law Via The Internet '99" conference). "This paper outlines the beginnings of work in progress to develop an innovative approach to an architecture for a decision support system for decision makers in the Refugee Review Tribunal. "
Andrew Stranieri, John Yearwood and John Zeleznikow Mapping inference trees to document structure for text generation in refugee determinations (JURIX '99 Conference proceedings)
8 AI applications to text retrieval
One of the most active fields of AI research in law is to adapt AI techniques to improve legal text retrieval.
- Paulo Quaresma and Irene Pimenta Rodrigues (Universidade Nova de Lisboa, Portugal)
PGR: A cooperative legal IR system on the web [1999] CompLRes 39 (Paper presented at AustLII's "Law Via The Internet '99" conference)
James Osborn and Leon Sterling (University of Melbourne) JUSTICE: A judicial search tool using intelligent concept extraction [1999] CompLRes 35 (Paper presented at AustLII's "Law Via The Internet '99" conference). The paper states:
"The research was motivated by a desire to provide legal researchers with a tool which would provide conceptual based searching of case law. The initial insight was a belief that a knowledge based approach to extracting legal concepts would perform well in the domain of legal cases, especially as regards the headnote. JUSTICE is able to recognise and extract abstract legal concepts from heterogenous digital representations of legal cases."