Name: Knowlege representation
Code: ITI8700
Link: http://lambda.ee/Teadmiste_formaliseerimine
Lecturer: Tanel Tammet tanel.tammet@ttu.ee ICT-426
Practice sessions: Riina Maigre
Archives of previous years: 2024.

Sisukord

1 Eksamitulemused
2 Exam date/times
3 Time, place, result
4 Grading
5 Contents overview: what the course is about
6 Weekly tasks
7 The first weekly task
8 Practical work
9 Books to use
10 Blocks and lectures: the main content of the course

Eksamitulemused

Esimesel eksamil (21. mai) osales kaks tudengit (232213IAPM ja 242175IAPM), mõlemi hinne on 5.
Teise eksami (27. mai) järgsed hinded leiad siit.
Kolmanda eksami (5. juuni) hinded leiad siit.
Neljandal eksamil osales üks tudeng (242319IAPM), hinne 4.

Exam date/times

Exam together with the Hajussüsteemid ITI0215

21 May: 11:30 in the room U04-103.
27 May: 11:00 in the room NRG-131
5 June: 15:00 in the room U04-103
10 June: 15:00 in the room U04-103

The materials below marked with the red E1 are the main materials for the exam, while E2 materials will also come up, but less and/or in a simpler form.

Time, place, result

Lectures (Tanel): Tuesdays 14:00-15:30 room SOC 312/313 (313 weeks 1,3,5,...; 312 weeks 2,4,6,...)
Practice sessions (Riina): Wednesdays 12:00-13:30 room U06A-204.

The lectures and sessions will be on site (physical room) plus teams. Later we may decide to do some lectures/practice sessions on teams only: this will be advertised on the page here early. Beware: recording may not always succeed.

Please join the teams of the course. The code for joining is y0bd8e5.

Weekly teams link for the lecture.

Weekly teams link for the practice session.

Grading

Practical work will give 50%, exam 50% of points, and weekly tasks additional extra up to 10%, underlying the final grade. The exam will consist of (a) several questions asking for the explanation of important concepts along with examples (b) some small excercises.

The materials below marked with the red E1 are the main materials for the exam (10 of these on the page), while E2 materials (20 on the page) will also come up, but less and/or in a simpler form.

To finish the course successfully, you need to

successfully pass three practical works
receive at least 1/3 of the max points at exam
get practice work + weekly tasks + exam at least 50% points altogehter.

Contents overview: what the course is about

The theme of the course: from SQL to natural language.

Hence the main focus of the course is on hybrid methods for knowledge representation and reasoning: symbolic AI, machine learning / neural methods and their neurosymbolic combinations. We look at this spectrum from simple (representing and using knowledge in databases) to complex (meaning of sentences in natural language and commonsense knowledge) tasks. A closely related subject is commonsense reasoning.

The course contains the following blocks:

Knowledge in SQL, RDF and JSON
Handling simple facts and rules in symbolic AI and logic
General-knowledge databases
Natural language and question answering
Uncertain knowledge and hybrid systems

Weekly tasks

The small weekly tasks are in addition to the practical work: their goal is to practice with the stuff presented at the last lecture.

The first weekly task

Ülesanne on eksperimenteerida veidi logictools.org saidiga ja teha ise läbi loengus toodud näited SQL data ja päringute esitamisest loogikas. Tähtaeg 25. veebruar.

Konkreetselt tee järgmist:

Eksperimenteeri veidi logictools.org "simple examples" valikuboksi näidetega basic ... equality, loe nende selgitusi ja püüa aru saada tõestustest.
Loe seejuures predikaatloogika väikest seletust about lehel kuni lausearvutuseni (seda pole enam otse vaja lugeda).
Kirjuta ise valmis näide vastavalt kursuse materjali Background: relations of sql and logic lk pealkirjaga "SQL join as logical rule", kus on toodud ühe client tabeliga ja client ning cars tabelitega näide sql päringute teisendamisest loogikasse (üks lihtne select ja üks join). Tee need näited reaalselt valmis nii, et logictools.org sööb nad sisse ja annab ootuspärase vastuse. Seejuures on sul vaja kasutada $ans predikaati, nagu "answers" näites toodud. Kui tahad mitut vastust (ei ole kohustuslik), siis vaata ka "multiple answers" näidet.
Saada näite-sisendid ja genereeritud tõestused õppejõule emailiga (riina.maigre at taltech.ee ja cc tanel.tammet at taltech.ee). Kui jääd nendega hätta, siis kirjuta, mida proovisid, too oma näitetekst ja selgita, mille juures hätta jäid (see on ka arvestatav tulemus): kirjuta aadressile riina.maigre at taltech.ee või tanel.tammet at taltech.ee ja pane pealkirja sisse mh sõna "formaliseerimine", et kirja kergesti üles leiaks.

Soovitused:

Negatiivset arvu sa logictoolsi vaikimisi süntaksiga otse sisestada ei saa: -200 interpreteeritakse kui loogilise eituse rakendamist 200-le. Kaks head võimalust negatiivse arvu sisestamiseks on kas kirjutada 0-200 või $difference(0,200).
Aritmeetika jaoks vaata help teksti Arithmetic peatükki.
Väiksem-tingimuse jaoks vaata complex examples valikuboksis "arithmetic" näidet.

Practical work

There are three labs.

The labs have to be presented (as a demo + small overview of the software and principles) to the course teachers and all students present at labwork time, either in the auditorium or over MS teams. The labs can be prepared alone or by teams of two or three people.

Each lab is evaluated independently and gives the same number of points. NB! When you do not meet the lab deadline, you will be awarded half of the points for this lab.

First lab

The 2025 task for the first lab is about handling and understanding triplet stores, meaning of rdf types/properties/etc and writing simple logical rules for answering questions.

Lab is updated! Future updates may include clarifications, but the task itself will not change.

Deadline: 5. March.

Second lab

See Details of the KR second lab in 2025. There are 5 options, you only have to choose one.

Lab is updated! Future updates may include clarifications, but the task itself will not change.

Deadline: ~~9. April~~ 16. April

Third lab: questions in NLP

This is updated!

Experiment with neurosymbolic reasoning: LLM as a parser plus a symbolic reasoner: 2025

Deadline: 14. May (recommended). However, it is possible to present your work in other practice sessions before or after the 14. May (including 21. May).

NB! The task description above has just been updated (at 23. April) with the example code and files developed and presented during the 22. April lecture.

Books to use

Main materials:

Get the basic background about classical automated reasoning from the book chapter pages 297-345 by Tanel Tammet and optionally more details from the set of course notes from Geoff Sutcliffe.
The book for natural language (NLP): Speech and Language Processing by Dan Jurafsky and James H. Martin. here is the 2nd edition and (suggested) here the web page for the draft 3rd edition. Recommendation: read from the beginning up to an including chapter 6: Vector Semantics and Embeddings, then read the appendix chapter Logical Representations of Sentence Meaning.

Other useful books and tools for symbolic K&R:

logictools.org for experimenting with classical automated reasoning
Symbolic reasoning themes are covered in this book with (hopefully) freely accessible pdf-s via Taltech. See also slides for the book.
Similarly, the freely accessible pdf of the handbook of knowledge representation gives a detailed and thorough coverage of the subject; far more than necessary for the course.
You may want to get a small intro to Prolog by reading the first 100 pages of a classic Art of Prolog book. It is not strictly necessary for the course (we will not be using Prolog) but it surely helps. Besides, the book is interesting, quite easy to read and works out a lot of interesting examples.

Other useful material for neural NLP:

Andrej Karpathy nanoGPT and neural networks zero to hero course
GPT architecture

Observe that a noticeable part of the course contents are not covered by these books: use the course materials and links to papers, standards and tutorials provided.

Blocks and lectures: the main content of the course

The details and materials of lectures in the future are from 2024 and will be revised/extended. The materials of the current or passed lectures are up to date.

Block 1: knowledge in SQL, RDF and JSON

Lecture: Intro and background: 4. February

Intro presentation:

teadmised_intro2025.pdf

Recommended non-obligatory listening: Francois Chollet, Yann LeCun, Gary Marcus, Demis Hassabis, Ilya Sutskever, Ben Goertzel, Tim Rocktäschel,

Lecture: Nosql, json and rdf in databases: 11. February

Lecture materials:

We cover Background: relations of sql and logic E1

and start with schemaless databases, RDF and RDFS E1 (also important for practice work)

Block 2: handling simple facts and rules in symbolic AI and logic

Lecture: simple rules in rdfs, logic and owl: 18. February

Main lecture material: schemaless databases, RDF and RDFS

Rdf example developed during lectureE2

Have a brief a look at this, no need to read thoroughly:

Official w3c rdf and rdfs primer E2

The main materials covered in the lecture:

json-ld (check out playground examples) E1 currently most popular triple representation language on top of json. see also wikipedia and w3c standard: for the latter, read the basic concepts chapter E2.
rdfa E2 triple markup language suited for html, see also wikipedia
schema.org (E2 you can surf on the schema.org site and understand/find answers there)) we looked at in the last lecture: property markup vocabulary suggested by Google, Microsoft and others.

And then have brief look (skim through) at:

Official w3c owl primer but it is better to look at this intro presentation and continue with the owl rules presentation
sparql tutorial and wikipedia take on sparql.

Certainly read:

Hea jutt Whatever Happened to the Semantic Web?

We will also start at looking and understanding the examples in logictools.org

More resources:

sparql implementations in wikipedia
various rdf tools listed at the bottom of the page

Lecture: what the reasoner does: 25. February

NB!This lecture will be the following recording from 2024 pluss self-study reading and experiments only, not in the lecture room:

Recording

We will consider the reasoning part of K&R to understand what provers do:

rules, logic, provers E1
logictools system running gkc reasoner in the browser: try out the "simple" and "complex" examples, read explanatory texts and the manual about predicate logic
Have a look at the gkc github repo, including precompiled binaries for command line in the release 0.6.0 and the tutorial Examples/Readme.md.
Travel examples from lecture 2021

You may want to have a look at the additional materials - from easier to harder - from the

book chapter pages 297-345 and optional extras, 346-382; by Tanel Tammet.
the set of course notes from Geoff Sutcliffe,
the implementation presentation from Stephan Schulz
5-lecture prover course from the authors of Vampire
and the readable-and-hackable PyRes reasoner for educational use.

Lecture: looking into main large knowledge bases: 4. March

Slides:lecture slides E2, a tool to explore Wordnet taxonomies can be found here: https://github.com/martinve/wntool

You can search/investigate what kind of data is available and you can find out actual data from these databases with some work, by surfing on the web pages of the systems.

About knowlege base creation using LLMs: have a look at this paper.

We will have a look at the goals, main content and differences between:

wordnet see tptp axioms
dbpedia, see also classes and dbpedia wiki
wikidata
yago old page and new page
babelnet described in wikipedia
conceptnet described in wikipedia
nell website (sometimes down) and in wiki and a good paper.
framenet described in wikipedia, see example
cyc but just read wikipedia and see whitepaper also tptp small random selection of axioms
sumo and adimen-sumo, see sumo in wikipedia and tptp axioms
tptp: a large set of axioms and problems in logic usable for automated reasoners
schema.org: property markup vocabulary suggested by Google, Microsoft and others.
Wolfram alpha stuff

Block 3: natural language and question answering

Lecture: Intro to NLP, n-grams, word vectors: 11 March

These three wikipages give useful introductory details:Natural language processing, Knowledge extraction, Natural language understanding.
Tanel Alumäe keelemudelitesse sissejuhatavad slaidid E1 ja loengusalvestus
Väga hea õpik: 3. variandi draft või 2. variandi terve pdf.
- Vektorsemantikast arusaamiseks loe 3. variandi ptk 3. N-gram Language Models E2 ja siis chapter 6: Vector Semantics and Embeddings E2. Vektorsemantika kasutamine närvivõrkudes a la Bert ja GPT on hästi kirjas ptks 7: Neural Networks and Neural Language Models
- Slaidid eelmiste peatükkide kohta: Ngrams ja word vectors. E2
Google ngrams ja reaalne data.
GloVE: a relatively simple vector representation
Kui tahad katsetada toore Wikipedia dataga, siis saab loengus vaadatud töödeldud variante siit (tarballid sisaldavad datat, selgitavat README-d ja tarkvara selle ise ehitamiseks): a compacted pure-text version of full wikipedia, A lemmatized version of wikipedia texts, Several co-occurrence matrices and lists of top-co-occurring words for wikipedia
Online demo of NTLK running some basic NLP tasks
Loengus tehtud katse Glove vektoritega selle jutu järgi.

Lecture: large language models (LLM): 18 March

Lecture material

Eesmärk: saada pinnaliselt aru LLM ja transformerite (NLP masinõppe põhiasjad) peamistest ideedest. Detailid on tegelikult keerulised ja nendest arusaamine võtab hulga rohkem aega, kui meil kursuses on, sestap lepime vähesega. Samas, kui sul endal huvi, siis loomulikult on teretulnud detailsem iseõpe.

Enne järgmisse materjali süvenemist tutvu kindlasti vektorsemantikaga, mida seletasime 11 märtsi loengus.

Mida vaadata ja lugeda:

Eriti soovitav

Andrej Karpathy progemisega-koos loeng: vaata see süvenemisega läbi! Vt ka Karpathy.

Paratamatult jääb enamus detaile loengus vähearusaadavaks (kui ise ei kuluta hulga aega järele katsetamiseks), aga sellegipoolest on see tõenäoliselt kõige parem loeng transformeritest: mingi tunnetuse sellest, et mis toimub, saab.

Pikemaks lugemiseks:

visualizing neural machine translation and then the illustrated transformer
Jurafsky & Martin book: neural networks and the slides. Alternative: CMU Mitchell course: neural networks basics
Jurafsky & Martin: chapter 9, transformers

Optsionaalselt üldisem arusaam klassikutelt:

Jann LeCun: värske episood Lex Fridmani podcastist
Wolframi pikk jutt lugemiseks koos närvivõrkudega üldisema taustaga
Rodney Brooksi ennustused transformerite kohta

Optsionaalselt diipim arusaam:

nanoGPT: [Karpathy reaalne näitekood GPT ise-ehitamiseks (otse eelmise loenguga seotud) : seda saab ilma suuremate pingutusteta käima linuxil ja macil, windowsil tekib aga suuremaid probleeme (proovisin ise linuxil järgi).
Taustaks: Karpathy terve loenguseeria, kui sul tekib sügavam huvi
Jurafsky & Martin ptk 10 Large Language Models kui tahad ise juurde lugeda.
Good free (?) book: Build a Large Language Model (From Scratch)
väga detailne ja mahukas videoloengutega deep learningu kursus, mh raamat.

Lecture: LLM usage patterns and RAG: 25 March

Lecture material

Useful reading in addition to pointers in the lecture material:

application landscape overview
early taxonomy of real business use
LLM improvement for applications and a basic overview of customizing llms
Finetuning llama 2
RAG:
- simplest RAG intro E1
- more rag details: a good architecture picture, may skip the improvements part after that
- encoder/decoder and Seq2seq as a generalization of word2vec and openai embedding model api
- vector database E2 as a useful tool
- different kinds of rag E2 based on paper
- rag by langchain
- a fancy rag paper
- example estonian startup using a kind of rag

Block 4: hybrid systems and uncertain knowledge

We will focus on (a) world modeling and uncertainty in KR (open/closed world, frame problem, probabilities, exceptions) and (b) current research on building hybrid systems

Lecture: neurosymbolic reasoning + open/closed world, planning, frame problem, blocks world. 1 April

First, some results/challenges in neurosymbolic reasoning.

Second, planning, open/closed worlds and the frame problem.

The standard example for these is a blocks world: some blocks are on the table and the robot arm can lift single blocks and put them on other blocks or on the table. The arm cannot lift several blocks or a tower of blocks. Then, given a position of blocks at initial situation, can the robot arm create a required new position of blocks? Can we get the steps required? For getting the steps we use the $ans predicate.

One hard issue arising is that how do we know that doing some action like lifting a block does not create side effects like stumbling existing towers, moving other blocks etc? This issue is called the frame problem and it has no easy solutions.

Importantly, the frame problem arises since normal first order logic has

the open world assumption E1 (must read): if we do not have (and cannot derive) a positive fact like P(a) and neither a negative fact like -P(a), we assume we simply do not know which holds.

Prolog and databases operate under the

closed world assumption E1 (must read): if we do not have (and cannot derive) a positive fact like P(a), we automatically assume that -P(a) must be true. For example, Prolog "negation" really means "cannot prove". One consequence of this assumption is that we cannot represent real negation and disjunction in an OK manner (Prolog does not contain these) and cannot satisfactorily speak about "true", "false" and "unknown". See also negation as failure in prolog

Please look at and read these materials about the frame problem:

tiny video of the Boston Dynamics robot failing: looks like a frame problem in the blocks world :)
overview presentation: E1 must read.
a classic AI philosophy paper about the frame problem: must read the first three pages (rest is optional).
wikipedia about frame problem: E2 skim it through (no need to read all the details) to get a rough idea about different approaches to the frame problem. None of these are really good.

These two readings are optional:

frame problem at the Stanford Encyclopedia of philosophy: not obligatory, but a good read to get a better understanding.
another classic about frame problem: read this only if you became really really interested about the whole issue: it goes quite deep, although it is not especially technical.

You may also want to have a look at the algorithmic complexity and efficient solution algorithms, regardless of how we formalize the problem: see this article.

Next, importantly, you should experiment yourself with gkc and both of the following files. Please copy and read the files and understand the encoding. At the end of the files are several queries to experiment with: instructions are also there.

In both of them the predicate holds(X,Y) is used to describe something X about the blocks world (like, one block is on another or robot holds some block) and the second argument Y describes the state we have arrived to via robot actions from the original state (like, picked up a block in the initial state and then did put the block down on top of another).

Simple axiomatization of the blocks world: E2 please also read through the comments explaining the axiomatization and queries!
More complex blocks world axiomatization taken from the TPTP problem set for first order logic, concretely by concatenanting and commenting axiom sets PLA001-0.ax, PLA001-1.ax and queries PLA004-1.p, PLA005-1.p, PLA019-1.p

To get a simple visualization of the blocks world, have a look at this tiny video.

Reasoning with uncertainty and intro to lab 3: 8 April

Loengu sissejuhatava osa presentatsioon: Reasoning_with_uncertainty.pdf E1.

Almost all the knowledge we have is uncertain: there are many exceptions to a rule or a fact/rule holds with some vague probability. Notice that, in contrast, typical databases in companies contain facts with very high certainty, but they do not contain rule or commonsense knowledge: the latter is built into brittle software code of systems using these databases.

We will consider two different ways to tackle uncertainty:

Numeric methods: uncertainty is estimated with probability-like numbers of various kinds.
Discrete methods: no numbers used. Instead, uncertainty is described as exceptions or beliefs of people.

NB! Both of these ways are hard to actually implement, neither have they been understood very well, despite an immense amount of research and papers. For example, typical machine learning methods operate with extremely vague probability-like numbers and do not attempt to put these numbers into a real properly-theoretical-and-correct probabilities framework.

The main material for reading about numeric methods is an overview-followed-in-depth by the lecturer Numeric uncertainty: (E2 up to and including the chapter 3, see also E2 tags inside)

Read carefully up to and including the chapter 4 "Different ways to encode confidences in logic" and have a quick look at all the wikipedia links. In particular, Wiki on Bayesian inference and this
Skim through the later parts and look only into these subchapters which you found interesting.

Next,

Read the paper E2 about confidences. containing intro, algorithms and experiments. Then look at the web page with the confer reasoner and a wealth of examples.
Read the paper E2 about exceptions (default logic). containing intro, alorithms and experiments. Then look at the web page with the gk reasoner and a wealth of examples.

Next, have a brief look at a good example system doing some specific kinds of probability reasoning:

Problog start with a tutorial E2
Try out some examples via browser from the tutorial

Lecture: visual reasoning: 15 April

The lecture will be given by our postdoc researcher Mohit Vaishnav. You may want to have a look at his thesis from 2023.

Lecture: third lab live experiments: 22 April

We will do live experiments for the third lab.

Previous plan was: continue with uncertainty, numeric confidences and probabilities.

A presentation of our paper about implementing numeric confidences. Here are examples, a binary, etc.

Lecture: Semantic parsing: 29 April

The focus of the lecture is on semantic parsing.

Here are Martin's slides from 2024 Media:KR_2024_Parsing_01.pdf E2.

We will look at both (a) classic paradigms of semantic parsing (as on the slides), (b) using LLMs and alternative modern paradigms for semantic parsing.

About important concepts in parsing:

Some important toolkits and representation schemes:

Some draft parsing papers:

Semantic parsing continued: 6 May

Lecture will be given by Martin Verrev on teams only (not the lecture room).

Loenguslaidid (lisatud linkidega): Media:KR_2024_Parsing_02.pdf E2

Topics in neurosymbolic reasoning and consultation for the exam: 13 May

Teadmiste formaliseerimine