
Lecture Notes

Course: B.Tech. III Year I Semester


Subject: Artificial Intelligence
Course Objectives: To train the students to understand
different types of AI agents, various AI search algorithms,
fundamentals of knowledge representation, building of simple
knowledge-based systems, and to apply knowledge representation
and reasoning. The study of Markov models prepares the student
to step into applied AI.

Dr. Y. L. Malathi Latha

Artificial Intelligence
Unit-IV

Syllabus
Learning: What Is Learning? Rote Learning, Learning by Taking Advice, Learning in Problem Solving,
Learning from Examples, Winston’s Learning Program, Decision Trees.
Learning
Introduction
Learning covers a wide range of phenomena. At one end of the spectrum is skill
refinement. People get better at many tasks simply by practicing. The more you
ride a bicycle or play tennis, the better you get. At the other end of the spectrum
lies knowledge acquisition. As we have seen, many AI programs draw heavily on
knowledge as their source of power. Knowledge is generally acquired through
experience and such acquisition is the focus of this chapter.

What is learning?

• According to Herbert Simon, learning denotes changes in a
system that enable the system to do the same task more
efficiently the next time.
• Arthur Samuel stated that "machine learning is the subfield of
computer science that gives computers the ability to learn
without being explicitly programmed".
• In 1997, Mitchell proposed that "a computer program is said
to learn from experience 'E' with respect to some class of
tasks 'T' and performance measure 'P', if its performance at
tasks in 'T', as measured by 'P', improves with experience 'E'".
• The main purpose of machine learning is to study and design
algorithms that can be used to produce predictions from a
given dataset.
• Machine learning is also concerned with how an agent uses its
percepts both to act and to improve its future performance.
The following tasks must be learned by an agent:

• To predict or decide the result state for an action.
• To know the value of each state (understand which state has a
high or low value).
• To keep a record of relevant percepts.

Why do machines need to learn?

1. To understand and improve the efficiency of human beings.
2. To discover new things or structures that are unknown to humans.
3. To fill in skeletal or incomplete knowledge about the domain.
Advantages of learning

1. Skill refinement: practice makes skills improve; the more you play tennis, the better you get.
2. Knowledge acquisition: knowledge is generally acquired through experience.

Different forms of learning:

The various forms of learning are:

1. Rote Learning
2. Learning by Taking Advice
3. Learning in Problem Solving
4. Learning from Examples
5. Learning with Macro-Operators

1. Rote learning:

Rote learning is basically memorisation.

• Saving knowledge so it can be used again.
• Retrieval is the only problem.
• No repeated computation, inference or query is necessary.

A simple example of rote learning is caching.

• When a computer stores a piece of data, it is performing a rudimentary form of
learning.
• In the case of data caching, we store computed values so that we do not have to
recompute them later.
• When computation is more expensive than recall, this strategy can save a
significant amount of time.
• Caching has been used in AI programs to produce some surprising performance
improvements.
• Such caching is known as rote learning.

• Store computed values (or large pieces of data).
• Recall this information when required by computation.
• Significant time savings can be achieved.
• Many AI programs (as well as more general ones) have used caching very
effectively, as the sketch below illustrates.
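
A minimal sketch of caching as rote learning, in Python (an illustration only; the fib function is just a hypothetical stand-in for any expensive, deterministic computation):

import functools

@functools.lru_cache(maxsize=None)
def fib(n: int) -> int:
    # Computed values are saved; later calls are answered by recall,
    # not by recomputation.
    if n < 2:
        return n
    return fib(n - 1) + fib(n - 2)

print(fib(100))  # fast: stored sub-results are recalled, not recomputed

Here recall (a table lookup) is far cheaper than recomputation, which is exactly the condition under which rote learning pays off.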

Memorisation is a key necessity for learning:

• It is a basic necessity for any intelligent program -- is it a separate learning
process?
• Memorisation can be a complex subject -- how best to store knowledge?

Samuel's Checkers program employed rote learning (it also used parameter
adjustment which will be discussed shortly).

• A minimax search was used to explore the game tree.
• Time constraints do not permit complete searches.
• It records board positions and scores at the ends of searches.
• If the same board position arises later in the game, the stored value can be
recalled, and the end effect is that a deeper search has effectively occurred.
A sketch of this idea follows.
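
A hedged sketch of this kind of rote learning inside minimax (an illustration, not Samuel's actual checkers code; the toy evaluate and moves functions are hypothetical stand-ins):

stored: dict[str, float] = {}   # board position -> backed-up score

def evaluate(pos: str) -> float:
    return float(len(pos))      # toy static evaluation

def moves(pos: str) -> list[str]:
    return [pos + "L", pos + "R"]   # toy move generator

def minimax(pos: str, depth: int, maximizing: bool) -> float:
    if pos in stored:           # recall a score saved in an earlier search:
        return stored[pos]      # the effect is a deeper search for free
    if depth == 0:
        score = evaluate(pos)
    else:
        scores = [minimax(m, depth - 1, not maximizing) for m in moves(pos)]
        score = max(scores) if maximizing else min(scores)
    stored[pos] = score         # rote learning: save for later games
    return score

print(minimax("", 3, True))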

Rote learning is basically a simple process. However, it does illustrate some issues
that are relevant to more complex learning systems.

Rote learning does not involve any sophisticated problem-solving capabilities, but it
shows the need for some capabilities required of complex learning systems, such as:

Organisation
-- Access to the stored value must be faster than it would be to recompute it.
Methods such as hashing, indexing and sorting can be employed to enable this.

E.g. Samuel's program indexed board positions by noting the number of pieces.

Generalisation
-- The number of potentially stored objects can be very large. We may need to
generalise some information to make the problem manageable.

E.g. Samuel's program stored game positions only with white to move; also,
positions related by rotation along the diagonals were combined.

Stability of the Environment
-- Rote learning is not very effective in a rapidly changing environment. If the
environment does change, then we must detect and record exactly what has
changed -- the frame problem.

2. Learning by Taking Advice

• This is the easiest and simplest form of learning.
• In this type of learning, a programmer writes a program giving the computer
instructions for performing a task. Once it is learned (i.e. programmed), the
system will be able to do new things.
• There can be several sources of advice, such as humans (experts), the
internet, etc.
• However, this type of learning requires more inference than rote learning.
• As the stored knowledge in the knowledge base gets transformed into an
operational form, the reliability of the knowledge source must always be taken
into consideration.

There are two basic approaches to advice taking:

• Take high-level, abstract advice and convert it into rules that can guide the
performance elements of the system, automating all aspects of advice taking.
• Develop sophisticated tools such as knowledge base editors and debugging aids.
These are used to help an expert translate his expertise into detailed rules.
Here the expert is an integral part of the learning system. Such tools are
important in the expert systems area of AI.
Automated Advice Taking

The following steps summarise this method:

Request
-- This can be a simple question asking for general advice, or something more
complicated, identifying shortcomings in the knowledge base and asking for
a remedy.

Interpret
-- Translate the advice into an internal representation.

Operationalise
-- Translated advice may still not be usable, so this stage seeks to provide a
representation that can be used by the performance element.

Integrate
-- When knowledge is added to the knowledge base, care must be taken so
that bad side-effects are avoided.

E.g. introduction of redundancy and contradictions.

Evaluate
-- The system must assess the new knowledge for errors, contradictions etc.

The steps can be iterated, as the skeleton below suggests.
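
A hedged skeleton of this five-step loop in Python (every function and data structure here is a hypothetical placeholder for illustration, not an implementation of any real advice-taking system):

def take_advice(kb: list[str], advice: str) -> list[str]:
    request = advice                        # 1. Request: advice to act on
    internal = request.lower().split()      # 2. Interpret: internal form (toy)
    rule = " ".join(internal)               # 3. Operationalise: usable rule (toy)
    if rule not in kb:                      # 4. Integrate: avoid redundancy
        kb.append(rule)
    assert len(set(kb)) == len(kb)          # 5. Evaluate: check for duplicates
    return kb

print(take_advice([], "Avoid taking points"))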

Knowledge Base Maintenance

Instead of automating the five steps above, many researchers have instead assembled
tools that aid the development and maintenance of the knowledge base.

Many have concentrated on:

• Providing intelligent editors and flexible representation languages for
integrating new knowledge.
• Providing debugging tools for evaluating the knowledge base and finding
contradictions and redundancy in it.

EMYCIN is an example of such a system.

Example Learning System - FOO

Learning the game of hearts

FOO (First Operational Operationaliser) tries to convert high level advice (principles,
problems, methods) into effective executable (LISP) procedures.

Hearts:

• The game is played as a series of tricks.
• One player - who has the lead - plays a card.
• The other players follow in turn and play a card.
o Each player must follow suit.
o If he cannot follow suit, he may play any of his cards.
• The player who plays the highest-value card wins the trick and the lead.
• The winning player takes the cards played in the trick.
• The aim is to avoid taking points. Each heart counts as one point; the queen of
spades is worth 13 points.
• The winner is the player who, after all tricks have been played, has the lowest
point score.

Hearts is a game of partial information with no known algorithm for winning.

Although the possible situations are numerous, general advice can be given, such as:

• Avoid taking points.
• Do not lead a high card in a suit in which an opponent is void.
• If an opponent has the queen of spades, try to flush it out.

In order for FOO to receive advice, a human must convert it into a FOO
representation (a LISP clause):
(avoid (take-points me) (trick))

FOO operationalises the advice by translating it into expressions it can use in the
game. It can UNFOLD avoid and then trick to give:

(achieve (not (during
                (scenario
                  (each p1 (players) (play-card p1))
                  (take-trick (trick-winner)))
                (take-points me))))

However, the advice is still not operational, since it depends on the outcome of the
trick, which is generally not known. Therefore FOO uses case analysis (on
the during expression) to determine which steps could cause one to take points. Step 1
is ruled out, and step 2's take-points is UNFOLDED:

(achieve (not (exists c1 (cards-played)
                (exists c2 (point-cards)
                  (during (take (trick-winner) c1)
                          (take me c2))))))

FOO now has to decide: under what conditions does (take me c2) occur during (take
(trick-winner) c1)?

A technique called partial matching hypothesises that points will be taken if me =
trick-winner and c2 = c1. We can reduce our expression to:

(achieve (not (and (have-points (cards-played))
                   (= (trick-winner) me))))

This is not quite enough: it means "Do not win a trick that has points." We do not
know who the trick-winner is, and we have not said anything about how to play in a
trick in which points have been led in the suit. After a few more steps FOO comes up
with:

(achieve (=> (and (in-suit-led (card-of me))
                  (possible (trick-has-points)))
             (low (card-of me))))

FOO had an initial knowledge base that was made up of:

• Basic domain concepts such as trick, hand, deck, suit, avoid, win, etc.
• Rules and behavioural constraints -- the general rules of the game.
• Heuristics as to how to UNFOLD.

FOO has two basic shortcomings:

• It lacks a control structure that could apply operationalisation automatically.
• It is specific to hearts and similar tasks.

3. Learning in Problem Solving

There are three basic methods in which a system can learn from its own
experiences.
Learning by Parameter Adjustment

Many programs rely on an evaluation procedure to summarise the state of the
search, etc. Game-playing programs provide many examples of this.

However, many programs have a static evaluation function.

In learning, a slight modification of the formulation of the evaluation function is
required.

Here the problem has an evaluation function that is represented as a polynomial of
the form:

c1*t1 + c2*t2 + ... + cn*tn

The t terms are the values of features and the c terms are the weights.

In designing programs it is often difficult to decide on the exact value to give each
weight initially.

So the basic idea of parameter adjustment is to:

• Start with some estimate of the correct weight settings.
• Modify the weights in the program on the basis of accumulated experience.
• Features that appear to be good predictors will have their weights increased,
and bad ones will have their weights decreased.

Samuel's Checkers program employed 16 such features at any one time, chosen from
a pool of 38. A sketch of the idea follows.
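
A hedged sketch of parameter adjustment in Python (an illustration of the idea, not Samuel's actual update rule; the feature values, target and learning rate below are made-up assumptions):

def evaluate(weights: list[float], features: list[float]) -> float:
    # Polynomial evaluation function: c1*t1 + c2*t2 + ... + cn*tn
    return sum(c * t for c, t in zip(weights, features))

def adjust(weights: list[float], features: list[float],
           target: float, rate: float = 0.01) -> list[float]:
    # Raise the weights of features that under-predicted the observed outcome
    # and lower those that over-predicted it (a simple delta-rule style update).
    error = target - evaluate(weights, features)
    return [c + rate * error * t for c, t in zip(weights, features)]

weights = [0.5, 0.5, 0.5]      # initial estimates of the correct settings
features = [1.0, -2.0, 0.5]    # feature values t for one board position
weights = adjust(weights, features, target=1.0)   # outcome observed later
print(weights)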
Learning by Macro Operators

The basic idea here is similar to Rote Learning:

Avoid expensive recomputation

Macro-operators can be used to group a whole series of actions into one.

For example, making dinner can be described as: lay the table, cook dinner, serve
dinner. We could treat laying the table as one action even though it involves a
sequence of actions.

The STRIPS problem solver employed macro-operators in its learning phase.

Consider a blocks world example in which ON(C,B) and ON(A,TABLE) are true.

STRIPS can achieve ON(A,B) in four steps:

UNSTACK(C,B), PUTDOWN(C), PICKUP(A), STACK(A,B)

STRIPS now builds a macro-operator MACROP with preconditions ON(C,B) and
ON(A,TABLE), postconditions ON(A,B) and ON(C,TABLE), and the four steps as its
body.

MACROP can now be used in future operations.

But it is not very general. The above can easily be generalised by using variables in
place of the specific blocks.

However, generalisation is not always that easy (see Rich and Knight). A sketch of
the idea follows.
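
A hedged sketch of a STRIPS-style macro-operator in Python (illustrative only, not the original STRIPS code; the operator definitions below are simplified assumptions, e.g. arm-empty conditions are omitted):

from dataclasses import dataclass

@dataclass
class Op:
    name: str
    pre: frozenset       # facts that must hold before applying
    add: frozenset       # facts made true by the operator
    delete: frozenset    # facts made false by the operator

def apply(state: frozenset, op: Op) -> frozenset:
    assert op.pre <= state, f"{op.name}: preconditions not met"
    return (state - op.delete) | op.add

def macro(steps: list[Op]):
    # Package a whole operator sequence as one reusable action.
    def run(state: frozenset) -> frozenset:
        for op in steps:
            state = apply(state, op)
        return state
    return run

# The four-step plan from the text, starting from ON(C,B) and ON(A,TABLE):
unstack_cb = Op("UNSTACK(C,B)", frozenset({"ON(C,B)", "CLEAR(C)"}),
                frozenset({"HOLDING(C)", "CLEAR(B)"}), frozenset({"ON(C,B)"}))
putdown_c = Op("PUTDOWN(C)", frozenset({"HOLDING(C)"}),
               frozenset({"ON(C,TABLE)"}), frozenset({"HOLDING(C)"}))
pickup_a = Op("PICKUP(A)", frozenset({"ON(A,TABLE)", "CLEAR(A)"}),
              frozenset({"HOLDING(A)"}), frozenset({"ON(A,TABLE)"}))
stack_ab = Op("STACK(A,B)", frozenset({"HOLDING(A)", "CLEAR(B)"}),
              frozenset({"ON(A,B)"}), frozenset({"HOLDING(A)", "CLEAR(B)"}))

MACROP = macro([unstack_cb, putdown_c, pickup_a, stack_ab])
start = frozenset({"ON(C,B)", "ON(A,TABLE)", "CLEAR(A)", "CLEAR(C)"})
print(sorted(MACROP(start)))   # includes ON(A,B) and ON(C,TABLE)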

Learning by Chunking

Chunking involves ideas similar to macro-operators and originates from
psychological ideas on memory and problem solving.

Its computational basis is in production systems (studied earlier).

SOAR is a system that uses production rules to represent its knowledge. It also
employs chunking to learn from experience.

Basic Outline of SOAR's Method

• As SOAR solves problems, it fires productions; these are stored in long-term
memory.
• Some firings turn out to be more useful than others.
• When SOAR detects a useful sequence of firings, it creates a chunk.
• A chunk is essentially a large production that does the work of an entire
sequence of smaller ones (see the sketch after this list).
• Chunks may be generalised before being stored.
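
A hedged sketch of chunking in Python (illustrative only, not SOAR's actual algorithm; the rule traces here are made-up toy strings):

Rule = tuple[str, str]   # (condition, action)

def chunk(trace: list[Rule]) -> Rule:
    # Collapse a useful sequence of firings into one larger production that is
    # triggered by the first condition and performs the final action.
    return (trace[0][0], trace[-1][1])

trace = [("goal G seen", "set subgoal S"),
         ("subgoal S set", "apply operator O"),
         ("operator O applied", "goal G achieved")]
print(chunk(trace))   # ('goal G seen', 'goal G achieved')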
One of the most often heard criticisms of AI is that machines cannot be called
intelligent until they are able to learn to do new things and adapt to new situations,
rather than simply doing as they are told to do. Some critics of AI have been saying
that computers cannot learn!

Definition of learning: changes in the system that are adaptive in the sense that they
enable the system to do the same task, or tasks drawn from the same population,
more efficiently and more effectively the next time.

Learning covers a wide range of phenomena:
• Skill refinement: practice makes skills improve.
• Knowledge acquisition: knowledge is generally acquired through experience.
Review Questions

1) Explain the importance of repeated problem solving for effective improvement in
the process of learning. Distinguish it from learning by taking advice.
2) a) What is unsupervised learning?
   b) "Learning is the most important characteristic of intelligence." Justify.
3) What are the components of agents? (16)
4) Define and explain:
   (i) Supervised learning (6) (ii) Unsupervised learning (6)
   (iii) Reinforcement learning (4)
5) How are hypotheses formed by pure inductive inference, or induction? Explain
with examples. (16)
6) a) What is a decision tree? (4)
   b) Explain the process of inducing decision trees from examples. (6)
   c) Write the decision tree learning algorithm. (6)
7) How is the performance of a learning algorithm assessed? Draw a learning curve
for the decision tree algorithm. (16)
8) Explain with an example: What is learning? What are its types?
