100% found this document useful (21 votes)
18K views63 pages

RGO PA Reviewer Marcos

Psychological assessment involves gathering and integrating psychological data through tests, interviews, case studies, and behavioral observations to make an evaluation. There are key differences between testing, which measures specific abilities, and assessment, which seeks to answer referral questions through an individualized process. Effective assessment requires skills in test selection, organization and interpretation of various data sources. Common tools include tests, interviews, case histories, portfolios, and behavioral observations. The assessor's role is to integrate information from multiple sources to make informed recommendations.

Uploaded by

lailani daya
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
100% found this document useful (21 votes)
18K views63 pages

RGO PA Reviewer Marcos

Psychological assessment involves gathering and integrating psychological data through tests, interviews, case studies, and behavioral observations to make an evaluation. There are key differences between testing, which measures specific abilities, and assessment, which seeks to answer referral questions through an individualized process. Effective assessment requires skills in test selection, organization and interpretation of various data sources. Common tools include tests, interviews, case histories, portfolios, and behavioral observations. The assessor's role is to integrate information from multiple sources to make informed recommendations.

Uploaded by

lailani daya
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 63

Psychological Assessment: Prepared by: Rolf Gian Marcos, RPm

Chapter 1: Psychological Testing and Assessment Skills of Evaluator - technician-like - educated


skills selection and
Roots of contemporary psych testing and assessment thoughtful
- Early 20th Century / 1905 in France organization and
o Binet - place Paris school children in integration of data
appropriate classes. Outcome - yields test score - entails logical
- World War 1 (1917) problem-solving
o Screen large number of recruits for approach, from
intellectual and emotional problems different sources
of data designed
TESTING – Refer to everything from the administration of a to shed light on a
test to interpretation of a test score. referral question
- (WW1: Testing means screening of thousands of
military recruits) - Assessment begins with a REFERRAL (informal
- WW2 – Term ASSESSMENT began to emerge meeting)
o U.S office of Strategic Service (Central - The assessor’s own past experience, education,
Intelligence Agency) training play a key role in the specific tests/tools to be
o Used variety of procedures and used in assessment.
measurement tools (psych tests) to select - If hindi ka familiar, tool selection process might be
military personnel for HIGHLY informed by (publications dealing with behavior) some
SPECIALIZED POSITIONS such as research in preparation for the assessment.
espionage, intelligence gathering etc.
o SET UP: Group-on-one Interview Collaborative Psychological Assessment
Hostile interviews - Assessor and assessee may work as “partners” from
Explain a given scenario initial than final feedback.
Evaluated on their ability to o Therapeutic Psychological Assessment
maintain Non-Contradictory ▪ Include element of therapy as part
responses of the process.
o OSS MODEL (uses variety of tools from ▪ Self- discovery and new
highly trained assessors) understanding are encouraged
▪ Inspired the Assessment Center
Approach to Personnel Evaluation Dynamic Assessment
of today. - mostly employed in educational settings
Psychological Assessment – gathering and integration of - Interactive approach that usually follows a model of:
psychology- related data for the purpose of making o Evaluation
psychological evaluation that is accomplished through the use o Intervention
of tools (test, interviews, case studies, behavioral o Evaluation
observations) and specially designed and measurement - Dependent on the purpose of assessment
procedures. (feedback/hints = mastery)
Psychological Testing – process of measuring psychology- - Provides a means for evaluating how the assessee
related variables by means of devices on procedures designed processes/benefits from some type of intervention
to obtain sample of behavior. (feedback, hints, instruction, therapy) during the
course of evaluation
Difference between Testing and Assessment - Could be done thru computer tools.
Testing Assessment
Objective Process - Gauge on - answer a Tools for Psychological Assessment
certain ability or referral questions,
attribute decision making, 1. Test (Measuring Device or Procedure)
- group or - individual; the
individual; result process itself
Psychological test – device or procedure designed
of processing
to measure variables related to psychology. (analysis
Role of Evaluator - tester is not the - assessor is the
key key of sample of behavior)
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Content – Subject Matter/ focus of the test; to make more informed recommendations about
Difference came from varying definition of hitting, firing and advancement of personnel
construct/theoretical orientation ADV: minimized bias of lone interviewer
Format - Form, plan, structure, arrangement -and DISADV: utility (cost of using multiple interviews may
layout of test items (time limits); not be justified); issue on Group Think
Computerized, pencil-paper; Guidelines for
creating a portfolio work sample Differences of Interviews
Administration Procedures a. Pacing of interviews
- One – to – one basis = requires attire and b. Rapport with interviewers
knowledgeable test administration (demo tasks, c. Ability to convey genuineness, empathy and
trained observation of assessee’s performance) humor
- Group basis DOES NOT REQUIRE presence of test
administrator 3. Portfolio
Scoring and Interpretation Procedures - Sample of one’s ability and accomplishment
o Score - Code or summary statement, usually - Education (writing samples) tools for hiring instructors
numerical, that reflects an evaluation of
performance on a test, task, interview, etc. 4. Case History Data
o Scoring - Process of assigning evaluative - Records, transcripts and other accounts in written,
codes or statements to performances in test pictorial or other form that present archival
etc. information, official and informal accounts and other
o Cut-Off Scores - reference points (usually data and items relevant to assessee
numerical) derived by judgement and used - Files/excerpts from files maintained in situation and
to divide a set of data into TWO or more agencies
classifications - Letters, written correspondence, photos, family
o Informal Eyeball Method - no formal albums, newspaper, magazine clippings, home news
method used to assign cutoff. and pictures, movies and audio tapes
Psychometric Soundness (Technical Quality) – - Shed light on individuals. Past and current adjustment
RV as well as on the events/circumstances that may have
o Psychometrics - science of psychological contributed to any changes in assessment
measurements - Neuropsychological Evaluations – prior trauma that
o Psychometric – measurements that is contributes to deficit
psychological in nature - Case Study/Case History – report or illustrative
o Psychometrician/Psychometrist – account concerning a person or event that was
professional who uses, analyze, interpret compiled so the basis of case history data
psych test data.
o Psychometric Utility – usefulness or 5. Behavioral Observation (Down-home Wisdom)
practical values - Monitoring the actions of others or oneself by visual or
electronic means while recording
2. Interview quantitative/qualitative info regarding those sections
- Method of gathering info thru direct communication - Can be used as diagnostic aid (inpatient facilities,
involving reciprocal exchange behavioral research lab, classroom)
- Takes note of verbal and non-verbal actions - Naturalistic Observation – observing behavior in
- Facial expressions, eye contact and general reaction natural settings
to the demand of the interview - Aid in designing therapeutic intervention
- Ideally conducted face to face - Used most frequently by researchers in settings such
- Telephone – pitch, pause are signs of emotion as classroom, clinics and prisons
- Different in purpose, length and nature - Not practical for private practitioners
- Panel Interview/Board Interview – more than one - Mostly used by those in field of assisted living
interviewer; usually used to help professionals in HR

1
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
6. Role Play Tests - Test developers and Publishers
- Tool for assessment where in assesses are directed Standards for Educational and Psychological Testing
to act as if they were in a particular situation o Covers issues related to test construction
- Used when real settings are too impractical and evaluation, test administration and use,
- Substance abusers and special applications of tests such as
- Can be used as both a tool for assessment and special considerations, when testing
measure of outcome linguistic minorities
- Test Users
7. Computers as Tools o Professionals who uses the tests
- Aids in test administration, scoring and interpretation o Controversies on qualifications
- Local Processing – scoring done on site - Test Takers → affected by → Sources of Error
- Central processing – conducted at some central o Test anxiety, if gets ba nila yung rationale ng
location; done through phone lines; mail courier assessment, willingness to cooperate,
- Simple Scoring Report – mere listing of score comprehend yung instructions, physical pain,
- Extended Score Report – more detailed; includes emotional distress, physical discomfort,
statistical analysis of test taker’s performance physical conditions, alertness, wakefulness,
- Interpretive Report – inclusion of numeral/narrative predisposed to anger/disgrace, prior to
interpretive statements in report. coaching, good/bad lighting or chamba
- Consultative Report – expert opinion o Anyone who is subject of an assessment or
- Integrative Report – employ previously collected evaluation
data into the next report - Exception: Psychological Autopsy
- CAPA (Computer Assisted Psychological o Reconstruction of decreased individuals’
Assessment) – assist was for test user, not to test psychological profile on the basis of archival
taker records, artifacts and interviews previously
- CAT (Computer Adaptive Testing) – reference to conducted with the deceased assessee or
the computer’s ability to tailor the test to the test people who know him/her
taker’s ability - Society at Large
o Organizing, systemizing (sino ang baliw?)
Advantages of CAPA shifted to social concerns
a. Greater access to potential test users o Assessment evidence has roots in science
b. Quicker scoring and interpretation o Society was heard thru court decisions
c. Cost-efficient - Other Parties
d. User-friendly for test takers who have o Organizations, companies, governmental
disabilities/isolated populations agencies, sponsors
o Companies and serious offer test scoring
Disadvantages of CAPA and interpretation service
a. Test-Client Integrity – verification of identity o Marketing and sales of texts
when administered online o Academicians – evaluates and reviews
b. Unrestricted access to notes, other internet psychometric soundness
sources/ other aids in testing score inflation
Note: internet testing-scores is not assessment. It In What Types of Settings are Assessments Conducted
goes with limitations and Why?
- Educational settings
8. Other Tools o Achievement tests
- Videos – training and evaluation contexts o Court mandates (identify those who need
- Tools unassociated to PA necessarily: biofeedback special service)
equipment o SAT & GRE
o Diagnostic test: identify area of deficit to be
Who are the Parties Involved? targeted for intervention

2
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
o Informal evaluation – non-systematic - Rapport – relationship between examinee &
assessment that leads to formation of an examiner
opinion/attitude - Assessment of People with Disabilities
- Clinical Settings o Assessed for the exact same reasons as
o Public, private, military hospitals, inpatient non-disabled individuals
clinics, private practice, consulting rooms, o Accommodation – adoption of a test,
schools and etc. procedure, situation or substitution of one
o To screen or diagnose behavior problems test or another, to make the assessment
o Individualized (group-screening only) more suitable for the assessee with
- Counselling exceptional needs
o Improve adjustments, productivity o Alternate Assessment – evaluative or
o Mostly administered: social and academic diagnostic procedure or process that varies
skills, personality, interest, attitudes and from the usual, customary or standardized
values way a measurement is derived, either by
- Geriatric Settings – for older people virtue of some special accommodation made
o Quality of life – perceived stress, to the assessee or by means of alternative
loneliness, sources of satisfaction, personal methods desired to measure the variables
values, quality of living conditions, friendship
and social support Where to go for Reference Sources
o Screening for cognitive decline & 1. Test catalogue
dementia – loss of cognitive functioning o Distributed by the publisher
o Pseudodementia – severe depression o Most readily accessible source of information
mimics dementia o Contain: brief description of the test
o Those who are diagnosed with dementia are o Objective: sell the test
also showing symptoms of depression
- Business and Military Settings 2. Test Manuals
o Decision making about careers of personnel o Detailed info concerning the development of
o Hiring, promotion, transfer, job satisfaction, particular test and technical info
eligibility for further training, o Purchased from test publisher
o Taking pulse of costumers
- Governmental and Organizational Credentialing 3. Reference Volumes
o Licensing and certification o Buros – created mental measurement
- Academic Research Settings yearbook; provides detailed info for each test
- Other settings listed, including publisher, author, purpose,
o Court intended population and administration time.
o Health psychology – understanding the o More of updates and making revisions.
psychological variables in the onset, course, 4. Journal Articles
treatments and prevention of illnesses, o Reviews studies on its psychometric address
disease and disabilities and examples of flow the instrument was
o Promotes on health thru teaching, research used in either research/applied context
and direct-service activities 5. Online Database – Educational Resources Info
o Uses individual interviews, surveys and Center
paper-and-pencil tests. o contains wealth of sources, news about texts
and test’s assessments
How are Assessments Conducted o provides balanced info concerning
- Responsible tests users have obligations before, educational assessment and to provide
during and after a test measurement resources that encourage responsible test
- Protocol – form, sheet, booklet; description of set of use
test or assessment related procedures

3
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Chapter 2: Historical, Cultural and Legal/Ethical their natural gifts. Developed questionnaires, rating
Considerations scales and self-report inventories
- Antiquity to the 19th Century - Pioneered coefficient of correlation
- Karl Pearson – product-moment correlation
China 2200 BCE technique
- Selection kung sino makaka obtain ng government - Anthropometric Laboratory – height, armspan,
job. weight, breathing capacity, strength of pull & squeeze,
- “roll was released” – posting of results. For men swiftness of blow, keenness of sight, memory of form,
only discrimination of color & steadiness of hand
- contained items that measures proficiency in subjects
like Music, Agriculture, Horsemanship, Writing, Wilhelm Max Wundt
Arithmetic, Agriculture, Geography, Civil Law and - First experimental psychology laboratory
Military Strategy - General description of human abilities (reaction time,
- Knowledge and skill in the rites and ceremonies of perception, attention span)
public and social life Galton Wundt
- individual differences - focused on similarities
Song/Sung Dynasty (960-1279 CE) – classical literature - he believed that individual
(acquired wisdom of the past – entitled for a position) differences are a frustrating
(there are dynasties na exclusive lang sa family ang source of error in
positions/binebenta) experimentation

Imperial Examination – state sponsored exams for official James McKeen Cattell (individual differences in reaction time)
positions - Coined mental test (1980)
- Privilege for Passing – gov’t position & wear garbs; - Cattell Infant Intelligence Scale – Psyche Cattell ( 3rd
exemption in taxes; exempted to torture Son of James Cattell) & his wife Josephine Omen
SPEARMAN – originated the concept of test reliability; building
Edwin Smith Papyrus – docu (1600 BCE) ancient Egyptians mathematical framework for factor analysis
physicians had a developing knowledge of human anatomy VICTOR HENRI – collaboration with Binet on papers
suggesting how mental test could be used to
Priest – psych assessment; counselling; psychotherapy measure higher mental processes.
- Also trained in healing arts, philosophy, architecture, EMIL KRAEPELIN – early experimenter with word association
math and astronomy technique as a formal test
- Answers questions about meaning of dreams, LIGHTNER WITMER – little known founder of clinical
questions about life & after life psychology; treat chronic bad spells; founded the first
Ebers Papyrus – used incantations to ward off anything that psychological clinic in US
causes physical and mental health problems
Greco-Roman Writings – personality categorization (fluids) The 20th Century
18th Century – Christion Von Wolf
- Anticipated psychology as a science and Measurement of Intelligence
psychological measurement as a specialty within that 1895: Binet and Henri – published articles w/c they argued for
science the measurement of MEMORY and SOCIAL
1859 – on the origin of species by Means of Natural Selection COMPREHENSION
– Charles Darwin
- Chance variation in species would be selected by 1905: Binet and Simon – Published 30 item “measuring scale
nature according to adaptivity and survival value of intelligence”; help identify MENTALLY RETARDED PARIS
- According to Charles Darwin, individual differences SCHOOL CHILDREN
yung tawag if not exactly same yung offspring
- Francis Galton – contributor to the field of 1939: David Wechsler (Bellevue Hospital) – measure adult
measurement; aspired to clarify people according to intelligence; Defined intelligence as “the aggregate or global

4
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
capacity of the individual to act purposefully, to think rationally - Found that most immigrants from various nationalities
and to deal effectively with his environment” to be mentally deficient. (Hereditary defect/Apparent
detect due to deprivation)
Group Intelligence Test – outgrowth of INDIVIDUALLY - Translated Binet test into English
ADMINSTERED INTELLIGENCE TEST; military’s need for
efficient method of screening the intelligence of WWI results. Culture-Specific Test
- test designed for use w/ people from one culture, not
Measurement of Personality from the other.
(1930; Clinical psych was synonymous to mental testing) - Mixed populations are not recommended
- Lower scores if di belong sa target pop/normed pop.
Robert S. Woodworth – measure of adjustment and - Note: yung opinions about sa clarity of instructions
emotional stability - Reviewed by experts to eliminate possible bias
Personal Data Sheet – name of disguise the free purpose
Henry Goddard
(Yes/No); never went beyond the experimental stages
- Immigrants in Ellis Island (incapable of
Woodworth Psychoneurotic Inventory - first widely used comprehending English)
measures of personality - Feebleminded Club – Goddard; Johnstone, Barnes;
interdisciplinary forum for exchange of ideas
Self-Report – process whereby assessee themselves supply regarding special education
assessment related information by responding to questions, - Used Binet test to identify individuals who are
keeps a diary/self-monitoring thoughts/behavior mentally deficient
- Lower intelligence = social problems
Advantage: mas kilala natin self natin - Eugenics – science of improving qualities of breed
Disadvantage: Not appropriate for users who have thru intervention with factors related to heredity; along
poor insight into themselves; believe something that is with Charles Davenport; segregation,
not true in reality; unwilling to reveal negative things institutionalization, no reproduction
about them. - Kallikak Family – Feeblemindedness – alcoholism,
delinquency, truancy, criminality, prostitution,
illegitimacy, economic dependence and even physical
Projective Test – Project to ambiguous stimulus their own
appearance
unique needs, fears, hope and motivation; inkblot, drawing, - Abraham Myerson – pseudoscience, generalized
photograph conduct based on quantifiable data
RORSCHACH – Best known of all projective Tests
TAT – 1930s – Murray and Morgan – tell a story; needs and Major Contributions:
motivation - State – SPED (law)
- Binet – TERMAN (US)
Academic Traditions - Nature – nurture
- Work sa univ ng researchers o However, he said that he did not ignore the
Applied Traditions effect of emotions, talagang nature lang ang
- Selection (china) sikat at focused that time

Culture and Assessment Some Issues regarding Culture & Assessment


- Socially transmitted behavior patterns, beliefs and - Communication between assessor and assesee –
products of work a particular population, community, most basic part of assessment
or group of people. - Verbal Communication
- spoken language, attitudes towards elders and o Must speak the same language
techniques of child-rearing o Using infrequently used words
Henry Goddard o Issue on translation
- Addressed that measure must be adaptive to test o Trained examiner may detect thru verbal and
takers’ culture non-verbal means that the examinee’s grasp
- Nature and nurture debate on intelligence. of language is too diffluent to proceed
- Genetic nature on mental deficiency o Always consider the culture
- Non-verbal Communication and Behavior
5
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
o Different from culture to culture (3) produces for ensuring confidentiality
o Psychoanalysis – emphasize non-verbal (4) notifying test takers in errors of scoring
acts; first chance actions (5) accuracy in scoring
o Disclosure of answers within 30 days of
Standards of Evaluation publishing the scores
- Standards/personal preference can be culturally - Gottfredson – argued that manipulation of such
relative standards will produce only lasting frustrations, not
- Individualistic Culture (dominant culture) – self- enduring solution
reliance, autonomy, independence, uniqueness and
competitiveness. (OWN: STABLE) Litigation
- Collective Culture – conformity, cooperation, - Court-mediated solutions of legal matters of a civil,
interdependence, striving towards group goals criminal or administrative nature
(OTHERS: UNSTABLE) - Stimulus to the creation of new legislation
- Dependent PD (individualistic culture then became - Judge-Made Law (galing sa ruling ng court)
overly dependent) o Depended sa prior rulings of new legislation
- Federal Rules of Government
Tests and Group Membership o Scientific of research is admissible as
- Conflict – result of differing scores within a group evidence when the research
- If a test is used to evaluate a candidate’s ability to do o Study/method enjoys general acceptance
a job, the text should do just that – regardless of ▪ Established by testimony of experts
group membership of the test taker. and by reference to publication in
- Group differences in scores on professionally peer-reviewed grounds
developed tests do reflect performances in the world - Rule 702
- Affirmative Action – voluntary and mandatory efforts o More experts to testify the administrability of
undertaken by federal, state and local governments, expert testimony
private employers and schools to combat - Daubert Ruling
discrimination and to promote equal opportunity o Opposing expert testimony, Kahit higher
In assessment – altering test-scoring I.R.R or not, would be admissible.
procedures according to set guidelines o Usually applied in determining mental
capacity, claims of emotional distress,
Legal and Ethical Considerations personal decisions, child custody,
termination of parental rights.
- Ethics – body of principles of right, proper or good
conduct Concerning of the Profession
- Code of Professional Ethics – standard of care; the - Test-user Qualifications
average reasonable and prudent professional would a. Achievement/Proficiency/Aptitude
provide diagnostic or therapeutic service under the ▪ Manual and general orientation
same or similar condition b. Aptitude/Adjustment Inventories
▪ Require technical knowledge
Legislation (enacted at the federal level) c. Projective/Individual Mental Tests
- Minimum competency training programs ▪ Substantial understanding of testing
o Decision making about grade promotions, & supervision
awarding of diplomas and identification of
areas for remedial instruction Code of Fair Testing Practicing in Education
o HS Grads “minimal competencies” in - Developing/selecting test; interpreting scores; striving
reading, writing and arithmetic for fairness; informing test takers
- Truth-in-Testing Legislation Oregon – death with dignity “euthanasia” – needs
o Give that test takes a way to learn the psychological evaluation first
criteria by which they are being judged
o Provide descriptions of: Disadvantages of CAPA
(1) test’s purpose and subject matter - Easily copied and duplicated
(2) knowledge skills that the test’s purpose to - Few researches conducted regarding transforming
measure paper and pencil test & CAPA

6
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
- Questionable test interpretation
- Unprofessional / unregulated “psych taking” sites 3. The Right to Privacy and Confidentiality
- Privacy Right – Freedom to pick and choose for
“International Guidelines on Computer-Based and himself the time, circumstances, and the extent to w/c
Internet-Delivered Testing” he wishes to share/withhold from others his/her
- Flonda-based org attitude, beliefs, behaviors and opinions
- Address technical, quality, security and related issues - Privileged – Protected by law from disclosure in a
legal proceeding
- Privileged Information - Lawyer – client rel.; Doctor
The Rights of Test Takers –patient rel.; Priest – penitent rel.; Husband – wife
1. Right of Informed Consent rel.; Psychologist – client rel.; (all keeps information
- A person giving IC must be competent revealed & their client’s confidential)
- MacArthur competence assessment tool-treatment
(McCat-T) Confidentiality Privileged
o Structured interview based on 4 components - concerns matter of - protects clients from
of competency communication outside the disclosure in judicial
a. Being able to evidence a choice na bet courtroom proceedings
mo magparticipate -With Exceptions With Exceptions
b. Factual understanding of the issues -Ethical Term Legal Term
c. Able to reason about the facts of a
study/treatment - Privilege is not absolute. (reveal padin if necessary
d. Appreciate the rates of the situation and inutos ng court
Competent: Major Depression (Can be improved by o If di padin dinisclose = jail, fine, legal
training) consequences
Incompetent: Dementia, Bipolar, Schizophrenia - Psych-Client Relationship
(parent/legal rep.) o Nasa kanya yung privilege (pwede iutos ni
client kay psych na wag idisclose yung info
Written Form sa third party)
a. General purpose - Reveal if may danger/ harm sa self or others
b. Reason why is it being undertaken in the present o Potential harm/violence (must be identifiable)
case o Aids
c. General type of instruments to be administered o Threats to physical well-being
- Secure Test Data (5 years)
Court – Mandated: IC is Not necessary o Obtained in any medium
Courtesy – for building rapport.
4. The Right to the Least Stigmatizing Level
Deception – if knowing facts attesting might contaminate data - Avoid using moron, feebleminded, etc. (don’t breach
Debriefing – after testing. Full disclosure confidentiality)

a. Do not use deception unless it is absolutely


necessary
b. If it will cause participants emotional distress
c. Fully debrief participants

2. The Right to be informed to test findings


- Realistic info
- Put recommendations
- Put limitation also (errors)

Communication of Results – Most Important Part of


Evaluation Process
- answers future questions
- Ideally, put counseling resource for violent reactions.

7
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Chapter 3: A Statistic Refresher Raw Score – straight forward, unmodified accounting of
performance that is usually numerical
Scales of measurement
- Act of assigning numbers or symbols to characteristic Frequency Distribution
of things, etc. according to rules. - Ilang beses nag occur yung score
Scale - Also called as Simple F.D.
- Set of numbers whose properties model empirical
properties of the objects to rule the number an Grouped Frequency Distribution
assigned - Also called as class intervals
- Depende sa test user
a. Continuous scale – inch, meter (theoretically - Basis of convenience
possible to divide any of the values of the scale) - Involves Upper and Lower Class Limits=
b. Discrete scale – male/female
Illustrations
Error a. Graph – diagram/chart composed of lines, points,
- Collective influence of all the factors on test bars or other symbols that describe the illustrate data
score/measurement
- Beyond those specifically measured by the test a.1 Histogram – graph with vertical lines drawn at the
- Present always. Element of all measurements true limits of each score forming rectangles

1. Nominal
- Simplest forms of measurement
- Classifications/ Organization
- Yes/no responses
- Arithmetic Operation (+) addition

2. Ordinal
- Rank order
- Comparison with one another
- Assessment instruments applied to individual subject

3. Interval
- Contain equal intervals between numbers
- No absolute zero a.2 Bar Grah
- Possible to average set of measurements & obtain
meaningful
- Employed by tests of intelligence

4. Ratio
- Has the true zero point – theory only
- All mathematical operations can be used 3
- Used in assessing neurological functioning

Measurement Scales in Psychology

Ordinal – most frequently used in psychology; intelligence,


aptitude, personality test score a.3 Frequency Polygon
- Psychologists want to treat their data at interval due
to flexibility to be manipulated statistically

Describing Data
Distribution – set of test score arrayed for recording or study

8
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm

4. Standard Deviation
o Square of each score (- becomes +)
o Measures variability equal to the square root
Measures of Central Tendency of the average deviations about the mean
- Statistic that indicates the average/midmost score but o Equal sa square root of the variance
extreme scores in distribution
S2 Variance – mean of the squares of difference
Mean – average; most commonly used measures of between the scores & their mean
central tendency
∑ sigma – summation of
X – test scores Skewness – absence of symmetry
Kurtosis – steepness of a distribution in it’s center
1. Mean 𝒙 ̅ Higher Kurtosis (Higher peak) – fatter tails
o Most appropriate for interval and ratio Lower Kurtosis (rounded peaks) – thinner tails
For frequency:
𝑥=
∑(𝑓𝑥) The Normal Curve
𝑛 - began in the middle of 18th century
- Abraham De Moivre & later, Marquis de Laplace
Where: Sigma = Summation of all scores 19th Century – Karl Friedrich Gauss = Laplace – Gausian
N = number of observations Curve
Fx = midpoint if scores are grouped Karl Pearson – first to refer the curve as normal curve
2. Median Normal Curve – bell-shaped, smooth, mathematically defined
o Middle score in the distribution curve that is highest at it’s center
o Order (ascending or descending) - asymptotic – di lumalapat sa abscissa
▪ Odd – exact - normally distributed if larger ang samples (higher samples,
▪ Even – add the 2 then divide to 2 under range of abilities)
o Ordinal, interval, ratio
o Swak gamitin kapag konti lang ang scores Standard Scores
sa baba at taas - raw score that has been converted from one scale to another
3. Mode scale (easier interpretation)
o Most frequently occurring score
o Bimodal Distribution – 2 modes 1. Z Scores (zero plus/ minus 1 scale)
o Not used except for nominal data
o Useful in analyses of qualitative or verbal Z = (raw score-mean)/(standard deviation )
nature
o Can convey into along with the mean 2. T Scores (fifty plus/ minus 10)
- densed by McCall to honor his professor E.L. Thorndike
Measures of Variability ADV: No Negative Values
- Indication of how scores in a distribution are
scattered/dispersed 3. Stanine
- for achievement tests
1. Range
o Highest - lowest 4. SAT/GRE
o Simplest measure of variability M = 500 SD = 100
o Guide but gross description
2. Interquartile and Semi-Interquartile Range
• Linear Transformation
o Direct numerical relationship to the
3. Average Deviation (AD)
∑|𝑥|
original score
𝐴𝐷 = 𝑛 o Scores may undergo more than 1
transformation
Where: |x| = scores deviation from the mean

9
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
• Non-Linear Transformation 36% - other factor
o Used when data are not normally
distributed yet need padin i- Spearman Rho
compare sa normal distribution - also called as rank-order correlation coefficient, rank
• Normalized Standard Scores difference correlation method
o If skewed, normal distribution - developed by Charles Spearman – father of factor analysis
▪ Stretching the skewed - used when sample size is small
curve into a shape of a
normal curve then create a Graphic Representations
corresponding scale of Scatterplot/ Scatter diagram / scatter gram / bivariable
standard scores distribution
o Desirable for comparability - simple graphing of coordinate, points for values of x and y
purposes - useful for quick indication of direction and magnitude of
o Rely wen large sample is large and relation between 2 variables
replaceable & for skewness was
deep to the measuring instrument Curvilinearity
- eyeball gauge of how curved graph is
Correlation and Inference
Inference – decreased conclusions Outlier
Coefficient of Correlation – index ng strength of relationship - extremely atypical point
- usually occurring in a very small sample of test taker
The Concept of Correlation - can provide hint of some deficiency in test/ scoring
Correlation – degree and direction of correspondence procedures
between 2 things
+ 1 / - 1 = No Error Range Restriction
Perfect Correlation - causes lower correlation

Positive Correlation - Both Higher Both Lower Meta Analysis


Negative Correlation - Higher and Lower Lower and Higher - family of technique used to statistically combine info averse
0 = No Correlation studies to produce single estimates of data study
*Not a causation but can imply prediction Effect Size – estimates derived
- Most M.A studies: expressed as correlation coefficient
Pearson R ( r ) - M.A facilitates the drawing conclusions
- most widely used to correlation ADV: more weight can be given to studies with higher number
- also called as Pearson Correlation Coefficient of
Pearson Product Moment Coefficient of 1. More reliable
Correlation 2. Can be replicated
- by Karl Pearson 3. More focus on the effect size kesa sa stratified
significance
r = used when data is Linear & 2 Continuous Variables 4. Promotes evidence-based practice
convert raw scores → standard score ( X and Y each pair of - art and science
scores) - dependent on the skill of the meta analyst

Σ𝑥𝑦
𝑟=
√(Σ𝑥 2 )(Σ𝑦 2 )

.05 Level of Significance - chances are five times lower in level

Coefficient of Determination
- how much variable is shared in x & y
Ex: r = .80 = 64% - X
Chapter 4: Of Tests and Testing
10
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
to measure will influence performance on the
Some assumptions about Psychological Testing test
and Assessment o Error Variance – component of a test score
attributable to sources others than the
1. Psychological Traits and States Exists trait/ability measured
Ex: IQ 75% (25 san to galing?)
Traits – any distinguishable, relatively enduring
way in which one individual varies from one Sources of Error Variance
another 1. Assessee
State – also varies from person to person but 2. Assessor
less enduring 3. Instruments
Construct – informed, scientific concept develop
or constructed to describe or explain behavior;
unobservable but we can infer their existence Classical Test Theory
Overt Behavior – observable behavior; product o True score always comes with an error
of observable action
0s = Ts + E
2. Psychological Traits and States can be X = Ts + E
Quantified & Measured Ts = X0s – E
o Test developers shall provide test takers with
clear operational definition of the construct 6. Testing and Assessment can be Conducted in
being studied a Fair and Unbiased Manner
o Make an item that measures the targeted o More controversial assumption
text o Utilizing a test to a group which background
▪ Technical considerations was different from people whom the test was
▪ The way a construct has been intended
defined for the purpose of the test o More political than psychometric
▪ Value attended to the behaviors 7. Testing and Assessment Benefits Society
evaluated o What makes a good society?
▪ Psychometric soundness
Cumulative Scoring – the more the test taker responds
to a particular direction as keyed by the test manual as Two Aspects:
concept, the higher the test taker is pressured to be on the a. Reliability
targeted trait ▪ Consistency
-Higher Score = Higher you possess the construct being ▪ Necessary but not sufficient
revealed/measured b. Validity
▪ Measure what it proposes to
3. Test Related Behavior Predicts the Non-Test measure (difficult to achieve in
Related Behavior intelligence tests)
o Looks for patterns of answers ▪ Issue: are test items adequate
o For prediction of the future enough to come to construct?
o Postdict – understanding the behavior that ▪ Issue: interpretation results
takes place. Confirmation of retrospective c. Other considerations
data or incident. ▪ Trained examiners can administer
4. Tests and Other Measurement Techniques score and inspect with a minimum
have Strengths & Weaknesses difficulty
o Competent test users are aware and
acknowledges the limitations of the test and Norms
in how it is appropriate with the data being - Norm Related Testing and Assessment
obtained o Method of evaluation and a way of deriving
5. Various Sources of Error are Part of the means from test scores by evaluating
Assessment Process individual test-taker’s score and comparing it
o Error – component of a measurement to other scores of a group of test takers.
process factors other than what test attempt
11
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
o To know one’s standing/ranking relative to a - Standard Error of the Difference
comparison group
- Norm Score A Score B
o Test performance data of a particular group 80/100 79/100
of test takers that are designed to use as a Is 80 & 79 statistically significant?
reference sa pagevaluate/interpret
- Norming Developing Norms of a Standardized Test
o Process of deriving means 1. Obtain a Sample
o Standard set of instructions and conditions
- Race Norming 2. Data and Analysis
o Process of norming on the basis of race or o Descriptive statistics (measure of central
ethnic group (establishing different cut-off tendency & variability)
scores for ethnic) o Precise description of the standardized
- User Norms/ Program Norm sample (dapat kungsino yung nasa
o Consists of descriptive stats based on a standardization sample, representative sila
group of test takers in a given period of time ng future test takers)
because of formal sampling method o If in case na yung test ay limited for specific
group, yung test manual ay may guideline for
Sampling to Develop Norms establishing local norms
- Test Standardization o Standardization Samples – original
o Process of administering a test to a standardization sample
representative sample of test-takers to o Normative Sample – revised target
establish norms population; administration of test to a new
o Specified procedures for administering and normative sample
scoring typically including the normative data
Types of Norms
a. Stratified Sampling A. Percentile
▪ Proportion - Expression of the percentage of people whose score
▪ If random called, Stratified Random on a test or measure falls below a particular raw score
Sampling - 80th Percentile – you are better than the 80 of the test
b. Purposive Sampling takers who fall below the 80% percentile
▪ Select some sample because we - Ranking, conveys info about your relative position
believe it to be representative of the within the distribution of scores.
population criteria
c. Convenience/ Incidental Sampling Percentage Correct
▪ Most convenient
𝑟𝑎𝑤 𝑠𝑐𝑜𝑟𝑒
▪ Available = 𝑡𝑜𝑡𝑎𝑙 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑖𝑡𝑒𝑚𝑠 𝑥 100

Types of Standard Error


Issue: normal distribution (halos lahat nasa gitna)
- Standard Error of Measurement
o Observed score deviates from a true score
*Big difference pero tabi tabi sila, tas Small difference
pero layo layo sila.
0s and Ts (Huge discrepancy) = Error

- Standard Error of Estimate (Regression)


B. Age Norms
R2 = .64/ 64% (coefficient of def)
- Age equivalent scores
36% (error)
Ex: Stanford Binet
o Measure of Accuracy of Predicted
“Mental Age”
Is X really attributed to X?
C Age: 10 Mental Age:10
High reliability = Lower SEE
Compared sa same age people
- Standard Error of the Mean
Issue: Acceleration
o Error sa sampling

12
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Yes, same MA but key aspects (social and Average was 60
developmental psych) are not the same yet. Cut-off score of 80 was not good

C. Grade Norms 2. Is a cut-off score really discriminating?


- Madalas gamit for ELEMENTARY SCHOOL AGE Passing: 75
- expressed in decimals Is 74.8% very significant sa 75 na score?
- Disadvantage: pang nag-aaral lang; walang
pake sa items administered; Di pwede sa Note: all performances are normative, nilagyan lang ng
adults na nagbalik sa school. dichotomy (pass/fail sa criterion reference)

D. National Norms
- Nationally representative (Proportions)

E. Subgroup Norms
Test of self-esteem of gay people:
Norms: gay people
Given Norms:
Homosexual
- Lesbian
- Gay
- Bisexual
- Transgender

F. Local Norms
- only applicable in one particular locale/population.
- single-group validity

Fixed Reference Group Scoring System

Fixed Reference Group - basis of scoring for future


administration of the test. If in case may added items on the
revised version, raw scores will be converted to or anchored sa
FRG.

Norm Referenced
- Your score -> compared to -> average score of test takers
on the same test

* When applied, we can detect/recognize brilliance and


superior abilities

Criterion Referenced
- Your score -> compared to -> criteria: passing is 80%

* Regardless of other test taker’s performance


* It varies depending on the organization’s cut-off
* Also called as Domain/Content Referenced
* e.g Mastery Test

Issues on Criterion Referenced Testing/Assessment

1. What if unattainable yung criteria?


Normative = 75 is the highest
13
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Chapter 5: Reliability - emotional problems, physical discomfort, lack of sleep,
effects of drugs/medication
Reliability - wrong shading
- consistency in measurement - misreading the test item
- comes in a continuum/ degree level - formal learning experiences, casual life experiences, therapy,
- Reliability Coefficient - index of reliability, a proportion that illnesses, changes in mood or mental state
indicates the ratio between true score variance on a test and
the total variance Examiner Related Variables:
- pressure/absence of examiner
Classical Test Theory - physical appearance and demeanor
- not following test instructions
X=T+E - clues on a right answer: head nodding, eye movements and
non-verbal gestures
Where = T = 0s - E - level of procrastination
E = Ts - 0s
3. Test Scoring and Interpretation
Variance (o ) 2 - scores/scoring systems
- sd2 - computer- scored, technical glitch
- describing sources of test-score variability -subjectivity of the test
- True Variance - variance true from the difference
- Error Variance - from irrelevant random sources 4. Other Sources
- margin of error
O 2 (total) = O th (true) + O e (error)
2 2 - sampling error
- methodological error
Reliability = proportion of Total V attributed to the True V - non-systematic error (forgetting, failing to notice abusive
behavior, misunderstanding instructions regarding reporting)
Measurement Error - all factors associated with the process - systematic error (over reporting and under reporting)
of measuring some variable, other than the variable being
measured
Reliability Estimates
Random Error - unpredictable fluctuations and inconsistencies
of other variables in the measurement process; “noise” 1. Test-Retest Reliability
biglaang pangyayari
Marcos IQ Test Marcos IQ Test
Systematic Error (January 2018) (July 2018 )
- constant, proportion to which is presumed to be the value of = 100 = 100
the variables being measured
- does not affect the test consistency (present sa lahat) = greater than 6
- predictable and fixable months
If greater than 6 months, referred to as coefficient to stability.
Sources of Error Variance
1. Test Construction Time Sampling Error
- item/ content sampling - variation of items within and between - longer the time, the greater the likelihood that the reliability
text coefficient will be lower
- Problem: content sampled on a test and the way item is - experience, practice, memory, fatigue, motivation and
constructed developmental change
- Challenge/ Goal: maximize the proportion of total variance &
true variance. Minimize the proportion of total variance 2. Parallel Forms and Alternate Forms of Reliability

2. Test Administration Coefficient of Equivalence - sameness of two versions of text


Parallel Forms - equal correlation/ equal means and variances
Test Environment: room temp, lighting, ventilation and noise Alternate Forms - different version of a text to be parallel;
Test Taker Variables: equivalent in content/item difficulty
Item Sampling Error - time consuming and
14
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
expensive; minimizes carry over effect. - For dichotomous items (right/wrong)
- High heterogenous, low reliability KR20 compared sa
INTERNAL CONSISTENCY / INTER-ITEM CONSISTENCY split half

3. Split-half Reliability Formula: rKR20 = (


𝑘 ∑pq
)( 2)
- correlating two pairs of scores obtained from equivalent 𝑘−1 𝜎
halves of a single test administered once.
K = no. of test items
Step 1: Divide the test in 2 equivalent halves ∑pq = proportion of test & fouled takers who passed)
- odd-even method o2 = variance of total test scores
- divide the test by content & difficulty
“mini parallel forms” KR21 – all items have same levels of difficulty
Step 2: Calculate Pearson in between scores on
the 2 halves of the test Coefficient Alpha
Step 3: Adjust the half test reliability using - Received most acceptance and widest used today
- Also referred as coefficient of Alpha of -20
Spearman-Brown formula - Mean of all possible split half correlations, corrected
by Spearman-Brown formula/ test-retest
rSB = nrxy - For non-dichotomous items
1 + (n+1)rxy
𝑘 ∑𝜎2
rα = (𝑘−1)( 𝜎2
)
Rxy= Pearson R in the original test
N = no. of total items in a revised version then ididivide sa total k = number of test items
number of items sa original version ∑o2 = summation of variances each item
o2 = variance of total test scores
- to measure/estimate the effect of shortening the test - Preferred statistic for obtaining an estimate of internal
* Mas mataas ang reliability ng whole test kesa sa half-test consistency reliability
* Pwede gamitin to determine the number of items needed to - Widely used as a measure of reliability
attain a desired level of reliability
* if lower r then impractical mag add Pearson r = -1 to +1
Coefficient Alpha = 0 to 1 – similar
* to increase reliability: create new items, clarify test
instructions, simplify scoring rules. Average Proportional Distance (APD)
- Focused on degree of difference exists between item
Note: Inappropriate sa speed tests at heterogeneous data. scores
Other Methods of Estimating Internal Consistency Step 1: Calculate the absolute differences between scores
Scores:
Inter-Item Consistency - degree of correlation among all 1.) 4 1.) Item 1 – Item 2 = 1
items on a scale; used for homogeneous tests. 2.) 5 2.) Item 1 – Item 3 = 2
3.) 6 3.) Item 2 – Item 3 = 1
Homogeneous Tests - measures a single trait
Heterogenous Tests - measures different traits Step 2: Average the difference between scores
High Homogeneity = Higher Inter-Item Consistency 1+2+1 4
𝐴𝐷 = = = 1.33
3 3
Advantage: straight-forward interpretation, unlike heterogenous
tests na same scores but different abilities Step 3: Obtain the APD by dividing the average differences
Disadvantages: insufficient tool for reassuring multi-faceted between scores & the number of response options on the test
psychological variables (intelligence and personality) then minus one
Kuder Richardson Formula 1.33 1.33
- KR20: most widely known = = .22
7−1 6
15
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Classical Test Theory
.20/ .2 = excellent internal consistency - Also called as True Score Model
.25 & above = issues on internal consistency - Most widely used/accepted model
- Simpler to understand than IRT
Advantage on Cronbach: APD ay hindi dependent sa dami ng
items. True Score: value that acc to CTT, genuinely reflects an
individuals ability (trait) level as measured
Measures of Inter-Scorer Reliability
- Scorer reliability, judge reliability, observe reliability, For Criterion-Referenced:
interrater reliability
- Degree of agreement/consistency between 2 or more
Reliability: pagkakaiba iba ng test score ng mga test
raters with a particular measure
takers.
- Often used when coding non-verbal behavior
- Simplest way to compute: Coefficient of inter-scorer C.R: halos magkakadikit lang yung mga scores, minimal
reliability differences. Issue: Pasa ba or Failed?
- Cohen’s Kappa: 2 raters
- Fleiss Kappa: 3 or more raters Individual differences (variability); r decreases regardless
sa stability ng performance
The Nature of Test

Homogenous Heterogenous
Test Score – Test Dependent
- 1 trait - multifaceted
-Higher in consistency - test-retest reliability
Marcos IQ Test SB 5
Dynamic Static 135 130
- internal consistency - test-retest/ alternate forms
- anxiety - intelligence Ability
Restriction Inflation of Range Test A Test B
Mas mahirap Mas madali
Lower score Higher score

Assumption: may true score talaga tayo.


- simple
- applicable in most situations
Low correlation High Correlation Disadvantage:
1. All items are assured to contribute equally in the total
Speed Tests Power Tests score questionable if interval in nature ang scaling
- uniformly low difficulty - no time limits 2. Length of test made – CTT model
- unhoned - difficult items - Longer the better
- test- retest, alternate
forms, split half Domain Sampling Theory
- use Spearman-Brown - is the test able to cover the whole domain?
formula wy - yung items ba ay equally representative of all the domain?
- measure of reliability: Internal Consistency
Criterion Referenced Test for Educational/Vocational
- Heirarchichal fashion (ascending difficulty) Generalizability Theory – Cronbach
- Pass-fail manner - Modification of domain sampling method
- If may male – remedial. - Assumption: nag-iiba ang test score dahil iba iba ang
- Estimating reliability is not appropriate testing situation
- As individual difference decrease, reliability would
also decrease CTT GT
Variability in Score: Error Ibang test
16
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
situation Theta Level – reference to the degree of the
True score Universal Score underlying ability/trait that the test taker is pressured
to bring to the test; degree of trait present sa test
Uni A Uni B Uni C Uni D taker.
Uni Score 10 15 2 13
(different universe, different universe scores) 2. Local Independence
- systematic relationship between all of the test items.
Universe Score: 100 - the relationship has to do with the theta level of the
Generalizable / Reliable test taker (at hindi sa kung ano pa man).
Observed Score: 98 Lesser error 3. Monotonicity
- if you select a response, dapat higher level talaga
- Dapat generalizable ang test sya ng trait na yun .

Generalizability Study a. I like going to parties (Higher theta level)


- How generalizable yung score mo sa isang test if b. I like being alone
iadminister sya sa ibang situation
- Anong impact ng ibang facets sa universe sa iyong Item Characteristic Curve
universal score - also called as item response curve, category response curve,
- Coefficient of Generalizability – the influence of item trace line
particular facets on the test score. - nagpapakita ng relationship between item response (piniling
item) and test takers level on the latent construct being
Decision Study – developers examine the usefulness of test measured.
scores in helping the test taker.
- tests’ reliability does not reside within the test itself Information Function/ Curve
- Reliability: circumstances under which is test is developed, - alin sa mga items ang mas nagdidiscriminate
administered and interpreted
Scale Information Function
Item Response Theory/ Latent Trait Theory - sum ng lahat ng items
- a person with X ability will be able to perform at a level of Y - highly reliable for assessing moderate – severe levels but has
poor precision for mid levels.
Difficulty
- different scores per item (CTT: equal scores) IRT – Differential Item Function
- more difficult, higher score given - identify biased items (then naicocorrect naman)
CTT – Cronbach’s Coherent
Discrimination - iter-items relation
- identifies higher ∑ and lower scores on ability/traits being - internal consistency
measured differential weight
- higher weight if it discriminates - In CTT, reliability measure: internal consistency (cronbach
alpha)
- pwede sa Dichotomous & Polytomous Items. Ex: .92 (High pero di naman nagiindicate kung hanggang
saang level ng depressive symptopathology)
CTT IRT
- no assumptions in - F.D is intact with IRT models - In IRT, precision of a scale varies depending on what level of
frequency distributions consistency is being measured.
- specific level
Rasche Model - acknowledge limitations
- very specific/stringent
assumptions about the Reliability and Individual Score
underlying distributions
Standard Error of Measurement
Assumptions in using IRT - higher reliability, lower error
1. Unidimensional - tool used to estimate or infer the extent to which an observed
- measures single continuous construct. score denotes from a true score

17
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Standard Error of a Score
- index of the extent to which one individual’s score
vary over tests pressured to be parallel

𝜎𝑚𝑒𝑎𝑠 = 𝜎 √1 − 𝑟𝑥𝑦
𝑠𝑒𝑚 = 𝑆𝐷 √1 − 𝑟𝑒𝑙𝑖𝑎𝑏𝑖𝑙𝑖𝑡𝑦 𝑐𝑜𝑒𝑓𝑓𝑖𝑐𝑖𝑒𝑛𝑡

SEM: indirectly expresses/ reassures reliability

r = .64
𝜎 = 15

𝜎𝑚𝑒𝑎𝑠 = 𝜎 √1 − 𝑟𝑥𝑦

= 𝜎 √1 − (. 64)
=9

Confidence Interval: range/band of test scores that is likely to


contain the true score

FSIQ = 75 95% confident= 70-80


Age = 25

75 +/– 1.96 𝜎𝑚𝑒𝑎𝑠 = 75 + 1.96 (2.37) = 75! 4.645

Standard Error of the Difference Between 2 Scores

𝜎𝑑𝑖𝑓𝑓 = √ 𝜎 2 𝑚𝑒𝑎𝑠 + 𝜎 2 𝑚𝑒𝑎𝑠


Squared SD

𝜎𝑑𝑖𝑓𝑓 = 𝜎√ 2 − 𝑟1 − 𝑟2

18
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Chapter 6: Validity Test Blue Print “Structure”
- a plan regarding the types of information to be covered by the
- how well a test measures what it purports to measure items, quantity of items, organization and content of items.
- no test or measurement technique is Universally Valid
Employment Test – Behvaioral Observations
- usually employs SME’s rating the degree to which the content
Validation – process of gathering and evaluating evidence of a test is a representative sample of the required job-related
about validity skills.
Test Developer – supply validity
Test User – conduct own validation studies with own group of To Measure Content Validity:
test takers. Lawsche Model – method for gauging agreement among
raters regarding how essential a particular item is.
Local Validation Studies Essential
- yield insights regarding a particular population of test takers Useful but not essential
as compared to the norming sample of test manual Not Necessary
- necessary if the test user sought to transform a nationally
standardized exam into Braille administration - If more than half of the panelists indicate an item as essential,
- necesarry if magkaiba yung original pop sa new pop kung higher content validity
saan iaadminister.
Content Validity Ratio (CVR)
Trinitarian View
𝑛
– classic conception of validity 𝑛𝑒 −( )
2
- “Umbrella Validity” – construct 𝐶𝑉𝑅 = 𝑛
2
- not mutually exclusive
- contributes independently (depende sa type of test) Ne= number of panelists indicating the essential
N = total number of panelists
Unitary View
- by Messick (Fragmental) Negative CVR
- takes into account everything from implication of test scores
into societal values to the consequence of test use. 10
4−( )
𝐶𝑉𝑅 = 2 = −0.2
Face Validity (Logical Validity) 10
(2)
- what a test appears to measure to the person being tested
than to what a test actually measures. Zero CVR
Ex: inkblots: low face validity
- on the perspective of test taker 10
15 − ( 2 )
Lower Face Validity 𝐶𝑉𝑅 =
10
= 0
= decrease in test -takers motivation to do his/her best. ( )
2
- still, it can be relevant and useful and only if it is perceived as Positive CVR
relevant and useful, otherwise, negative consequences may
result. 10
- more of public relations than psychometric construct 9−( )
𝐶𝑉𝑅 = 2 = 0.8
10
(2)
Content Validity
- how adequately a test samples behavior representative of the
universe of behavior that the text was designed to sample CVR is computed per item
- include key components of the targeted construct
-exclude content irrelevant constructs Note: history test may considered as valid at one classroom
Ex: Prelims but not necessarily to others; consider the culture and poilitics
Chapter Chapter Chapter as factors.
1 2 3
25 25 25 Criterion – Related Validity

19
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
- how a test score can be used to infer an individual’s most in use.
probable standing on some measure of interest
B. Expectancy Data – provide info that can be used in
evaluating criterion-related validity of a test. Illustrates the
Characteristics of Criterion likelihood that the test-taker will score within some interval of
Relevant – applicable to the matter at hand scores in a criterion measure.

Ex: rest of succesful actor’s interest Expectancy Table – shows the percentage of people
(dapat relevant yung criteria) within specified test-score intervals who
subsequently were placed in various categories of
Valid – if test X is the criterion to validate test Y, dapat may criterion.
existing evidence na valid test X
Construct Validity
Uncontaminated - judgement about appropriateness of inferences drawn from
Criterion Contamination – applied to a certain criterion test scores regarding individual standing on a variable.
measure. If ang predicator variable ay nagamit din as criterion,
then contaminated na ang criterion Construct – informed, scientific idea developed/hypothesized
to describe or explain behavior. Unobservable, presupposed
Note: no methods/stats can gauge the extent to which criterion underlying traits that a test-developer may involve to descirbe
contamination has taken place, and no methods/stats can behavior. Must formulate hypothesis about the expected
connect such contamination behvaior of higher scores to lower scores. Unifying concept of
all validity evidence.
Concurrent Validity (Descriptive Validity)
- if test scores are obtained at about the same time as the Evidence of Construct Validity
criterion measures are obtained.
1. Homogenity – measuring a single
Ex: scores on psychodiagnostic test -> diagnosed patients -> construct/uniformity of a test in measuring a specific
existing construct.
Ex: correlation of a subscale and total test score.
- faster less expensive way to offer a diagnosis/classification.
*For Dichotomous Items: all items have significant
Test B Test A positive correlation sa total test scores. Higher scores
(Prior researches - brand new test tend to pass each items kesa sa low scorers.
supported to its - new purpose *Multipoint Scale: items that do not show significant
validity) - new population spearman correlation are eliminated. Stats: Cronbach
- validating criterion Alpha
*Item Analysis
Predictive Validity No. 3 = Higher Scores = mali
Lower Scores = tama
Scores: Ngayon Criterion: next 4 yearw Thus: Eliminate item number 3!
Feu CAT College GWA
2. Changes with Age
2 Types of Statistical Evidence - some constructs are expected to change overtime.

A. Validity Coefficient – correlation coefficient that provides a


measure of the relationship between test scores and scores on Ex: Reading Ability Test
the criterion measures Grade 3 Grade 6
- Pearson Score: 50 100
- Spearman Rho Valid if nagprogress yung scores
- Affected by Inflation of Range
3. Pre-test and Post Test Changes
Incremental Validity – degree to which an additional - formal education, therapy/medication OJT
predictor explain something about the criterion - ideally involves control group
measures that is not explained by predictor already
20
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
4. Evidence from Distinct Group
- also called as method of contrasted groups. Test Bias
- systematically prevents accurate, impartial measurement
Test of Depression
More Depressed Random Test Taker Random Error – Chance variation
Test Taker (Wallmart Shopper) Systematic Error – bias
Higher Scores Lower Scores
RATING ERRORS:
5. Convergent Evidence Also called as restriction of range errors or distribution
- higer correlation with a construct that is related. errors.
Test of Anxiety Test of Depression a. Severity / Strictness error: 0
Higher Higher b. Central tendency error: 50
c. Leniency / Generosity error: 100
Test of Introversion Test of Extroversion
Lower Higher Halo Effect: Focuses on the positive (bias)
To solve: Provide Training Programs
6. Discriminant Evidence/ Divergent
- should not be theoretically correlated
Methodological Problem
Maritable Satisfaction Social Desirability - Prevention during test development
Scale - estimated true score transformations (post hoc remedy)
(No Correlation at All)
Test Fairness
To Examine Both Convergent and Divergent - extent to which a test is used in an impartial & equitable way.
Validity:
Common Misunderstandings:
Multitrait Multimethod Matrix 1. We are all equal. Differences are not real
2 or more 2 or more 2. Unfair na iadminister ang test na iba ang standardization
traits methods sample.
- dapat may statistical evidences
- matrix/table that result from correlating variables - di naman agad agad invalid
(traits) within methods. 3. Remedying situations is biased and unfair.
- no it’s okay to give accomodations.
Ex:
-Extraversion -Personality Test
- Aggression - Behavior Observation

7. Factor Analysis - data reduction method


Exploratory Confirmatory
- Inductive - Deductive
Factor Loading – the extent to which a factor
determines the test scores.

Test of Bulimia

Bulimia Factor – Higher F.L (Convergent Evidence)


Anorexia – Lower F.L (Divergent Evidence)

Norming Factors: Knowledge, Judgement/ Verbal


Abstraction Ability > Mathematical

Validity, Bias and Test Fairness


- it is possible for a valid test to be used fairly or unfairly
21
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Chapter 7: Utility - for incrementall validity

Utility – usefulness or practical value of testing to improve Selection Ratio – no. of people na iha-hire na at no. of
efficiency, training and intervention. applicants.
Ex:
Other Factors Affecting Utility 50 𝑃𝑜𝑠𝑡𝑖𝑜𝑛𝑠
= .50
100 𝐴𝑝𝑝𝑙𝑖𝑐𝑎𝑛𝑡𝑠
A. Psychometric Soundness (RV)
- test scores tell us the utility of the test.
- a tests have utility if their use in a particular situations help us Base Ratio – percentage of people hired under existing
to make better decisions. (cost-efficient) system for a particular position
- if the selection ratio is very high, halos hired lahat.
- not all valid tests are outrightly useful test. Ex:
20 𝑆𝑢𝑐𝑐𝑒𝑠𝑓𝑢𝑙
B. Costs = .80
25 𝐸𝑚𝑝𝑙𝑜𝑦𝑒𝑑
- disadvantages, losses, expenses in both economic and non-
economic terms. Top Down Selection – process of awarding available
- funds for test purchase, supply blank test protocols, positions to applicants where by the highest scorer is awarded
computerized test processing, scoring and interpretation from in the first position.
the publisher of some service etc. Disadvantage – unintended discriminatory effects
- instrument that turns out to be ineffective
Hit Rate – correct classification
C. Benefits Miss Rate – incorrect classification
- does the benefit justifies the cost? False Positive – na hire tas mali
- profit, gains and advantages. False Negative – di nahire pero qualified
Economic:
Good section test in hiring: higher productivity, Base Rate – proportion of people in the population that
lesser sayang possess a particular trait, behavior, characteristics or attribute.
Non-Economic: Low Cut Score = false negative rate falls to zero
Higher quality and quantity of worker’s Note = Higher false negative ratio; higher false positive rates
Performance.
Lower time need for training; no. of accidents; High Cut Off Score = low section ratio – qualified per nareject
Lower turnovers, Low Cut Off Score = high selection ratio
Utility Analysis Ideal = moderate cut score = higher hit rate
- family of techniques that entail a cost-benefit analysis = lower error rate
designed to yield into relevant to a decision about the
usefulness/practical value of assessment. Return of Investment – ratio of benefits to costs
- cost benefit analysis
Lower Selection Ratio = lower hired applicants
How is Utility Analysis Conducted
Predictor ------------------------- criterion
A. Expectancy Data (test) (rating of performances on
- test taker’s scores lies within an interval (passing, acceptable the job)
or failing) of scores on a criterion measure. This must be linear
Taylor Russell Tables 𝐶𝑟𝑖𝑡𝑒𝑟𝑖𝑜𝑛 𝑆𝑐𝑜𝑟𝑒
- inclusion of a particular test in the selection system will
𝐴𝑛𝑜𝑛𝑔 𝑠𝑐𝑜𝑟𝑒 𝑎𝑛𝑔 𝑝𝑎𝑠𝑠𝑒𝑑 𝑠𝑎 ℎ𝑖𝑛𝑑𝑖?
improve selection
- provide an estimate of the percentage of employees hired by
the use of a particular test who will be successful at their jobs
(3 variables: test validity, selection ratio and base rate)
- best used if valid and low selection ratio
22
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Naylor Shine Tables Multiple Cut Score: use of 2/more cut scores with reference to
- provide an indication of the difference in average criterion one predictor for the purpose of categorizing test takers.
scores for selected group and original group.
Ano ba ang nai a add ng test na to sa established procedures? Multiple Hurdle/ Multistage
- several predictors
Taylor Russell – increase over current procedures - cut score is in place for each predictor used
Naylor Shine – increase in average score on some criterion Ex: Beauty Pageant
1. Beauty (screening)
Brogden-Cronbach-Gleser Formula 2. Body (swimsuit)
- used to calculate a dollar amount of a utility gain resulting 3. Brains (q & a)
from the use of a particular selection instrument under specific 1. Written application
condition. 2. Additional materials
3. Personal interview stage
Utility Gain – estimate of benefit using a particular test/
selection method. Compensatory Model of Selection
Productivity Gain – estimated increase in work output - higher score on english can compensate lower score in math
- post-hire training
Decision Theory and Test Utility - different weighs per predictor (higher weigh den sa mas
- more general, better model for selection and placement important)
(compared sa classical approach: correlational model) - stats used: multiple regression
Adaptive Treatment – yung job requirements yung mag-
adjust sa ability ng applicant Methods of Setting Cut Scores
Value of the Test – improves hit rate 1. Angoff Method (CTT)
- interrater reliability
Higher Selection Ratio = lower cut score = false positives - SME
Lower Selection Ratio = higher cut score = false negatives 2. Known Groups Method/ Method of Contrasting
Group
*Decision Theory provides guidelines for setting optional cut-off
scores. Pass and Fail
(Gaano ka-pass? Gaano ka-fail?)
Some Practical Considerations
1. Pool of Job 3. IRT-Based Method
- low qualifications - bago ka makapass, dapat above minimum level of
- high qualification (advanced trainings) difficulty. (Minimum level of difficulty is being
- top performers of the test are probably the least determined by experts)
likely of all applicants to be actually hired.
- Item-Tapping Method
2. Complexity of the Job for licensure exam
- the more complex the job, the more people differ on arrangement of items in histogram
how well or poorly they do that job -difficulty level- consensus of experts

3. Cut Score in Use - Bookmark Method


- reference point derived as a result of a judgement Training kung ano ba ang required for minimal
and used to divide a set of data into 2 or more competencies to pass
classifications they derive. Book of items (ascending order of difficulty)
BOOKMARK SERVES AS A CUT SCORE
2 Types of Cut Off Scores Issue: training of experts, floor/ceiling effect, length of
a. Relative Cut Score item booklets.
- norm-referenced cut score
- top 10% may A 4. Other Methods
b. Fixed Cut Score - Method of Predictive Yield
- absolute score *Decision – theoretic approach
- 95 – A * By R.L Thorndike

23
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
* Norm-referenced methid
* depends sa no. of positions to be filled, likelihood
na iaccept yung job & distribution of applicant scores

- Discrimination Analysis (Discriminant Function


Analysis)
*relation ng identified variable -→ 2 naturally occuring
groups
(score sa test) (successful or unsuccessful)

24
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Chapter 8: Test Development 2. Test Construction
- process of creating a test
Scaling (L.L Thrustone – forefront in developing
Text Conceptualization scaling methods)
- the process of setting rules for assigning number in
measurement.
Test Construction
Absolute Scaling – procedure for obtaining a
Test Try-out measure of item difficulty across sample of test takers
who vary in ability
Item Analysis
Types of Scales
Test Reversion - Age-based Scale – function of age
- Grade-based Scale – function of grade
- Stanine Scale – raw scores transformed from 1 to 9
- Unidimensional vs. Multidimensional
1. Test Conceptualization - Comparative vs. Categorical
• Thoughts/ self-talk Scaling Methods
• Review of literature tungkol sa test/
psychometric soundness Rating Scale – grouping of words, statements or
• Emerging social phenomenon/ pattern of symbols on which judgements on the strength of a
behavior particular trait, attitude or emotion are indicated by the
• In response to a need to assess mastery in test takers.
an emerging profession
Summative Scale – final test score is obtained by
➢ Good Item sa Norm Referenced summing the ratings across all items.
- Likert Scale – usually to attitude; easy to construct;
Question # 1: High Scorers = Correct usually reliable; Best: 1 to 5
Low Scorers = Wrong
However, insufficient & irrelevant ang rankings kapag Rating Scales – Ordinal in Nature
knowledge of mastery ang itetest.
1. 2 3 4 5
➢ Criterion Referenced ( don’t have equal intervals )
- usually sa licensure exams and educational context Not necessarily the same
- developed thru conceptualization of the knowledge
or skills to be mastered Method of Paired Comparison
- experimentation with different items, tests, formats - choose one from two statements/others
or measurement procedures will help the test ADV: forces test takers to answer
developer to discover best measure of mastery for
the targeted skill or behavior. Comparative Scaling (Q-Sort technique)
- entails judgement of stimulus in comparison with
Group A Group B every other stimulus on the scale.
Mastered Clueless (sorting – ordinal scale)

The items that best discriminate these 2 groups would Categorical Scaling
be considered as good items.
Never Sometimes Always
Pilot Work (Sorting)
- preliminary research sorrounding the creation of a
prototype of a test Guttman Scale (Ordinal)/ Scalogram Analysis
- literature reviews, experimentation, creation, revision - from weaker to stronger expression of beliefs,
and deletion of preliminary test items. attitude and feeling

Assumption: Agree sa A? / B C D
25
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
X A → B? C D DISADV: focus more limited area; subjectivity in
scoring and interscorer difference.
Datas are analyzed through:
Writing Items for Computer Administration
Scalogram Analysis – item analysis procedure
and approach to test development that involves Item Bank – relatively large and easily accessible collection of
a graphic mapping of a test-taker response. test questions.
Goal: to obtain an arrangement of items
wherein agreement with 1 item Computer Adaptive Testing – interactive, computer-
automatically converts endorsement of administered test-taking process wherein items presented to
less extreme positions. the test-taker are based on the previously presented items.
Madalas gamitin sa consumer ADV: lesser time; reducing measurement error by
psychology 50%; reduce floor effect and ceiling effect

Method of Equal Appearing Intervals Floor Effect – diminished utility of an assessment


- by Thurstone tool for distinguishing test-takers at the low end of
-Thurstone scale the ability being measured.
- for interval data
- direct estimation of one’s attitude Ceiling Effect – diminished utility of an assessment
tool for distingushing test takers at the high end of
the ability trait or other attribute being measured.
Writing Items
Item Branching – ability of the computer to tailor the content
Item Pool – the reservior or well from which items will or will and order of presentations of test items on the basis of
not be drawn for the final version of the test. responses to previous items with determined algorithm.
*pwedeng idepende sa previous items or
*Allowed naman ang mag interview to assist in item para di madaya
writing. * pwede iapply sa personality test
Item Format – form, plan, structure, arrangement and layout of * pwede pang detect ng inconsistent responding
individual test-items
a. Selected Response Format – fixed options
b. Constructed Response Format – supply/create own Scoring Items
answer
1. Cumulative Model
3 Types of Selected-Response Format - most commony used
1. Multiple Choice Format – stern, correct, distractors - simple & logical
2. Matching Item – premise & response; add more - higher score, higher the test taker on the trait
responses to minimize guessing error measured
3. True/False – Binary Choice Item (Only 2 ) – yes/no, 2. Class Scoring/ Category Scoring
/ or x, agree or disagree; contains single idea, not too - test taker’s responses earn credit toward placement
long, not too subject to debate; no distractions in a particular class or category with other test takers
involved; more prone to guessing. whose pattern of responses is pressumably similar in
some way.
3 Types of Constructed Response Items 3. Ipsative Scoring
a. Completion Item – provide a word/phrase to - comparing a test-takers on one scale within a test to
complete another scale within the same test.
b. Short Answer Item
c. Essay Item – writing a composition, recall of facts, 3. Test Try Out
understanding, analysis and interpretation - since may item pool na, you can now try out the test
- i-try mo sa population na target mo
ADV: depth knowledge, creative integration and - not fewer than 5, as many as 10 per item
expression of material ; requires recall, organization,
planning and uniting ability. The more subjects, the better

26
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Lower subjects = phantom factor – factors that actually are just index, higher internal consistency.
artifacts of the small sample size (in F.A)
Product of Item Score Standard Deviation (s),
What is a Good Item? Correlation ( r) between item score and total
a. Reliable and Valid test score. (Point biserial method)
b. Helps discriminate test-takers
Higher scores answered incorrectly = X (eliminate)
Lower scores answered correctly = X (eliminate) Factor Analysis and Inter-Item Consistency
- is this test measure the same thing?
4. Item Analysis Low factor loading = eliminated
a. Item Difficulty Index - if too many items taps the same area, alisin yung
- “p” = percent of people passing the item sa weakest.
achievement tests - Useful in test interpretation process (lalo na if
- range from 0 to 1 icocompose sa ibang groups)
(0: mali lahat)
(1: tama lahat) c. Item Validity Index
- indicates the degree to which a test is measuring
Ex: 50/100 Item 2 / = P2 = 50/100 = .5 what it purports to measure.
75/100 Item 3 / = P3 = 75/100 = .75 (easier than Higher item validity index, higher criterion-related
item 2) validity.
- item score standard deviation
> in personality testing: Item Endorsement Index – - correlation between item score and criterion score (
percent of people who said yes to, agreed with or r1c)
otherwise endorsed the item.
𝑠1 √𝑝1 (1 − 𝑝1 )
To get the overall test difficulty, just average all the item
difficulty under of each item. (r1c) x (s1) = item validity index
- best used when the goal is to maximize the criterion
𝑆𝑢𝑚 𝑜𝑓 𝑒𝑎𝑐ℎ 𝑖𝑡𝑒𝑚′ 𝑠 𝑑𝑖𝑓𝑓𝑖𝑐𝑢𝑙𝑡𝑦 𝑖𝑛𝑑𝑒𝑥 related validity.
𝑡𝑜𝑡𝑎𝑙 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑖𝑡𝑒𝑚𝑠 𝑜𝑛 𝑎 𝑡𝑒𝑠𝑡
d. Item Discrimination Index “d”
Optimal Average Difficulty is : .5 - separate high and low scores
Individual Items: .30- .80
* higher d, the greater the number of high scores
*for selected response, consider guessing: chance success answering the item correctly.
proportion *negative (-) d-value = lower scores got it right!
Ex: true or false (eliminate/revise)
½ = .50 -range from -1 to +1
Therefore, the optimal item difficulty is between .50
and 1.00 = 0.75 Item Characteristic Curves
Midpoint representing Optimal Item Difficulty is obtained by - graphic representation of item difficulty and
summing the discrimination. The steeper the slope, the greater the
𝐶ℎ𝑎𝑛𝑐𝑒 𝑠𝑢𝑐𝑐𝑒𝑠𝑠 𝑣𝑎𝑟𝑖𝑎𝑡𝑖𝑜𝑛 + 1.00 item discrimination.
2
Other Considerations in Item Analysis
. 5 + 1.00 1.5 1. Guessing
= = .60
2 2 - formula scores: corrections for
5 Multiple choice item = 1/5 or .20 guessing
.20+1.00 1.20 a. educated guess; not entirely random
= = .60 b. problem of omitted item
2 2
c. others are great really “lucky”
d. level of instructions
b. Item Reliability Index
- leave it blank if unsure
- indicates the internal consistency of a test. The higher
27
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
- affected by risk-taking
- not a big deal in personality tests
2. Item Fairness 5.Test Revision
- degree that a test item is biased
Biased Test Item – item that favors one group or Test Revision as a Stage in New Test Development
other group -characterize items according to its strength and weakness
Solution – item characteristic curve; differential Educ Placement/Employment: ITEM BIAS
item functioning Most highly skilled: ITEM DISCRIMINATION
- same proportion of person from each group Note: All domains must be covered
should pass any given item or the test. Walang
kinikilingan na gender, race, social class and etc. Test Revision in the Life Cycle of an Existing Test
Rorschach / Never gets old
3. Speed Test TAT X must be revised
- considered na di alam/mahirap yung last part
kasi short sa time. APA: I revise mo when significant changes in the domain
- measures of item discrimination may be represented, or new conditions of test use and interpretation,
artificially high for late appearing items make the test inappropriate for its unintended use.

Solution: restrict item analysis to items finished. Reasons for Revision


1. Luma na yung test + hindi na makakarelate
(not recommended) 2. Luma na yung vocabulary
1. Smaller number of test takers, less reliable 3. Cultural Changes, words have new meanings
2. Yung matalinong nakatapos – item analysis (offensive)
(all/selected) 4. Norms: group membership change
3. Higher iq = magmumukhang madali yung 5. Norms: age-related shifts in abilities, age-extension of
items sa dulo. norms
6. Improving tests R & V
Quantitative Item Analysis 7. When there are revisions in the theory which the test
- rely on verbal rather than mathematical/stats. was based
- encouraging test takers to discuss aspects of
test taking experiences. (group or individual) Revision Process: follows same procedure when making a
- general term for various non-statistical brand new test
procedures designed to explore how individual Note: Formal item analysis must be employed to evaluate the
items work. stability of items between revisions of the same test. (it may be
- explore thru verbal means: interviews, group directly comparable)(change in examinees performance)
discussions.
Cautions: providing test takers with the opportunity to Cross Validation & Co-Validation
describe a test can be providing students with the opportunity Cross Validation – attempt to validate a test that is already
to describe their instructors. If failed, iovercriticize. valid on one particular norm. (Expect Validity Shrinkage:
Decreased Validity Index)
Co-Norming – if for norming purposes/less sampling error.
a. “Think Aloud” test administration Co-Validation- for validity purposes
- to shed light on test taker’s thought processes
decreasing administration of a test. Quality Assurance during Test Revision
- done one on one - Strict adherence with standardized procedures
- identifies why they are misinterpreting it, - Recruit examiners – educational experience and
- perceive, interpret and response. professional experience, administration experience
b. Expert Panel with intellectual measures, certification and licensing
- sensitive review – study of test-items, typically status.
conducted during test-development process, in which - Will undergo training – written guidelines for test
items are examined for fairness to all perspective test administration classroom instruction; practice test
takers and for the presence of offensive language, administration; videotaped daw.
stereotypes or situations. - Administer a quiz/ other means to asses training

28
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
- Examiners required to review a case prior to testify Possible Applications of IRT
Newsletter – designed to provide an ongoing way to
maintain quality assurance in test administration. 1. Evaluating the Properties of Existing Tests and
- Training for scorers Guiding Test Revision
- IRT Information Curves
For Restandardization: - paano nagwork ang individual item to
measure different levels of underlying
2 qualified scores rescore each protocol. construct.
If may discrepancy: isolve ni resolver - tailor an instrument for high precision
(pinipili through demonstration of (provide high info)
exceptional scoring accuracy & previous - add more items/response options. Higher
scoring experiencing) theta levels.
2. Determining Measurement Equivalence Across
Anchor Protocol – test protocol scored by highly Test-Taker Population
authoritative scorer that is designed as a model for - ensure that same construct is being measured
scoring and a mechanism for resolving scoring despite the language translation
discrepancies
Differential Item Functioning
Anchor Protocol ------ Scoring of Other Protocol Depression
100 81 PH – Lugmok Japan – not so obvious
^ Scoring Drift ^ Lower validity for between group comparisons
- Data entry phase = computer will detect some
inconsistencies DIF Analysis
– test developers scrutinize group by
The Use of IRT in Building and Revising Test group item response curves looking for DIF
items.
CTT - used to evaluate measurement equivalence in
item content
Advantages Disadvantages - evaluation of item-ordering effects
- Smaller sample sizes - Dependent on samples - effects of different administration procedures.
- Simple math models - Larger items DIF Items – same construct but different
- Wide application - Each items contribute probabilities as a function of their group
- Most researches are equally membership.
familiar 3. Developing Item Banks
- Compatible sa data
analysis
- Stats related software

IRT
Advantages Disadvantages
- independent of samples - complicated, many are
- shorter test unfamiliar
- greater precision - large sample size
- inappropriate in many
applications
- few IRT packages
softwares.

29
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Chapter 9: Intelligence and Its Measurement Alfred Binet
- walang binigay na definition. Components ang binigay nya.
Henry Goddard – (Intelligence is not observable) - reasoning, judgement, memory and abstraction
- degree of availability of one’s experiences for the solution of - abilites cannot be separated because they interact
his present problems and anticipation of future ones.
David Weschler
For Non-Psychologists: - agreggate or global capacity of the individual to act
1. Intelligence: reasons logically and well, reads wisely, purposefully to think rationally and to deal effectively with his
with common sense, keeps an open mind, reads with environment.
high comprehension - Included Non-Intellective Factors
2. Unintelligence: does not torelate diversity of users, - connative, affective/personality traits (drives, persistence,
does not display curiosity, behaves with insufficient goal awareness, potential to percieve & respond to social,
consideration of others. moral and aesthetic values)
For Both Lay People and Psychologists:
a. Practical Problem Solving Ability - listens to all Measure Qualititavely Differentiable Abilities
sides of argument Verbal ------------------------------------- Performance
b. Verbal Ability – good vocabulary
c. Social Competence – time for appointments WAIS III – verbal comprehension, working memory, perceptual
org, processing speed (additional indexes)
3. Academic Intelligence – studying hard
a. Lay People – interpersonal and social
b. Experts – motivation Jean Piaget
4. Everyday Intelligence – practical problem solving - focused on development of cognition
ability – social competence, character and interest in - intelligence: evolving biological adaptation to the outside
learning and culture world.
* higher cognitive skills = higher adaptation (mental trial and
Siegler and Richards : Developmental Psych Class error > physical trial and error)
Infancy – physical coordination, awareness of people, verbal - neither solely motivation nor learning
outpost, attachment - interaction with environment, psychological structures
Childhood – verbal facility, understanding, characteristic of become re-oragnzied
learning
Adulthood – verbal facility, use of logic, problem solving Stages of Cognitive Development
- maaring iba ibang ages pero its order was
Yusen and Kane (6.1, 3 and 6) unchangeable.
Younger – interpersonal skills - it happens because of interactions between
Older – academic skills biological factors and learning.

Intelligence Defined: Views of Scholar and Test Biological Aspects – due to inherent maturational
Professionals Mechanisms
Schema – organized action or mental structure that
Spearman – intelligence has many meanings, sa sobrang leads to knowing/understanding
dami, wala na Plural: Schemata
Wesman – no general agreement
Edwin G. Boring – intelligence is what the test tells Assimilation: organizing new into to fit already
establihsed thought
Francis Galton Accomodation: changing the thought to fit new info
- first person to establish heritability of intelligence (framing *to promote change (disequilibrium) & acquire new
nature-nurture debate) info, perception & cumulative skills: Physical
- intelligent: Best sensory abilities Activities and Social Peer Interaction
- info – senses/ sensimotor & perception related test (test of
usual acquity/ hearing ability); separate test for different Interactionism
components – interaction between heredity and environment to develop
intelligence.
- Binet, Wechsler and Piaget
30
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
d. musical
Factor-Analytic Theories of Intelligence e. spatial
- identifying ability or groups of abilities deemed to constitute f. interpersonal – ability to understand others
intelligence ex: succesful sales people, politicians, teachers,
clinicians, religious leaders
Information Processing Theories – identifying g. intrapersonal – ability to understand oneself
specific mental proccesses that constitutes
intelligence Raymond Cattell – Gc Gf Dependent
- crystalized – acquired (formal/informal) ex: retrieval of info +
Charles Spearman – Two Factor Theory application of general knowledge.
- pioneered new techniques to measure intercorrelations - Fluid – nonverbal, culture-free, independent of instruction.
between tests.
Horn
Predictor of Overall – G Factor – All Intelligence tests Vulnerable Abilities – decline with age, if injured, di na
Intelligence have in common babalik ex: GV
(high positive correlations) Maintained Abilities – di-nagdedecline, bumabalik ex:
Quantitative, reading and writing, long terminal
G Factor – Specific + Error
(lower to moderate correlations) Carroll – Three Stratum Theory of Cognitive Abilities
Heirarchal Model – Listed in Stratum
a. G / General Intelligence
S b. 8 Abilities / Processes
c. Level Factors / Speed Factors
S S
G CHC Model by: McGrew-Flanagan
S low r S
Cattell/Horn Carrol
E Low r -GX -G/
^r S - Q / RW (Broad) Applied - Q / RW (first stratum,
narrow abilities)
E * definition and codes

To solve: Based on F.A studies (not relevant to cross batt)

Group factors: common to group of activities but not to all. - No G Factor


Neither G nor S. Ex: linguisic, mechanical & arithmetical - to improve psych as in education
abilities. (psychoeducational assessment)

G factor- General electrochemical mental energy available to Cross Battery Assessment – assessment that employs test
the certain for problem solving from different test batteries and entails interpretation of data
- associated in facility in one’s own thinking of one’s own from specified subtests to provide comprehensive assessment.
experiences and in making observations and extracting Confirmatory Factor Analysis – used by CHC, more
principles. powerful, preffered over EFA
- best measure: abstract reasoning problems - greatest value from a heuristic stand point.

Guilford – ayaw sa G The Information Processing View


Thurstone – Primary Mental Abilities (7)
Difficult to not top G. Aleksandr Luria – Focus: paano (how) tayo nagproprocess?
Gardner – theory of Multiple Intelligence Two Info: Processing Styles
Simultaneous/Parallel Successive/Sequential
a. logical - mathematical Processing Processing
b. bodily – kinesthetic - integrated all at one time - individually processed in
c. linguistic - synthesized sequence
31
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
- multitasking mamsh! - logical and analytic Mental Age – index that refers to the chronological age
Ex: seeing art/ paintings, - isa-isa lang bes! equivalent of one’s performance on a test or subtest; in
map reading Ex: memorizing a telephone reference to norms
number, learning the
spelling of a new word Children – educational placement
Applied in: Kaufman Assessment Battery for Children Adults – clinically relevant info; to measure learning potential
(KABC II) Pass Model by Nagleiri and Das and skill acquisition; judging one’s competency to create
decisions; vocational & career decisions and transitions
a. Planning – strategy development for problem sharing
b. Attention/Arousal – recepting to info Theory in Intelligence Test Development & Interpretation
c. Silmutaneous
d. Successive Binet – adapted spearman’s G factor - universal unity of
intellective function
To Tap PASS: Cognitive Assesment System (CAS) (issues Wechsler – verbal and performance – classification of subjects
on validity) where intelligence manifests; desired/coincided with
Thurstone’s PMA
Robert Sternberg – Triarchic Theory of Intelligence
Intelligence – means to govern ourselves so that our thoughts Multi Factor Theory of Intelligence by Thorndike
and actions are organized, coherent and responsive to both a. Social Intelligence – dealing with people
our internally driven needs and to the needs of the b. Concrete Intelligence – dealing with objects
environment c. Abstract Intelligence – dealing with verbal and
mathematical symbols.
1. Metacomponents – planning, monitoring what we do - G Factor – general mental ability; total number of
and evaluating what we have done modified neural connections or bonds available in the
2. Performance Components – administer the brain. Higher bonds, higher ability to learn.
instructions to metacomponents
3. Knowledge-Acquisition Components - “learning Intelligence: Some Issues
how to do something in the first place”
Nature vs. Nurture
Succesfull Intelligence – one effectively adapt, share, shapes
and select environment in away that conforms both personal Preformationism – all living organs are performed at birth. Di
and societal standards of success. (analytic, creative and na maiimprove
practical abilities; capitalize on strenghts; compensate for Nicholas Hartsoeker – existence of homunculi (little men) sa
shortcomings) sperm cell.
Predeterminism – one’s abilities are predetermined by genetic
Measuring Intelligence inheritance. Di na maiimprove ng learning

Types of Task-Used in Intelligence Tests Arnold Gesell – practice had little effect on tasks such as
Infancy (0-18 months) – sensori-motor development – non- climbing stairs, cutting with scissors, etc.; training does not
verbal motor response transcend maturation.
Ex: Altering Response – infant’s capacity for Maturation – responsible sa intelligence. Hindi learning mental
responsiveness development. Progressive morphogenesis of patterns of
behavior
Orienting Response – turning in the direction of
Stimulus Infancy – period wherein people realizes racial inheritance.
- build rapport to infants/examinees na di pa alam ang meaning Galton – hereditary (based on family histories of eminent
ng cooperation and patience people)
- structured interview sa examinee’s parents, guardian, etc. Dugdale – degeneracy was also inherited
Goddard – feeblemindedness – product of recessive gene
Older Child – verbal and performance abilities Terman – father of american version of Binet test;
Mexican/Native American – genetically inferior culture.
Pearson – immigrating jews (compared to Native British) were
inferior physiologically and mentally.

32
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Children Born to Poverty Stroken Parents health); moderate political and social views; successful in
adopted educational and vocational pursuit; commits less crime
Middle Class Lower Level Winner – gifted children cease to exploit their gift; frozen to
Higher IQ Lower IQ expertise; may lose motivation as adults.

However if matalino si mother, Higher IQ talaga kahit Construct Validity of Tests and Intelligence
anong SES ng nagadopt na family. - know how the test developer defined intelligence.

*Level of Parental Education – better predictor of IQ than Other Issues


parental norm in Portugese adolescents. 1. Flynn Effect – Intelligence Inflation
3 Points per decade – vary dipende sa culture
Verbal, Perceptual and Image Rotation (VPR) Model loading/crystalized/fluid
- by Wendy Johnson - progressive rise in intelligence test scores that is
- intelligence has a strong genetic basis expected to occur in a normed test intelligence from
- G factor – responsible to our intelligence the date when the first test was first normed.
Fluid Intelligence (product of heredity)
Nurture: pre-natal & post-natal, SES, educational opportunities, Crystalized Intelligence (product of
parental modelling relevant to intellectual development. learning and environment)

Inheritance and Interactionism


- we inherit a genetic potential (coded genetically)
- optimistic view
- we can use environment to push our genetic potential to the
limit. Being Gifted:
Ottis – Lennon School Ability Test
Hereditary Influence: childhood: 41% - screening for candidates fir gifted programs
Developmental Perspective adolescence: 55% - group test
Adulthood: 66%
* habang tumatanda tayo, we increasingly modify our
environment to complement genetic tendencies. Different Aptitude Test
*Genetics has higher effect on childhood experiences, social - to screen for social abilities/aptitudes
environment and behavior, leisure activities and various life
events. Structure of Intellect (SOI)
- by Guilford
Intelligence Test – most effective and most frequently - can assess creativity thru the use of personality and
used instrument for identifying gifted children. biographical influences or thru measure of creative
thinking
Stability of Intelligence
- stable na pag adult; higher in vocabulary, lower in arithmetic, Other Tools to Identify Gifted:
verbal analogies and non-verbal skills. - case studies, behavior rating scales
- nominating techniques
Young Adult Intelligence – most important determinant of - members of a class, team, work unit or
cognitive performance as an older adult. other type of group are asked to select
Full Scale IQs – remain (individual abilities: may change people in response to this question.
significally)
Advantage Disadvantage
IVNIK – Stable: Verbal Intellectual Skills - advanced - depression, feelings of
Least Stable: recall of newly learned info. - admiration and respect isolation, suicidal
* not all are absolutely stable. Some are just more stable than - financial freedom ideation, gestures or
others. action.
- cultural pressure to be
75 – decline in cognitive abilities average
Terman – gifted children tended to maintain their superior
intellectual ability. ; lower mortality rates (better physical/mental
33
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Culture Loading – test incorporates vocabulary,
Cattell – dahil sa crystalized concepts, traditions, knowledge and feelings
- improvements in education, greater educational associated with particular culture.
opportunities, greater cognitive demands on - subjective, qualitative, non numerical subject
workplace.
- Flynn – dahil sa fluid Culture Fair
- minimize effects of culture
2. Personality - common to all different cultures
Intelligence + Personality = Street Efficacy - all tasks are motivating to all grasps
- non-verbal (assembling, classifying, manipulating)
Intelligent: - simpler clear directions administered orally
Aggresiveness with peers, initiative, higher need for - lack predictive validity
achievement, competitive striving, curiosity, self- - mababa parin scores ng minority
confidence and emotional stability - different values, orientation, subcultures
Non-Intelligent:
Passivity, dependence, maladjustment Culture Specific – more valid measure of mental
development
Infants: Temperament (Personality) Measure Streetwiseness – ex: black intelligence test
- distinguishing manner of the child’s observable of cultural homogenity (BITCH) – Williams
actions and reactions. Culture Fair = Cattell Culture Fair I.T
- vigor of responding, general activity rate,
restlessness during sleep, irritability and cuddliness.
- differences in temperament: health risks-related Cultural/Regional Uppercrust Savvy Test (CRUST)
behaviors, dangerous driving, alcohol dependence, - perfect score if part ka ng society’s upper crust
unsafe sex, violent crime. - lack predictive validity
3. Gender - provided little useful/ practical info
Males Females - not relevant in educational purposes
- G Factor - language-related skill
- visual spatialization - Try out or Pilot test sa different ethnic backgrounds then
eliminate items na culture loaded/culture bias
4. Family Environment - Analysis of individual items & tests predicitve validity
- loving home where their safety and welfare are of No bias if available for use.
the utmost concern given ample opportunity for
learning and growth Other sources of bias: criterion for referral for assessment,
- presence of resources, parental expression of conduct of assessment, conduct of assessment, scoring and
concern about achievement, parental explanation of interpretation
discipline policies in a warm, democratic home
environment Intelligence: predicts school performance, years of education,
- Divorce: social status, income; negatively concealed with juvenile score
- Twins: different intrauterine environment *When administers intelligence test: great responsibility is
through preparation
5. Culture
Binet Simon Test – designed to separate “natural
intelligence form instruction” by disregarding degree
of instrutctions.

Culture Free Intelligence Test


- if cultural factors can be controlled, then differences
between cultures will be lessened.
- eliminate verbal. Ipasok si NV → minority (2) adult &
children; not a good predictor of success in acad and
business.

34
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Chapter 10: Test of Intelligence
Item Book 1: 2 routing test
Binet Item Book 2: non-verbal
– to screen for children with developmental disabilities in Paris Item Book 3: verbal
schools.
- created the first formal test of intelligence (1905) Ceiling: Highest level item (intellectually gifted)
- used spearman’s g factor then later adapted CHC

Stanford-Binet Intelligence Scales

First Edition: lack of representativeness ng Basal Level/Base-Level Criterion


standardization sample. – need mameet to continue; used to
- first published intelligence test to provide organized and describe a subtest with reference to
detailed administration & scoring specific test-takers performance
- first introduced the concept of IQ
- alternate item: pang sub if di maayos pag aadminister sa Ex: 2 constant items
regular item. If sunod sunod mali, na reach na ceiling
1960: deviation IQ : compose our performance to same age
level then discontinue
standardization sample.
Earlier versions:
𝑀𝐴
Ratio IQ: 𝐶𝐴 𝑥 100
Age Scale: items are grouped by age
Floor: Lowest level (developmentally delayed)
SB4: Point Scale: grouped by subtest/categories
If vague: “tell me more”
Cattell – Horn Model
Most SBS items are UNTIMED
Test Composite: index derived from the
combination of one or more subtest scores.
Testing the Limits
SB5: 2 – 85 and above
- Administering test items beyond the level at which the
Full scale IQ: all 10 subt.
test manual dictates discontinuance
- based on CHC Model
- Not scored, ilalagay lang sa report na nag conduct ka
- balance and verbal & non-verbal
then include the findings.
- issues on applicability on clinical population
Extra Test Behavior – behavioral observation
Adaptive Testing – tailored testing, sequential testing,
The Wechsler Test
branched testing and response-contingent testing.
- Individually administered
- From preschool to adulthood
Mid Difficulty – level of difficulty where adaptive testing
usually begin with. - All are point scales
o Deviation IQ (100,15)
Advantages: o Cinocompare sa age group
1. Collect minimum amount of the info in the minimum a. WAIS IV: 16 to 60 years & 11 months
amount of time b. WISC IV: 6 to 16 years & 11 months
2. Facilitates rapport c. WPPS-III : 3 to 7 years & 3 months
3. Minimizes examinees’ fatigue
WAIS IV (WB: TV Network)
Routing Test – task used to direct/route examinee to a - To evaluate the intellectual capacity of it’s
particular level of question. multilingual, multinational and multi cultural clients.
Core Subtest – used to obtain a composite score
Abbreviated Battery IQ score Supplemental/Optional Subtest – additional clinical
- Non-verbal fluid reasoning info/extending number of abilities sampled
- Verbal Knowledge
Pwede mag sub si supplemental kay core if:
- Contains teaching items – demo ng task
35
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
1. Incorrect administration of core subtest - lower items, lower reliability and lower validity
2. Naexpose si assesee
3. Physical limitation (di naperform) * if need ng intervention, administer the whole test

Practice Items/ Teaching Items – lower scores = truly a deficit • Wechsler Abbreviated Scale of Intelligence (WASI)
FSIQ Floor: 40 1999
FSIQ Ceiling: 160 o 6 – 89 years old (screen for intellectual
ability)
20-34 = period of optimal performance X o Two subtest: vocabulary and block design
same age norms / o Four subtest: vocabulary, block, similarity
and matrix reasoning + core with FISQ: top
WISC IV (2003) wide range of cognitive abilities
- Warming to CHC model
- G (Caroll, Wechsler) – FSIQ from core subtest only • Kaufman Brief Intelligence Scale
- Interrelated daw ang cognitive functions • Wide Range Intelligence Test = (WRIT)

7 Processes Scores: how the test taker processes Both with high correlation with WASI (convergent validity)
Note: WISC does not yield separate verbal & performance IQ
scores Wechsler Test in Perspective

WISC IV VS SB 5 Factor Analysis


(Similarities) - things that co-occurs tend to have a common cause
- 6 to 16 years old 2003 -2 – 85 - used to discover patterns of correlations coefficients suggests
and 11 months Individually - without the existence of underlying psychological dimensions.
- with administered supplemental - help researchers decide how best to summarize large
supplemental Child-friendly subtest amounts of info by using few scores.
subtest materials - with abbreviated
- short form Both have battery IQ for 2 Line of Demarcation – diagonal line indicating below 1.00
- included parent software for subtest correlations
education as scoring and report - No patient educ EFA: Most commonly used
variable writing as variable Confirmatory Factor Analysis – yield fit statistics – alin sa
- higher floor for Norming sample: - lower floor for mga models/hypothesis we tested ang most in agreement with
intellectual 2,200 intell deficits the data
deficiency CHC MODEL - includes SES
- no SES and WITH G and test taker Other Measure of Intelligence
educ variable educ
Individual Administration
WPPSI-III
- first major intelligence test that adequately sampled to total 1. Kaufman Adolscent and Adult Intelligence Test
population in US including racial minorities (KAIT)
- 2 years and 6 months – 3 years and 3 months Kaufman Brief Intelligence Test (KBIT)
- re affirmed V,P,FSIQ Kaufman Assessment Battery for Children (KABC)
General Language Composite – all optional - first flagship test
subtest/scores - focus: information processing (distinction between
Processing Speed Qoutient sequential and simultaneous processing)
- mas preferred ang SB5 over WPPS III - drawn on Alexandr Luria
2. Differential Ability Scales (DAS) – educational
SHORT FORMS settings
- pinapaikli to reduce time needed 3. Good Enough’s Figure Drawing Test
- short attention span (separate sampling of representative - reliable but issues validity
subtest) - screen for assessing intelligence
- for screening purposes only (not for important decisions) - draw human figure
- with EST “est” on scores - also used to assess personality

36
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Group Administration - first group intelligence test to be used in US
a. Group test in Military schools
1. Robert M. Yekes - measure abstract thinking reasoning ability
Army alpha: could read - assist in school evaluation & placement
Army beta: foreign born reunits, could not read decision making

Objective: measures the abiltiy to be a good Measures of Cognitive Styles and Specific Intellectual
soldiers (after the war ginamit to kasi mas cheap Disabilities
& mas madali lesser SB5. Na aliw sya)
2. Army General Classification Test (AGCT) - Cognitive Style: psychological dimension that characterizes
WWII the consistency with which one acquires and processes
3. OSS/Central Intelligence Agency – select spies information. (Critical Thinking, Music, Art Appreciation – not
and secret agents to work abroad tapped by popular IQ test)

Group Tests Used Today in the Armed Forces Witkin’s Field Dependence vs. Field Independence
1. Officer Qualifying Test (115 multiple choice) Dimension
- used by US Navy as an admissions test - degraded pictures test (Maria Kozhevnikov)
to a officer candidate school Visual Object Ability – ability to process info about the usual
2. Airman Qualifying Exam (200 MC) appearance of objects as well as potential properties of object
- given to all US Airforce volunteers (shape, color or texture)
3. Armed Services Vocational Aptitude Battery
(ASVAB) Creativity - problem solving, originality in perception,
- for new remits sa lahat ng armed originality in abstraction
services
- for students – future career plans
- most widely used aptitude test in US Measures of Creativity

Armed Forces Qualification Test (AFQT) a. Originality – innovative /non-obvious


- general ability used in selection of b. Fluency – ease of producing responses
recruits c. Flexibility – variety of ideas shifting from one
- guide training & selective decisions approach to another
d. Elaboration – richness in detail in verbal explanation
b. Group test in Schools
90% - group intelligence administered tests (school Critics: Group Intelligence Test: may concrete answer
ability test) can’t stand alone for class assignment It doesn’t allow for originality FFE
purposes
10% - individual intelligence test Convergent Thinking (Required sa most achievement
tests)
Group Intelligence Test - deductive reasoning process that entails recall and
- to plan educational goals for all children consideration of facts as well as series of logical judgements to
- instruction – related activities narrow down solutions and eventually arise at one solution
- screen to know who needs individually administered
test Guilford (SOI) – creativity
Divergent Thinking – free flowing, more creative;
Individual Intelligence/Ability Test – placement consequences, universal rules
in special class, program for gifted Remote Associates Test (RAT) by Mednick
- may 3 words tas supply ka ng 4th word na related sa tatlo
School: Torrance Tests of Creative Thinking
1. California Test of Mental Maturity - word-based, picture-based, sound-based
2. Kuhlmann-Anderson Intelligence Test Creativity Tests – not realiable (lower r, est.); highly
3. Henmon-Nelson Tests of Mental Ability susceptible to emotional and physical health, motivation
4. Cognitive Abilities Test Psychoeducational Batteries – test packages related to
5. Ottis-Lennon School Ability Test/Otis Mental educational settings
Ability Test
37
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Chapter 11: Assessment for Education Potential Assessment Device (LPAD)” which is designed to
yield info about the nature and amount of intervention required
Response to Intervention to enhance a child’s performance.

Mid 1970s – specific learning disability: discrepancy sa IQ and Vygotzky – zone of proximal development
sa achievement expected from the child in many areas.
Actual development Level of potential
2007 – disorder in one or more of the basic psychological Level development
processes involved in understanding or in using language, Result of instruction, guidance, intervention
spoken or written, which disorder may manifest itself on
imperfect ability to listen, think, speak, read, write, spell or do Note: Traditional – neutral
mathematical calculations.
Dynamic – not neutral, great diversity of approach (difficulty to
Individuals with Disabilities Education Act (IDEA) – use of judge validity)
process based on the child’s response to scientific, research-
based intervention. Emily Duvall – conducted pilot study with 3rd graders for
dynamic assessment using standardized test; progressive,
RTI Model: multi-level prevention framework applied in provide value and actionable data.
educational settings designed to maximize student
achievement through the use of data that identifies students Achievement Tests
with evidence based intervention & teaching adjusted on the - designed to measure accomplishment/degree of learning
basis of student responsiveness. after learning experience
- a sound achievement test: content valid and reliable
3 Levels:
1. Classroom environement Measure of General Achievement
2. Small group na nagfail Achievement Batteries – covers a number of academic areas
3. Individualized divided into several subjects.
Wide Range Achievement Test IV – Wilkinson & Robertson
Objective: accelerate learning process for all students; identify - reading, spelling, arithmetic and reading
learning disabilities Comprehension

For Implementation: Sequential Test of Educational Progress (STEP) Battery


Problem Solving Model – use of interventions tailored to - used for kinder to g 12
student’s individual needs that are selected by a - reading, vocabulary, math, writing skills, study
multidisciplinary team of school professionals. skills, science and social studies. Behavior inventory
Central Intervention Policy – selected school administrators educational environment questionnaire and
for multiple students. activity inventory

Advocates for Integrative Assessment – multidsicplinary SRA California Achievement Test


approach to evaluation that assimilates input from relevant - kinder to g12
sources.
Location Test/Routing Test – pretest to locale yung
Dynamic Assessment appropriate level of administration
- all ages
- based on test-intervention retect models Wechsler Individual Achievement Test (WIAT-III)
- Budoff, Feuerstein & Vygotzsky, theoretical and aspring - 4 -50
approaches. - schools, clinical and research settings
- 16 substests (di lahat inaadminister)
Budoff – explored differences between deficits indentified by - yield potential data for student achievement in
standardized tests due to differences in education tha mental academic areas like reading, writing, math, listening
deficiency (training improve test performance) and speaking skills

Feuerstein – Mediated Learning (teaching principles and *most appropriate test, depende sa educational objective ng
strategy) modified cognition. Developed “the Learning teacher/school.
38
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
“Musts” Achievement Test: could be used to predict; specific subject
- R & V, minimized bias which you have learned
Aptitude: broads abilities: modes variety of variables
“Desirables”
- easy to administer, score, updated, engaging & relevant to Previously referred as Readiness Test – physical factors,
target test-takers personality factors judged necessary for a child to be ready to
learn. ; replaced aptitude (too general kasi)
Measurement of Achievement in Specific Subject Areas
Pre-School Level
Most Measures – teacher made tests - first 5 years of life, time of profound change
Elementary – basic skills: reading, writing and arithmetic - 18 -24 months – capable of symbolic thought and develops
(silent ad oral reading); test varies dipende sa cognitive language skills
theoryit’s based; comprehension and vocabulary - 2 – vocabulary: 200 words or more

Secondary – cooperative achievement test; separate Tools Used


achievement test per subject area (english, math, literature, Checklist: questionnaire on which marks are made to indicate
social studies, science, foreign language) the presence or absence of a specified behavior, thought,
event or circumstance
College – final exams: covering all what they should know; Rating Scale: form completed by a evaluator to make
advanced placement program; college credit for work judgement of realistic standing with regard to a specified
completed in HS variable or list of variabled
Test of English Proficiency/English as Second Language
- ESL program placemet Child Behavior Checklist – Achenbach (CBCL)
- 1 ½ - 5 years / 4 – 18
Test of Minimum Competency - I nonote yung subject abilities, social relations, school
- designed to ensure na ang HS grads has atleast acquired performance
minimal skills to become a productive member of a society. - specific behavioral and emotional (open ended items for
(ex: ahsge) reporting additional problem behaviors)
- not approved kasi maraming US citizens ang di
makapagbasa ng issues on tests psychometric soundness. 8 Syndormes: Set of Co-occuring Emotional and Behavioral
Problems
College Level Examination Program (CLEP) – Adults 1. Anxious/Depressed
- independent study. Sources other than formal schooling 2. Withdrawn/Depressed
3. Somatic Complaints
Proficiency Examination Program (PEP) 4. Social Problems
- offered by American College Testing program 5. Thought Problems
- assess achievement and skills learned outside classroom 6. Attention Problems
7. Rule-Breaking Problems
Adult Basic Learning Exam (ABLE) 8. Aggressive Behavior
- 17 pataas
- di nakaka complete ng 8 years of formal schooling Connor’s Rating Scales – Revised (CRS-R)
- assessment of achievement sa vocabulary, reading, spelling - to screen for ADHD and other behavior problems
and arithmetic - L:15-20 S: 5-10
- Parent/Teacher Version – 3-17
Curriculum Based Assessment – acquired from formal - Adolescent Self-Report Version – 12-17
teaching - Monitoring ADHD Treatment
Curriculum Based Measurement – type of CBA, local norms
Fact Based – role memory/recall Behavioral Assesment System for Children (BASC 2)
Conceptual – application - identify adaptive difficulties on 16 scales (daily act – study
skills)
Aptitude Tests/ Prognostic Test - self-report of personality (SRP) if may sufficient insight sa
- informal learning or life experiences interpersonal relations, self-esteem and sensation seeking
Label: depende sa use ng items
APGAR – 1 minute of after birth → how well the infant
39
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
tolerated the birthing process - test battery that assesses the development of the reading and
5 minutes after birth → how well the infant mathematical skills important in the early stages of formal
adapted to the environment school learning

A – Activity (muscle tone) Level 1: individual : beginner to mid kinder


P – Pulse (heart rate) Level 2 : group : end of the kinder to grade 1
G – Grimace (reflex irritability)
A – Appearance (color) - oral administration, untimed, 90 mins
R – Respiration - with practice test (pwede iadminister few days before)
7 - 10 = normal
7 - Below = assistance Secondary School Level
0 – 3 = neurological deficit
Scholastic Aptitude Test
Parent Attitude Questionnaire – Parke and Sawin - college selection process, hs guidance and job placement
- 3 month birthday counselors
- to assess maternal attitudes both in the hospital and - help students decide wether further academics, vocational
approximately 3 months after birth training would be adviseable.
- attractive infants: more positive treatment (inexperienced
moms) 1. Multipart test (reading, writing, math)
2. SAT subject test (achievement sa different subjects)

Psychological Test Issues SAT & HS GWA: Best predictor of college success
Adverse effects
Pre-schoolers – short attention span
Ideal: test materials are colorful, engaging and
attentive sustaining 1 hour (higher item, higher American College Test (ACT)
fatigue and higher distraction) - same halos sa SAT
- curiculum based
Dual Easel Format: sample and teaching items for each *Both ACT and SAT are highly correlated with general
subtest dichotomous scoring intelligence
- predictor of both creativity and academic success
Infant Intelligence Test Kaufmann: suggested the inclusion of creativity items (to
- identify developmental disability/deficits (along with other reduce bias)
sources of info)
- wide application in research College Level and Beyond
- specialized early educational experiences in measuring
outcome of educational, therapeutic or pre-natal care Graduate Record Exams (GRE)
interventions - with general test forum and specific subject test
- does not predict adult intelligence - valid predictor
- identify children who are in low range of functioning & in need
of intervention Millers Analogies Test
- 100 item multiple choice analogy
Other Measures: - general intelligence, vocabulary, academic learning
Child Sexual Behavior Inventory - one of the most cost-effective of all existing aptitude tests
- 38 item behavior checklist that may be helpful in identifying when it comes to forecasting success in graduate schools.
sexually abused children as young as 2.
MCAT – physicians wanna-be
Elementary School Level Seashores Measure of Musical Talents – musical aptitude;
with pre-recorded tape
Metropolitan Readiness Test (MRT 6) Horn Art Aptitude Inventory – artistic aptitude
- school readiness test that (provide educators) assess
children’s readiness and aptitude for formal education Diagnostic Test

Medical Orthopedics: Binet and Simon


40
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
- intelligence test data can be used to improve learning - also used CHC Model (more dominant)
Evaluative Purposes: making judgements/decision
Diagnostic Info: pinopoint difficulty (remedial purposes) Woodcock-Johnson III (WJIII)
- do not necessarily provide an answers to “why” is there a - consist of two co-normed batteries: test achievement, test of
deficit cognitive abilities (CHC)
- tinatake nung may perspective problems sa specific area - 2 years to 90+
(mas simple yung items involved) - yield G, specific cognitive ability, achievement, scholastic
aptitude and oral language
Reading Tests - may be used to diagnose SLDs and to plan educational
- Stanford Diagnostic Reading Test programs and achievement
- Metropolitan Reading Instructional Test Pag-isa lang: screening/binet reevaluations
- Diagnostic Reading Scales Pag dalwa: more comprehensive and detailed
- Durell Analysis of Reading Test assesment, composed of diagnostic info

Woodcock Reading Mastery Tests – Revised (WRMT-III) Other School of Assesment in Educational Settings
- paper and pencil
- measure of reading readiness, achievement and difficulty Performance Task – work sample designed to elicit
-15 – 45 minutes representitave knowledge, skills and values from a particular
- 4 ½ - 80 years old domain of study.

Phonological Awareness, Listening, Comprehension, Oral Performance Assesment – evaluation of performance task
Reading Fluency according to criteria developed by experts
- provide test takers reading related strengths and weaknesses
and for remidiation purposes Portfolio – synonymous work with other people

Math Tests Portfolio Assesment – evaluation of one’s work


sample
Stanford Diagnostic Mathematics Test (SDMT-4) - freedom of person to select the content of
- provide useful diagnostic insights sa math abilities of children portfolio. (most important)
just entering to school to just entering to college.
Authentic Assesment – more performance – based
Keymath 3 Diagnostic System assesment
- 4 ½ - 21
- comprehensive blueprint reflecting essential mathematics assess giftedness, reading
content, existing curriculum priorities and national math college: assist on career decisions
standards
ADV: engaging students in the assessment process,
Psychoeducational Test Batteries opp to think generatively, encouraging then to think
- contains two types of test about learning as an on-going and integrated
- abilities related to academic success process
- educational achievement in reading and arithmetic
- normative comparisons
- evaluation of test takers own strength and weakness. DISADV: kawawa yung non-creative students, time
(Educational invterventions) consuming sa evaluator and scoring (lower interact
reliability)

Kaufmann Assesment Battery for Children (K-ABC) Authentic Assessment (Performace-Based Assessment)
- 2 ½ - 12 ½ - evaluation of relevant, meaningful tasks that may be
- used Luna’s information processing skills: conducted to evaluate learning of academic subject matter but
1. simulatneous that demonstrate the student’s transfer of that study to real
2. sequential world activities
3. Verbal Comprehension/Reading Achievement - increase student’s interest and the transfer of knowledge to
-product achievement settings outside the classroom
KABC III – 3 -18 (dual theoretical basis)
41
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
DISADV: might assess prior knowledge and experiences, not 1. Delay Avoidance
simply what was learned in the classroom 2. Work Methods
3. Study Habits
*confounded by innate skills of the person* 4. Teacher Approval
5. Education Acceptance
Peer Appraisal Techniques 6. Study Attitudes
Peer Appraisal Methods – asking individual’s peer group to - skill score, attitude score, total criterion score
make the evaluation
ADV: can provide info about person’s difficulties na di nakikita
ng person in charge, allows multiple perspectives

Group Dynamics (knowledge of individual’s place in the


group) – to guide the group and reach it’s optimal efficacy.

“Guess WHO” – elem. Caution if negative attribute

Nominating Technique – individuals are asked to select or


nominate other individuals for various activities

Sociogram – graphic method of organizing peer appraisal


results; provides info such as “who is popular in the group”,
etc.
Note: Individual’s perception of a group are constantly
changing

Measuring Study Habits, Interests and Attitudes

Academic Success – ability and motivation

Study Habits Checklist


- for grade 9 – 14
- assesses study habits such as notetaking, reading material,
general study practices
136 Phi Beta Kappa
“Best/Good students are the best judges of important and
effective techniques”

What I Like To Do Inventory (Interest)


- 150 forced choice items
- academic interests, artistic interests, occupational interests,
interests sa leisure time activities
- suggestions for designing instructional activities

Survey of Study Habits and Attitudes (SSHA) and Study


Attitudes and Methods Survey
- grade 7 to college/ 100 items
- tap poor study skills and attitudes that could affect
performance

Form H: grade 7 to grade 12 Form C: College


Both 20 -25 minutes

- users likert scale


6 Areas:
42
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Chapter 12: Personality Asessment: An Overview *Hippocrates – phlegmatic, choleric,
sanguine, melancholic
Laypeople: individual’s make up which elicits good or bad *Carl Jung: motivation
reactions from others. Chess Player: introverted, intuitive, thinking and
judging
Personality – individual’s unique constellation of psychological *John Holland: RIASEC (Self-Directed Search)
traits that is relatively stable over time - self administered, self-scored, self-interpreted aid
to type people according to their system and to
McClelland – most adequate conceptualization of a person’s offer vocational guidnace.
behavior in all it’s detail - measure of interest and abilities
- Holland’s Theory of Vocational Personality
Menninger – the individual as a whole, height, weight, love
and hates, bp and reflexes, smile and hopes, bowed legs and - occupational choice has a great deal to do with
enlarged tonsils, it means all that anyone is and he is trying to one’s personality and self-perception of abilities
become’ - critcs: self-biased (valid various sabi ni Holland)
*Friedman-Roseman – Jenkin’s Activity Survey
Byrne – personality psychology is psychology’s garbage bin in Type A – competitive, haste, restless, impatient,
that any research which doesn’t fit other existing categories tune pressured, need for achievement and
can be labeled personality dominance
Type B – mellow/laidback
Personality Assessment – measurement and evaluation of *MMPI
psychological traits, states, values, interests, attitudes and etc. Profile – narrative description, graph, table of
extent to which a person has demonstrated certain
Psychology Traits targeted characteristics as sa result of
administration of tools of assesment.
Gordon Allport – generalized neuropsychic system with the Personality Profile – target character are traits,
capacity to render many stimuli functionally equivalent, and to states and types
initiate and guide forms of adaptive and extensive behavior.
Personality States
Robert Holt – real structures inside people that determine their 1. Psychodynamic: Ego, Id, Superego
behavior in lawful ways. (structure pertains to biochemical 2. Temporary
substance that modifies learning)
State Trait Anxiety Inventory (STAI) – Charles D.
Cattell – traits are mental structure pero walang physical Speilberger
status - designed to distinguish various states from traits

Guilford – any distinguishable, relatively enduring way in


which one individual varies from another WHO?
Self Report
*consider content on which behavior is displayed (situation- - sila mismo magproprovide ng info
dependent) - diaries, responses to oral/written questions/ test items
- private info
Robert Del Vecchio (152 longitudinal studies)
- 50 – 59 years old- consistent/peak Sexual Sensation Seeking Scale
- only the samples could provide highly personal info needed
Traits that Lack Consistency – honesty, punctuality,
conformity and attitude toward authority and Limitation
introversion/extroversion. - we cannot verify their reports

Psychological Type COMMONLY USED TO ASSESS SELF CONCEPT (one’s


– constellation of traits that is similar in pattern to one identified attitudes, beliefs, opinions about oneself)
category of personality within a taxonomy of personalities • Beck Self-Concept Test
- description of people - respondent are asked to compare themselves
Examples:
43
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
to other people on variables such as looks, Impression Management – attempt to manipulate other’s
knowledge, ability to tell jokes impressions through “selectove exposure of some information
to suppresion of other infos”
For Children: Enhancement – claiming of positive attributes
• Piers-Harris Self-Concept Scale Denial – repudiation of negative attributes
- 80 self statements (3-12 grades) Self-Deception – tendency to give favorably biased but
- covers behavior, intellectual and school status, honestly held self-description.
physical appearance and attributes, anxiety,
popularity, happiness and satisfaction *ALL AFFECTS VALIDITY*
Validity Scale – subscale of a test to know how honest they
• Tennessee Self-Concept are sa pag sagot o baka dahil sa responses style,
• Beck Youth Inventories (BYI II) – Disruptive carelessness, deception, unintentional, unconsidered standing.
Behaviors
- also measures depression, anxiety, anger, disruptive WHERE?
behavior in children and adolescents aged 7 to 18 - schools, clinics, hospitals, academic research laboratories,
years employment counsellings, vocational selection centers, offices
of psychologists/counselors.
Self Concept Differentiation – different self-concepts in - natural settings, behavioral observations.
different roles (lower SCD, higher well-being) more unified and
content senser of self. HOW?
1. Scope and Theory
Another Person as the Referent California Psychological Inventory (CPI 434)
“Child of emotional difficulties” - general inventory of individual’s personality (wide
- useful if knows ng third party yung subject scope)
Self-spouse > Self-peer - yield info about many personality related variables
such as responsibility, self-acceptance and
Personality Inventory for Children (PIC 2) dominance
- standardized interview of a child’s parent (mom usually) - originally concerned to measure personality traits
- true or false across cultural groups and predict the behavior of
- provide chemical information generally well functioning people.

• Different informants may have different properties on Locus – place/site/location


the subject being evaluated Of Control – person’s perception about the sourec of
• Mas mapapabilis mapansin yung difficulties sa things that happen to him/her
YOUNG CHILDREN kaysa sa older ones
• Mas madalas nag-aact na kaharutan (hyperactive, Theoretical: Blacky Pictures Test (Blum)
sadista sa kalaro) kesa overcontrol problems - elicit fantasies associated with various
psychoanalytic themes
Multiaxial Empirically Based Assessment System – - cartoon-like pictures of blachy (dog)
Archenbach - make-up stories in response to such cards.
• Incorporate cognitive/physical assessment of the
subject, self-report of subject, ratings by parents and Atheoretical: MMPI
teacher ADV: allows test users to impose their sum theoretical
preferences to the interpretation of findings.
• Context dependent evaluation
• Cultural content
2. Procedures and Item Formats
Structured Interview – must follow an interview
WHAT?
guide, little leeway in terms of posing questions na
wala sa guide.
Response Style – tendency to respond to a test item/interview
Graphology – handwriting analysis which provide
question in some charactersitic manner regardless of the
insights into personality (lower R, lower V)
context.; it can also give us cue about test taker’s personality
3. Frame of Reference

44
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
- aspects of the focus of exploration such as first - yung strenght ay nakadepende sa large population
frame as well as other contextual issues that endure pag cinompare
people, places and events
Ipsative Approach
Sort Technique (by Stephenson) - icocompare sa other traits of same individual
- most descriptive → to least descriptive
- used by Carl Rogers: Idealist → perceived actual 5. Issues on Personality Test Development and Use
self/ reevaluation Validity Score – tutol sila McCrae and Costa (pwede
naman daw external sources)
Leadership Q Test – Military Settings
Tyler Vocational Classification System – desired - Personality trait may have different meanings to
occupation other culture
ADV: case of application, wide range of purpose in - issues on applicability norms
clinical and research (used in measuring attachments
in rhesus monkeys) Minority Grades – Higher Psychopathology MMPI
(possible na accurarte naman) - depende sa size ng
Adjective Checklist Format sample and appropriateness of stats (extracurricular
- check mo yung adjective na nag apply sayo factors), cross cultural research
- simple and adaptable
- pwede pang measure ng state, traits and self- Culture & Learned Meanings – Issues
perceptions

Sentence Completion Format Developing Instruments to Assess Personality


- complete/fill in the blanks 1. Logic and Reason (Content-Oriented Approach)
Ex: Rotter Incomplete Sentence Blank - may dictate what content is covered by the items
Ex: WWI/PDS – Personality and Adjustment
Semantic Differential (Bipolar Adjectives) Problems
- strength, degree, magnitude (self-concept
descriptions – opinion surveys) Woodworth Psychoneurotic Inventory
- ellicit self-report of fears, sleep disorders and other
Forced Choice Format problems deemed sympathetic of a pathological
- choices are equal in social desirability condition – psychoneuroticism
Ex: EPPS Higher Problems – Higher Psychoneuroticism

Embedded-Figures Test In Many Items


- find hidden figures (held d/ind) - helpful din ang research
- clinical experience
Hand Test – Projective Test - SMEs
- what they think the hands might be doing
2. Theory – items will be approached depending on the
4. Scoring and Interpretation theory applied
Nomothetic Approach – a limited number of 3. Data Reduction Methods
personality traits (exists in a lot of people) can be - Factor Analysis/Cluster Analysis
applied to people; exists in all people in varying
degrees Cattell’s 16PF
Idiographic Approach – individual’s unique Surface traits – 36
constellation of personality traits with no attempt to Source traits – 16PF
characterize each person according to any particular
set of habits.; more flexible; case study and personal (had Cattell’s fire but stull believed in the primacy of
records his 16PF)

Normative Approach – Most Common in Personality Waller and Zavalla – seven factor model
Assessment
The Big Five (NEO-PIR)
45
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
- clinical applications and research on personality Empirical Criterion Keying
assessment Criterion Control
- 5 domains + 30 factors With presumed normal ind (random)
- 17 years and older diagnosis (standardization
sample)
O – Openness – Intellect Factor – activr imagination,
aesthetic sensitivity, attentiveness, intellectual yung nagdiscriminate na items
curiosity, independence of judgement yung ireretain

C – Consciousness - planning, organizing Validity Scales


L – Lie Scales (I do not always tell the truth/ gossip sometimes)
E – Extraversion – sociability, practivity and F – Frequency/Infrequency – Higher F – di sineryoso (faking
assertiveness bad/eccentric)
K – Correction Scale – Higher K – defensive
A – Agreeableness – interpersonal tendencies, Lower K – excessive self-criticism
sympathy, friendliness, others see similarity - used to correct scores on 5
decimal scales
N – Neuroticism – emotional stability, coping in
stress ? – cannot say – failed to mark any response
- indecisiveness, defensiveness, carelessness, lack of
BFI (Big Five Inventory – 44 items) – non- experience relevant to the items
commercial purposes to researcehs and students - 10 – 30 – di na mainterpret (to solve: I-clear sa
instructions)
TIPI (Ten Item Personality Inventory) – two item - reported in T-scores
each dimension
MMPI Content Scales – Wiggins Content Scales
FFNPQ (Five-Factor Non-verbal Personality - group of test items in similar
Questionnaire) – showing respondents illustrations content
of behaviors of BIG 5 ( twins: similar (due to shaped
genes))
Supplementary Scales (Factor Analysis)
Criterion – standard on which a judgement or division Ex: Harris-Lingoes Scales
can be made - with labels such as broading and social alienation
Criterion Group – reference group of test takers who - designed to be more internally consistent than the
share specific characteristics and whose responses to umbrella scale from which the subscale was derived
test items serve as standard according to which items
will be included in or discarded from the final version Paul Meehl – proposed 2 point scale; 2 highest most
of a scale pathological scores, interchangeable.

Empirical Key Ring – process of using criterion Welsch Codes – ex: 6’78” 1-53/4.2 #90 FL-/K
groups MMPI Clinical and Validity Scores
- higher heteroginity
- lower internal consistency MMPI 2
- does not have to rely on logic and reason – major difference: more representative
standardization sample
Minnesota Multiphasic Personality Inventory - 567
- originally called as medical and psychiatrical inventory - 18 years old
- by Hathaway and McKinley - added: type A, drug use, suicide potential, mental
Adjustment, attitude towards work
MMPI - microcontent scales
-566TF
- aid in psychiatric diagnosis with adolescents and adults 14 3 New Validity Scales
and above 1. Back Page Infrequency (Fb)

46
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
- for candid, deliberate and diligent test takers na may - MMPI 2 RF
tinatago
- detects random/inconsistent response pattern MMPI-A
- sa MMPI versions noon, higher scores ang adolescents sa
2. Trin Scale – predictable clinical scales kaya mas mukhang nagsusuffer sila sa
Ex: a, b, a, b, a, b / a, a, a, b, b, b psychopathology
- 14 to 18 years old
3. Vrin Scale – unpredictable/ random responses - 45 to 69
- most widely used measure of psychopathology in
James Butcher – Senior Author of MMPI 2 adolescents
- S-scale – detect self-prevention in a superlative manner

Paul R. Less-Haley Personality Assessment and Culture


- Faking Bond Scale (FBS) – detect maligners who submitted
bogus personal injury claims Acculturation – on going process by which an individual
thoughts, behaviors, values, worldview and identity develop in
Criticized: Butcher – bala naman genuine relation to general thinking, behavior, customs and values of a
physical/psychological problem particular cultural group

Criticism of Original MMPI – di represented ung buong US Value – those that match an individual prizes or the ideals an
Pop individuals believe in.
Instrumental Values – help one attain some
MMPI Retructured Form/FR – Tellegen objective (honesty, imagination, ambition and
Goal: make clinical scales more distinctive cheerfulness

Terminal Values – end point objective (comfy life,


Response sa 2 Basic Problems exciting life, sense of accomplishment and self-
1. Overlapping Items – lower distinctiveness/ respect)
discriminant validity of individual scales, hirap I
interpret ng high scores kase overlapping. Personal Identity – set of cognitive and behavioral
2. Conceptual in Nature – Factor: common sa characteristics by which individuals define themselves as
psychopathology (anxiety, abuse, despair, members of a particular group; one’s sense of self
maladjustment)
Identification – process by which an individual assures a
James Frank – Demoralization – failure to cope with stress ; pattern of behavior characteristic of other people and refered to
common factor in psychopathology it as a central issue the minority group must deal with.

Steps: Worldview – unique way people interpret and make sense to


a. Alamin ang core components ng C.S their perceptions as a consequence of learning experience,
b. Gawa ng revised scales to measure core components cultural background and related variables
c. Gawa ng item port if revised

Extract Demoralization Factor (gawa bago)


- measures a broad, emotionally colored variable that
underlies much of the variance common to mmpi – 2
clinical scales

Added New Scales – indirectly assessed like suicidal


ideation

Higher Order Scales:


- emotional/interanalyzing dysfunction
- thought dysfuction
- behavioral/externalyzing dsyfunction
47
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Chapter 13: Personality Assessment Methods - for scoring and interpretation and administration presented in
numbered order from 1 -10
Objective Methods
- short answer items Inquiry – Second Administration
Ex: Anxiety Inventory - inaalam ng examiner yung features ng inkblot tna dahilan ng
1. I stay calm as always (TRUE) issue on validity pagformulate ng test-taker ng PERCEPT.
- for clarification
Both Ability and Personality Tests are: - for scoring and interpretation
- can be answered quickly - inaalala kung nareremember pa ba, nabago o na retain yung
- if well written ang item, little explanation percept.

How Objective is Objective? Testing the Limits


Ability test: true, one correct answer - points out the specific area of the inkblot that will provide
Personality: walang tamang sagot additional info about the personality functioning
• Identify any confusion or misunderstanding
Issues: concerning the task.
- objective nga pero ililink sa psychoanalysis theory • Keri ba ng test-taker na i-refocus yung percepts given
- self-report a new frame of reference
• If na-anxious ba sila nung una, will they perform betts
*objective if shorthand description of test portrait sa added structure? *

Projective Methods *elicit one last response – provide source of questions and
- technique of personality assessment in which some infernece applicable to treatment considerations
judgement of the assessee’s personality us made on the basis Location – part of the inkblot where the percept was formed
of performance on a task that involves applying some sort of Determinants – qualities of the inkblot that determine what the
structure to unstructured or incomplete stimuli individual percieves
- indirect Content – human figures, animals, anatomical parts, clouds, x-
- minimized faking rays and sexual responses
- minimized for proficiency in english Popularity – frequency ng responses obtained from general
- minimized cultural bias regulation
- major advantage: tape both conscious and unconscious Form – how accurately the individual’s perception matches the
corresponding part of the inkblot
Lawrence Frank : coined “Projective Method”
- most important things about an indivdual are what he cannot Whole Responses – conceptual through process
or will not say. Form Level – reality testing
Psychotic – lower form level scores
Wlihelm Stern’s Cloud Picture Test Human Movement – creative imagination
- ano nakikita nila sa pics ng clouds Color Responses – emotional reactivity

Outbreak of Projective Test: rebellion against normative data; Alternate Form of Test: The Behn-Rorschach
attempt to break down the study of personality into study of - analogous series of plates
specific traits with varying strenghts Note: exner challenge the axlom that reliable cures first before
- purely clinical perspective validity
Paradox: nagiging nano-referenced yung interpretation
- madalas gamitin sa forensic work and generaly accepted by
Rorschach Inkblot Test (1921) the court
- from interpretation test yung inkblot (YES) clinical and research purposes
5 Inkblots : B&W (NO) academic respect
2 Inkblots : B&W + Red
3 Inkblots : Colored Holizman Inkblot Technique
*inappropriate yung computer administered - standardized, designated to be more psychometrically sound

Comprehensive System by Exner Pictures as Perspective Stimuli

48
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Early 20th Century GCR – Group Conformity Rating – degree of sameness sa
- girls – religious and moral themes standardization group
- different sa gender, age
- experiments used it to gauge children’s APPERCEPTIVE PERSONALITY TEST
imagination - with objectivity in scoring
- with fantastic/ unreal scores
Social Situation Picture Test - unrestricted emotional tune
- used to juvenile deliquents
Words as Projective Stimuli
Thematic Appercetion Test Word Association Test by Rapaport
- apperceive: to perceive on terms of perceptions - assessee verablizes the first word that comes in responses to
- originally designed as an aid to eliciting fantasy material from a stimulus word
patients in psychoanalysis - semi-structured, individually administered, projective
- 30 pictures (B&W) – certain classical human situations + 1 technique of PA that involves presentation of list o stimuli
blank words. Mother – Love
Cover of Story - personal experience, dream, imagined
event, book, almost everything Galton – first investigated WAT
- 20 suggested no. varies Cattell and Bryant – first to use cards with stimulus words
Scoring: printed on them
1. Stories Kraeplin – effects of physical states and practice on word
2. Manner of Deliquency association
3. Extra-test Behavior and Verbalizations
- 60 words
Need – determinants of behavior from within - assessed in popularity, reaction time, content and test-retest
the individual environment responses
Press – environmental force
Thema – interaction between need & press Kent-Rosanoff Free Association Test
- attempts develop a standardized test using words as
Protagonist – identifying with someone projective stimuli
- reveals concerns, hopes, fears and desires - 150 words
- affected by creativity, age, education and SES
Self-Report – Self-Attributed Motives
TAT – Implicit Motives – unconscious influence or
behavior required in the basis of experience SENTENCE COMPLETION TESTS
- Face validity, most suscpetible to faking
High Agreeableness – unaffected by exposure in aggression
• Washington University Test
*susceptible to faking o Based on writings of loevinger and her
* if yung narrator ay yung actor mismo, baka i-avoid yung collagues in the area of self-concept
conflicts development

• Rotter Incomplete Sentences Blank


HAND TEST o Most popular standardized sentence
- affection, dependence, aggression completion test
ROSENZWEIG PICTURE – FRUSTRATION THEORY o 69 to Adulthood
- cartoons – frustrating scenarios o 3 Stages
Intropunitive – self-harm ▪ HS – G9 to G12
Extrapunitive – hurts others ▪ College – G13 to G16
Inpunitive – evaded ▪ Adult
o 40 items
Obstacle Dominance – focused sa frustration o Family attitudes, social and sexual attitudes,
Ego Defense – protecting the frustrated person general attitudes and character
Need Persistence – solves frustrating problem

49
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
o 7 point scale = need therapy – extremely - Examiner’s pressure
good adjustment - Age of the examinee
- Specific Instructions
Sounds as Projective Stimuli - Subtle reinforcement are provided by the examinee
BF Skinner - developed an auditory projective test “a deuce - Subjects utilized every available cue in the testing
for snaring out complexes” situation, including cues related to the actions ir the
- Rosenzweig + Shannon – tautophone appearance of the examiner
Auditory Apperception Test – Stone – creates a story based - Examiners appeared to interpret projective data with
on 3 sounds played regard to their own needs and expectations. Their
Azzageddi Test – Henry Murray – presents spoken own subjective feeling towards the test taker and their
paragraphs own constructions regarding the total test situation.

Issue: if di nagdifferentiate, walang satisfying na scoring, For Objective Test:


redundant and onti lang napipiga - clinician training, with perspective, patient’s social
class, motivation to manage a desired impression
FIGURE DRAWING TEST
- quick and easily administered 3. Psychometric Soundness
- assess personality, intelligence, neurological intactness, - cincriticize yung uncontrolled variations, in protocol
usual motor coordination, cognitive development, learning length, inappropriate subject samples, inadequate
disabilites control groups, poor external criteria, as factors
- can be administered in group or individual contributing to spiniously increased ratings of validity.
- inappropriate yung test retest and split half
Karen Machover – figure drawn was related to impulses,
anxiety, conflicts anf compensation characteristics Objective – sturctured - tap conscious aspects of
personality
Good Enough Harris Dapt – may second sheet (opposite sex Projective – constructured – beyond conscious
yung una) awareness

House - Tree – Person Test Patient – examiner relationship – most robust


Kinetic Family Drawing – relation sa family predictor f psychotherapy intervene

Critiques on Projective Methods


1. Assumptions Behavioral Assessment Methods
- (Murstein) very wrong daw na the more analyses, Sign Approach
the more na maraming marereveal about their - traditional approach to psych assessment
personality - test responses are desired to be signs or clues of underlying
- Marami paring factos/variables such as the personality or ability
environment, response set, reactions to the examiner
and factors contributing to response patterns Sample Approach
- Idiosynchratic nature of responses sa projective - focuses on the behavior
stimulus; uniqueness ng responses - not as a sign but as a sample to be inspected in it’s own right
- Questionnable din yung assumption na projection is
greater onto stimulus material na same subject Behavioral Assessment
- what a person does in situations
Note: research was less prone to faking than - not linked to any theories of personalities
Tenessee Self-Concept Scale - documented evidences
Issue: baka insights of unique perception and
organizing novel situation ang mapiga, hindi - Higher percentage of color responses on research
personality percentage 90 + IQ = positive verbal contingencies
- Higher percentage of movement (three dimensional)
- Nag-eexist daw ang unconscious (based kase sa responses on research percentage 90 + IQ = negative verbal
psychodynamic) contigencies

2. Situational Variables Traditional: describe, classify, diagnose


50
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Behavioral: formulation of specific treatment program c. Analogue Studies
- research investigation in which one or more
Frequency/Event Recording – nirerecord each time the variables are similar or analogous to the real variable
targeted behavior occurs. that the investigator writes to examine
Interval Recording – only during predefined interval of time Ex: rats – humans
Students – employees
Intensity
- gauge by observable and quantifiable events such as Analouge Behavioral Observation
duration - specific behaviors
- ratio/percentage of time na nag occur - photos, videos and live

a. Timeline Followback Methodology d. Situational Performance Methods


- originally designed sa clinical interview for assessing - procedures that allows for observation and
alcohol abuse evaluation of an individual undera standard seto of
- prompted with memory aids to assist recalling of the circumstance
targeted behavior during the defined timeline - performance of some specific task under actual and
- to identify antecedent conditions. simulated condition
Common: mas okay iobserve kesa sabihin
b. Ecological Momentary Assessment
- record behavior not retrospectively but as they occur Leaderless Group Technique – individual
- for analysis of immediate antecedents initiative, cooperation, leadership and related
variables
OCD – naturalistic observations
- provides an opportunity to observe the degree
Varieties of Behavioral Assessment of cooperation exhibited by each individual
a. Behavioral Observation and Rating Scales group member and the extent which each is
- involves watching the activity of the targeted clients/ able to function as a part of a team.
maintaining records of their activites - used in military and industrial and individual
Settings; to identify people who work well
Video Recording – kahit hindi physically present together and those with superior managerial
Behavior Rating Scales – preprinted sheets in which skills and executive potential.
observer notes all the presnces and absences of
targeted behavior e. Role Play
Code – less time to enter data - acting an partially/improvised past in a simulated
situation
b. Self-Monitoring - teaching, therapy and assessment
- act of systematically observing and recording
aspects of one’s own behavior or events related to the f. Psychophysiological Methods
behavior - ginamit ng behaviorally – oriented clinicians to see
- observation and behavior of the clinical interest at the relevance of physiological factors and
the time/place of it’s natural occurrence psychological and it’s interchangeable effects.

Self-Report – standi-ins; verbal Biofeedback – best known P.P measures used to


descriptions, different sa time/place psych; designed to gauge, display and record a
where it occurred naturaly continouos monitoring of selected biological
process such as pulse/heart rate
- depende sa competence, diligence and motivation
ng assessee Phallometric Data – record na nakukuha sa poiunt
- bot t a tool for assessment and intervention plethymograph, para ma encourage na mag open up
yung denial na sexual offenders; can be applied to
Reactivity – possible changes in an assesee’s treatment and program evaluation
behavior, thinking or performance in the presence of a
schemes Polygraph/ Public Eye’s Best Lie Detector -records
are called before as tracing, graph, chart, polygram;
51
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
respiration, salvanic skin response, blood volume,
pulse rate; yes or no questions; not standardized,
high false-positve for lying

g. Unobtrusive Measures
- telling a physical trace/record
- wrappers left – calorie intake
- do not necesssarily require the presence of
respondents

Issues in Behavioral Assessment


- mas appropriate yung generalizability theory sa behavioral
assessment kesa sa personality
- dapat may interrates reliability

Person 1 Person 2
- cum laude - average student
- top 1 in boards (RPm) - RPm board passer
-excellent in communication - average in communication
skills skills

Higher Scores Lower Scores


Contrast Effect

To solve: training, composite judgement (average of multiple


rates)

Reactivity – use hidden cameras; recording devices instead

Clinical Approach
- relieved heavily in the clinician’s judgement as intuition
- lack of preset and uniformly applied rules for drawing clinical
conclusions and making predictions
- internalized, less formal, more creative version of A.P

Statistical/Actuarial Approach
- statistical standardization, norms, preset, uniformity applied
rules and procedures
- for most attempting patterns
- retention only of proven hypothesis

Mechanical Approach
- Computer devices

52
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Chapter 14: Clinical and Counseling Assessment Disordered behavior → dahil sa sakit/ role-playing/
- interview and test data will be supplemented with case
history data if the patient will not or cannot cooperate DSM IV – Categorical
DSM V – Dimensional, continouos approach
Premorbid Functioning 15% higher r; 37% higher v
- functioning prior to development of a disorder
- case history data If may PD, possible na may other disorders
- protypical (DSM V)
Higher IQ – good candidates for insight oriented methods that Categorical : Stereotypical
require high levels of abstract ability ASPD, Borderline, Narcissistic
OCPD, Avoidant
Normal Test-Taker – interview/personality tests Personality Disorder Trait Specified (PDTS)
- take into account the degree of disturbance that
Field - Dependent Cognitive Style – CBT Approach existed with regard to a particular personality trait
Field – Independent Cognitive Style – Humanistic Approach
Fatalism – what happens in life is largely beyond a person’s
control
Objective of Clinical Assessment – diagnose mental Self-Efficacy – confidence in one’s own ability to accomplish a
disorder task
DSM – diagnose/describes all known mental disorders Social Support – expressions of understanding, acceptance,
empathy, love, advice, guidnace, care, concern, trust from
Critiques in DSM IV TR friends, family, community caregivers or others in one social
- rooted in medical model environment
- patterns of thinking are not described
- unreliable Therapeutic Contract – agreement between client and
- no treatment offers theraphist setting forth goals, expectations + mutual obligations
- culturally insensitive (dissociative disorder) with regard to a course of therapy
- DSM IV TR – descriptive and atheoretical
Note: head noddin; “wary kasi baka ma re-inforce tas puro
DSM III – Jerome Wakefield (mental disorder) yung topic ng chika nya”
- harmful dysfunction
- harmful failure of internal mechanisms to perform their
naturally selected function Types of Intervention
- evolutionary view of MD
- acquired thru Darwinian process and natural Content – general (getting-to-know-you), specific narrow
selection Structured – prepared in advance; uniform method of
- na acquire explanation/evaluation
- harmful
Structured Clinical Interview for DSM IV
- semi-structured interview for diagnostic decision
Pecking Order (Scajelderup – Ebbe) Making
- withdrawn (depressed) - axis 1
- 1 to 2 hours
Social Rank Theory of Perception
- if talo, nagdidisengage so he/she is no longer competitor for SCID II
resource/status - axis II
- depressed + anxiety (vigilant and preoccupied with others - 30 mins to 1 hour
who are percieved as more powerful), suicide ideation
Schedule for Affective Disorders & Schizoprenia
Evolutionary Perspective – to protect self for future harm; - mood disorders and schizoprenia
involuntary defeat strategy; di sya helpful in the long run
Structured Interview of Repointed Symptoms
Klein – di daw true ang evolutionary perspective; - defect malignery

53
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Tone – Stress Interview - more discriminating in determining symptooms severity in
- place the interviewee in a pressure state both college and depressed outpatient sample
- ability to remain cool under pressure
Test-Battery – group of test administered together
State of Consciousness – Hypnotic Interview Standard Battery – 1 IQ, 1 PT, 1 Neuropsychological test
- less accuracy in recall
- more confident but inaccurate David Rapaport – no more tests, the better
- more suggestible to leading questions
- more vulnerable to distinction of memories Culturally Informed Psych Assessment
- considers acculturation, values, identity, worldview, language,
Focused Mediation Technique etc. sa evaluation & interpretation of data
- increased memory yet was resistant to report misleading - read any existing case history data
information - interview family, friends, professionals
- translator – usually family members (minsan di pwede pag
Cognitive Interview may bata)
- rapport
- use imagery & focused retrieval to recall information Shifting Cultural Lenses
(describes events in the viewpoint of perpetrator) - Steven Regeser Lopez
- think alone without interruption
- higher recalling accurate details A – age
D – disability
Collaborate Interview R – religion
- halos no boundary na E – ethnicity
- active perspect of client S – social status
S – sexual orientation
MSE (Oriented Times Three) I – indigency
Insight – alam nya kung bakit sya nainterview N – natural origin
G – gender
Standardized Interview – higher interrate in reliability;
systematic, specific and clear Managed Care
- doesn’t consider culture
DSM II – vague, lower interactions - unmet ethical consideration
DSM III – detailed/specific - low cost, lower quality

Structured Interview – more accurate in predicting criterion Self-report – impression management


(behavior of probation)
Written test – admit illegal acts Reacculturation – new sense of identity after recovery

Case History Data – existing documents/records; helpful sa Forensic Psych Assessment – theory and application of
pag-iinterpret sa other data psych evaluation & measurement is legal context; not highly
motivated/truthful
Millon Clinical Multiaxial Inventory III - competent
- enduring personality traits & acute symptoms - criminally responsible
- uses multiaxial system - ready for parole

Beck Depression Inventory Dangerousness to Oneself and Others


- self-report - history of previous attempts to commit the cut
- depressed higher scores (compared sa anxiety, adjustments) - drug/alcohol abuse
- more sense, higher kesa sa lower scores - unemployment
Limit: no validity score - ideation, motivation and imaging
(MMPI) - availability & lethality of methods and means
- how specific/detailed
Center for Epidemological Studies Depression Score - see social support
(CES-D)
54
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Duty to Warn Psychopath – four (4) tries more likely to fail in relive from
- predicition of dangerousness – moves out of science; not that peers
accurate
Emotional Injury/Psychological Harm/Damage
Competency to Stand in a Trial - mental suffering, pain, emotional harm
- defendants abilty to understand the charges against him & - discrimination, harassment, malpractice, stalking, unlawful
assist in his own defense termination of employment
- sufficient present ability to consult with his lawyer of a - analyze insturmental validity (interview, case study)
reasonable degree of rational and factual understanding of the
proceedings against him Profiling
- physically and mentally present - investigative psych
- right to choose and assist counsel - crime solving process that draws upon psychological &
- right to act as a witness on one’s own behalf criminological expertise applied to the study of score evidence
- right to commit opposing witness - dapat may inconvinience sa culture ang profiler

*Possible na humarap sa court ang M.R, psychotic, Custody Evaluation – assessment of parental capacity;
neurological disorder basta competent aspect of child-rearing, IQ, PT

Factual Criteria – defendants ability to understand the - under 5 years old – lesser weight
charges against/relevant preceedings - Fiqure drawing tests
Inferential Criteria – defendant’s ability to communicate with - structured exercises with dolls
counsel or informed decision - TAT

The Competency Screening Test Neglect – failure to give universal degree of care
- sentence completion format Abuse – conditions that will give use to abuse of a child
- 3pt scale : 2 – appropriate
1 – meaningfully appropriate Anatomically Detailed Dolls (ADD)
0 – clearly inappropriate - sexually abused, - sex oriented but not significant
- not widely acceptable tool for detecting sexual abuse
- assessee’s clear understanfing of the lawyer’s role
- discriminates seriously disturbed, state-hospitalized men from Child: 2 – 7 highly suggestible
antrop groups consisting of students; etc
Child Abuse Potential Inventory – identify abusers
Fitness Interview Test Parenting Stress Index – stress sa parental role; di pwede sa
- idiographic measure court kasi higher scores not necessarily indicates na abuser
sila.
McArthur Competence Assessment Tool – Criminal
Hotudicthon (McCAT-CH) Psych Report – report findings
Evaluation of Competency (ECPT-R) – nomoethic
Barnum Effect/ Aunt Fanny Effect
M’ Naughten - vague
- right/using test Issues: Paul Meechl – pseudosuccesful
- defect of ream from diseases of the mind
- di nya alam ginagawa nya Actuarial Assessment
- application of stats for clinical judgement
Durham Rule – product of mental disease/defect
Ali Standard – can’t affect the usefulness of the test Clinical Assessment
- application of own concrete training
Rogers Criminal Responsibility Assesments (RCAS)
- systematic and empirical approach to insanity evaluations Mechanical Prediction
- reliability, organic factors, psychopathology, cognitive control - application of ststs sa computer generated findings
- discriminates
- heirarchal decision

55
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
Chapter 15: Neuropsychological Assessment 3. History taking, MSE, test administration
- dapat may knowledge sa neuroana, neurochem at
Neurology – branch of medicine that focuses on the nervous neurophsyio
system and it’s disorders - timing
Neuropsychology – branch of psychology that fcuses on the 4. Note the effects of drugs
relationship between brain functioning and behavior Antipsychotic – parkisonian like
Neurophyschological Assessment – evaluation of brain and May COVER symptoms
nervous system functioning as it relates to behavior
Behavioral Neurology – medical specialty within neurology • A careful history is critical to the accuracy of
that focuses on brain-behavior relationship (with emphasis on assessment (pseudodementia)
biochemistry) • Case history/Archival Records
Neurotology – branch of medicine that focuses on problems o Premorbid fine
related to hearing, balance and facial nerves o Treatment
o Published Cases: useful insight

Left Hemishphere Right Hemisphere Short Portable Mental Status Questionnaire


- most frequent - spatial ad textual - completed & assessors
- reading, writing, arithmetic, recognition
speech - art and music appreciation Nueropschological Impairment Scale – Self Report
- right handedness Mini-Mental State Exam (MMSE)
Nuerological Damage - screen for cognitive impairment (concentration, language,
- pwede maraming ma affect - diffuse orientation, memory and attention)
– focal (one site) - manifests in behavioral
- pathological alteration of effect 7 Minute Screen – alzheimer
tissue due to infection or
injury Nueropsychological MSE
- note behvaior relevant to neuropsychological evaluation
- most useful screening tool (diagnosis)
Neurological Damage – most inclusive (brain, spinal cord,
CNS + PNS) Non-Invasive Procedures – does not involve any intrusion
Brain Damage – general reference to any physical or unto examinee’s body
functional improvement sa CNS (anatomical destruction) - physical and abilities
- scalp/skull (enlargements)
Organicity - muscle tore (soft/rigid); strength and size
- organic brain syndrome (loss of abstraction ability, deficits in - reflexes
reasoning, irritability in problem solving tasks)
- functional consequence of brain damage
Neuropsychological Tests
IQ, PT, Perceptual Motor/ Sensory Test
Functional Deficit -psychological a. General Intellectual Ability
Organic Deficit – structural, physical - Wechsler
Hard Sign – definite (abnormal reflex, cranial nerve damage) - Pattern Analysis (pattern of test scores)
Soft Sign – merely suggestive (unable to copy stimulus, minor Lower block design – damage in right
sensori-motor, deficits + discrepancy v & NV performance) hemisphere
- Deterioration qoutient /index – patterns of subtest
scores

Neurologist Neuropsychologist Possible na ma-alter and test manual


(for short-attention span)
Make some accomodation
I-administer ahead yung taking subtests tas alisin
1. Analysis of case history data yung di useful subtests
2. Preliminary
56
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
b. Ability to Abstract (usually unpaired talaga) D. Perceptual, Motor & Perceptual
- Weschler Similiarities Test Motor
- The Proverbs Test - sensory functions (perc)
- Color Form Sorting Test (Weigl’s Test) - ability to move parts of the body
- Object Sorting Test (motor)
- Winconsn Card Sorting Test (geometric + colors) - integration + coordination of
(frontal lobe functioning) P&M; ex: henad-eye coordination
– jigsaw puzzle; Ishihara Test:
c. Executive Function color blindness
- organizing, planning, cognitive flexibility, inhibition of
impulses (frontal/prefrontal lobes) • Wepman Auditory Discrimination Test –
muss/much: same ba sila ?
• Tower of Hanoi • Bruininks – Oseretsky Test of Motor
• Porteus Maze Test Proficiency (BOTMP)
- prudence, forethought, mental alertness, - 4 ½ - 14 ½
power of sustained attention - running speed, agility, balance, strenght,
Adults = / response speed, dextrisity
Children = ? (/ for those with T.B.I) - Purdue Pegboard Test
• Clock Drawing Test (CDT) - originally for employee selection
- quickly screen for executive functions
- cognitive dysfunction (dementia/ *women do better than men
neurological condition)
- visual memory deficits, mild cognitive • Bender Visual – Motor Gestalt Test
improvement, decline in function due to - nine cards (used by Max Westheimer)
aging gestallen (configurational wholes)
- i-copy mo lang yung designs
A. Trail Making Item (usually 5 mins) + - diagnostic significance
- visio perpetual skills, working - no/few scoring, no norm (clinical
memory, ability to switch between judgement)
tasks. * Gobetz – recall phase (neurotic – lower
- most widely used sa Halstead- recall)
Reitan Neuropsychological Test - widely adopted for + info, not
- predict impairments in mobility for personality
and mortality
BVMG II
B. Field of Search Items - with copy & recall phase (timed)
- usually timed - +7 items (4 if 4-7 years old & 11
- right hemisphere lesions months) (3 if 8 -85)
(deficits in vision stanning ability) - motor test
- can be applied in everyday - perception test
activities such as driving, piloting - sensitive to maturation
aircraft, monitoring aircraft
Decker – childhood – adol = higher
C. Boston Naming Adult = lower
Test/Confrontation Naming/ Old age = rapid decrease
Identification
- perceptual component :
percieving the stimulus E. Verbal Functions
- semantic component : ano Controlled Word Association
meaning? Test (Verbal Associative
- lexical component : sabihin mo Fluency Test)
na A = apple, aunt, ant, arch, art

57
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
(adaptive funtion of dementia Luria-Nebraska Neuropsychological Battery
patients) - Aleksandr Luria inspired
- contains clinical scales designed to assess cognitive
Aphasia – di macomprehend processes and functions
/magets yung written or spoken - 1/3 of time lang compared sa (H-R)
language - identifies location of brain affected
Reitan-Indiana Aphasia
Screening Test (AST) Neurosensory Center Comprehensive Examination of
- naming common objects, Aphasia (NCCEA)
following verbal instructions - communication deficit
writing familiar words
Multilingual Aphasia Exam – Montreal Neurological Institute Battery (MNIB)
hispanics - trained neuropsych
- locating specific brain lesions

Southern California Sensory Navigation Test


- sensory integrative + motor functioning in children (4-9)

Severe Impairment Battery (SIB)


Passive – pwede ma forget - for sure impaired patients (almost floor)
Procedural – implicit; unconscious - attention, orientation, language, memory, visioperception and
Declarative – explicit; conscious constructiom
Semantic – general facts
Episodic – context-dependent; repeat digits in the context of Cognitive Behavioral Driver’s Inventory
memory test - whether individuals with brain damage are capable of driving
a motor vehicle
California Verbal Learning Test II
- echolalia (repeat lang) OTHER TOOLS
- recall, recognition, learning rate, error types, encoding
strategies, forcerd choice (maligneing) X-Ray – density of tissue
Cerebral Angiogram – tracer element is injected to the blood
Milner Tactile Non-sense Figures stream bago I x-ray yung cerebral area
- measure immediate/tactile/haptic memory CT Scan (Computerized Axial Tomography) – 3 dimensional
view of structure
Seguin – Goddard Formboard PET Scan (Position Emission Tomography) – biochemical
- pwede di tactile memory if blindfolded lesions
SPECT (Single Motion Emission Completed Tomography)
Neuopsychological Test Batteries – radio active trace fluid iodine with clear pictures if organs and
Fixed Batteries – prepackaged – for newbies tissues
Flexible Batteries – ikaw pipili; different norms per test; Brain Scan – radio isotope scan; may iniinjectsa brain; altered
integrate all data; tests do develop blood supply
Electroencephalograph (EEG) – electrical activity to the brain
* fixed was favored over felixble in most designs electropods pasted to the scalp; safe, painless, non invasive
procedure; diagnosis/treatment of seizure and other disorders
Halstead-Reitan Neuropsychological Battery Electromyograph – inserted sa ausicles
- most used fixed neuropsych battery Echoencephalograph – sound/ sanic energy/ abnormal
- first lab purposely for Brain-Behavior Correlates (Halstead) behvaio (OCD)
- requires highly trained examiners Lumbar Puncture – Spinal tap; may special needle na
- full work day to complete iniinsert sa widest spinal interpace; chemical normality of
- need ng supervision of trained psychologist when interpreting fluids; normality of intracanal pressure
- Halstead Impairment Index - .5 pataas (indication & problem) MRI – structural view
- used to identify behavioral deficits associated with particular FMRI – anong part yung active during the activity
neurological lesions

58
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
CHAPTER 16: ASSESSMENT, CAREER AND BUSINESS - Ability to understand the relationship between physical
forces and various mechanical tools
Strong Interest Inventory (_______ history of continuous use)
G-Stanley Hall – assess children’s interest in various Hand Tool Dexterity Test
recreational pursuits. - Blur the lines sa aptitude, achievement and interest
Edward K. Strong Jr. – SVIB - Take apart, reassemble, manipulate (sequence) time
- Originally for men limited

Strong-Campbell Interest Inventory O’Connor Tweezer Dexterity Test


- Both version of F and M - Perceptual motor, finger dexterity
- Insert brass pins with a metal plate using pair of tweezers
Strong Interest Inventory Revised Edition - Fire motor skills
- Contemporary career interest
- Probe preferences sa occupations, school, subject and General Aptitude Test
activities - identify aptitudes. Any of working age
- 3 hours
Self-Directed Search – based sa RIASE ni - 3 components (psychomotor tastes, paper and pencil)
Holland Special Aptitude Test Battery (SATB)
Minnesota Vocational Interest Inventory - Race normed
- Empirically keyed - ↓ criteria validity ↑ test retest validity
- Compare yung interest patterns sa non-professional
occupations Measure of Personality
-MMPI 2RF – pang clinical lang
.40 - .72 interest and aptitude’s correlation
NEOPI-R – most widely used to diagnose
*aptitude in HS – A level of job = +evaluation of Integrity Text – no to polygraphs/ lie detection test
Employee
0 aptitude in HS – A level of job = - evaluation Criterion Focused Occupational Personality Scale – COPS
ng employee - for narrowly defined personality
-However, Hindi connected ang job satisfactory sa aptitude ng
HS Applicant Potential Inventory
- Interest X Job Performance - electronically administered
X Job Satisfaction
*Wag masyado magrely sa interest inventories! *pre-employment honesty testing – negative work- related
+ increases ng confidence + self-efficiency + attitudes
personality + portfolio project
_______________________________________ MBTI – Carl Jung
- basic differences in the ways human beings take info and
Measures of Ability and Aptitude make valuable decisions
Aptitude – greater amount of informal learning
Achievement – more limited and focused Checklist of Adaptive Living Skills
- surveys the life skills needed to make successful transition
Wonderlic Personnel Test - personal living skills, home, community, employment
- Measure of general ability - any age
- 12 mins - observation for 3 months
- useful in career and pre employment counselling
- Asses spatial skills, abstract thought, mathematical skills.
- Screens individuals for jobs that require both crystalized Cross-Cultural Adaptability Inventory
and fluid - Meyers “culture shock”
- ability to adapt new cultures
Bennet Mechanical Comprehension Test BMCT - 6 point likert scale format
- Paper and pencil - tolerate ambiguity, maintain one’s personal identity to new
surroundings, interact with people from cultures

59
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
- emotional resilience, flexibility, opener, personal ability, - organizationally standardized procedures for evaluation
personal autonomy involving multiple assessment techniques (paper, and pencil,
situational performance test)
Career Transitions Inventory - nag originate kay Murray
- assess psychological transition
- task change, position change, occupation change Drug Testing – presence/absence of drug thru analysis of
blood, urine, hair or other biological specimen
Factor for Retirement
- goal instability scale Immunoassay Test - / or A?
- life satisfaction scale Gas Chromatography/ Mass Spectometry – anong drug?
- leisure satisfaction scale - illegal or over the counter? Indicate drugs taken; pwedeng
- interpret support evaluation scale legal drugs amdetect

Retirement Satisfaction Scale/Inventory Marijuana – with residue


- assess adjustment to retirement Cocaine – without residue but with impairment
Higher ex, higher emotional stability = higher R.S
Measure of Cognitive Ability
Screening, Selection, Classification, Placement - may difference talaga
- superficial process of evaluation based in minimal standard - Asians (math), Whites (comprehension + verbal ability)
(screen) (cognitive ab) compare sa black and Hispanics
- accept or reject (selection) - illegal ang differential cutoff/race normed
- rating/categorization (classify) To solve: more computer-administered evaluations that
- assignment to a group or category sa isang institution minimized verbal content, verbal skills
(placement)
Productivity – Output of Work Effect
Application Form – useful tool for quick screening
Letters of Recommendation – unique source of detailed info Rating and Ranking Procedures – most common method of
about applicant’s past performance, quality of applicant’s evaluating productivity
stability
a. Forced Distribution Technique
Performance Test – to obtain job related performance sample - for large number of assessees
- unsatisfactory, poor, fair, average, good, superior
Seashore Bennet Stenographic Proficiency Scale/Test - no. of absences
- stenographic competence
- blurred ach, apt, int/performance b. Critical Incidents Technique
- + and – behavior
Minnesota Criterial Test (MCT) - Honeymoon Period: 3 months (newbies)
- clinical apt
- speed/accuracy Peer Ratings – peers have the tendency to rate their
counterpart higher compared to supervisor
Business Leadership Ability Limit: gantihan

Leaderless Group Technique Team: 2 or more people who interact independently toward a
- communication skills, problem solving ability ability to cope common and valued goal
and work together
Motivation
In Basket Technique Paired Comparison (Forced Choice) Format
- managerial ability, organizational skills, leadership potential - champagne, rural unskilled population
- organizing, planning, problem solving, decision making, - working with friends and neighbors (praised for good work)
creativity, leadership, written communication skills → least imp. Factor (2nd to the least)

Assessment Center • Vroom Expectancy Theory of Motivation –


expectancy, instrumentality, valence

60
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm
• Maslow - branch of social psych that deals primarily with the
• Aldefer – no hierarchy development advertising and marketing of products and
• McClelland – Nach (moderate), NPow services.
• Work Preference Inventory – competition +
recognition
Measurement of Attitudes
Burnout – psychological syndrome of emotional evaluation, - typically measured by self-reports
depersonalization and reduced personal accomplishment that Limitation – we differ in our ability to be introspective of level
can occur among individuals, who work with other people in of self-awareness; some are not willing to be candid about their
same capacity attitudes

Emotional Exhaustion – inability to give oneself emotionally Implicit Attitude – non conscious automatic association in
to others memory that produces a disposition to react in some
Depersonalization – distancing from other people and even characteristic manner to a particular stimulus
developing cynical attitudes towards them - gut level reactions

Consequence of Burnout – higher absenteeism, higher job Implicit Attitude Test


turnover frequency and duration of quick leave; insomnia, - most embraced in research field
alcohol + drug abuse - reaction time
- psych – complex
Maslach Burnout Inventory
- most widely used measure of burnout Surveys

Personnel in Nursing, home for the aged and children – Face to Face
higher burnout; higher in burnout, lower in satisfaction - adequate classification
- availability to stimuli
Attitude – learned disposition to read in some characteristic - manage feasible expression
manner to a particular stimulus - not for sensitive topics

Job Satisfaction Telephone Survey


- more productive, more consistent in work output - less info
- less likely to complain, less likely to be absent - biased if knows mo ang directory
- a pleasurable or positive emotional state resulting from the - random dialing is a problem
appraisal of one’s job experiences
- work environment, work schedule, stress organizational Mail Survey – Most Appropriate Survey Method
culture - low in cost
- sensitive topics
Organizational Commitment - need consultation
- loyalty, identification and involvement
O.C Questionnaire Disadvantages:
- no response
Organizational Culture - not intended recipients
- also called as corporate culture - late responses
- way of work
- discussion of organizational culture Consumer Panel – sumasagot
- develop a sense of uniqueness about the Diary Panel – detailed records of behavior
culture on a specific company
- first impressions, physical space, corporate structure and Motivation Research Methods
roles - individual/ focus group
- analyze matters for consumer behavior and attitudes
Other Tools of Assessment for Business Applications - test respondents
- provide data to declare hypothesis
Consumer Psychology - has diagnostic value

61
PSYCHOLOGICAL ASSESSMENT. Prepared by: Prof. Rolf Gian Marcos, RPm

Focus Group
– group led by independent moderator
- 6 to 12 respondents
- typically 1 to 2 hours
- recorded/ video taped
- moderator – active listener; doesn’t judge; encourages
- exploring research

Dimensional Quantitative Research


- comprehensive and systematic
- BASIC ID
B – behavior
A – affect
S – sensation
I – imagery
C – cognitive
I – interpret
D – drugs/biological
- for any discipline

Note: If in case you are confused about a certain concept,


kindly look at the reference noted below. Handouts could
strengthen your knowledge, but purely depending on it might
limit your opportunity to read the small details that might help
you better understand the subject matter. In addition, I
apologize if there are typographical errors. Rest assured that I
have given my best to proofread all of them, but just like what
Freud said about superego, perfection is in no way attainable

Reference:
Cohen, R. J., & Swerdlik, M. E. (2018) Psychological Testing
and Assessment: An Introduction to Testing and Measurement.
New York, NY: McGraw-Hill Education

I CAN DO IT, I CAN MAKE IT, I WILL TOP THE BOARD


EXAM AND BE THE NUMBER 1!!!!!!!!!!!!!
RPMs GRAB YOUR LICENSE, IT’S MINE!!!!!!!!!!

To my VIP, Lowerbox, Upperbox and Gen Ad RPm babies:


I will always be praying for your success. Pass or fail, I will
forever be grateful for your effort, passion and dedication to
uplift the mental health field in our country. Focus on your
passion, on why you are doing this in the first place. Fatigue is
normal. Acknowledge it. Then remind yourself that this isn’t
only your fight, this is the fight of all the people that needs our
help. You are very much needed. God bless you! See you sa
Oath-taking at PAP Convention! Pag na-meet mo ulit ako, wag
ako dedmahin pls, masasabunutan kita (mataas ang aggressive
drive  ) charet!!! MWAAAA
- Sir/Sis Gian ☺

62

You might also like