13th European Conference on Psychological Assessment

Original Photo adapted from Hansueli Kramer / CC BY

Date: Thursday, 23/Jul/2015
8:30am - 9:30am	K1: Assessing and Changing Cognitive Processes in Addiction Session Chair: Victor J. Rubio Reinout W. Wiers (University of Amsterdam, Netherlands)
KO2-F-180 (Ⅵ)
9:45am - 11:15am	IS1: Is “Q-short” a Useful Approach for Psychological Assessment? Pitfalls and Opportunities of Short Questionnaires for the Measurement of Psychological Constructs Session Chair: Christoph J. Kemper
KO2-F-180 (Ⅵ)
	Is “Q-short” a useful approach for psychological assessment? Pitfalls and opportunities of short questionnaires for the measurement of psychological constructs Chair(s): Christoph J. Kemper (University of Luxembourg, Luxembourg) In recent years, the development and application of short questionnaires (“Q-short”) for psychological constructs has been gaining pace. At present, short measures are widely-used for psychological assessment in diverse domains, e.g. personality, social or I/O psychology, psychopathology, social and educational science, and behavioral economics, as well as diverse assessment settings such as research and practice. Their popularity is largely due to the promise of higher efficiency of measurement, lower cost, lower respondent burden, and higher data quality. Besides these obvious advantages, there is also considerable criticism for using short questionnaires of psychological constructs leaving researchers and practitioners in limbo concerning the choice of an appropriate measure for their assessment setting. As the criticism mainly pertains to the methodology of short scale development, the symposium focuses on the construction process. Presenters demonstrate and/or compare construction strategies such as manual and automated approaches (e.g. Ant Colony Optimization) or top-down strategies (starting with a longer version of a scale) and bottom-up strategies (starting with a single item to which further items are gradually added) using empirical as well as simulated data. Aim of the symposium is to make recommendations for the development, validation, and application of short questionnaires in research and applied settings. Presentations of the Symposium Assessing personality and situation perception at the same time, in a short time Matthias Ziegler¹, Kai Horstmann¹, Marko Vetter²; zieglema@hu-berlin.de zieglema@hu-berlin.de ¹Humboldt-Universität zu Berlin, Germany, ²Schuhfried GmbH, Austria The idea of interactionism suggests that human behavior is caused equally by the situation and the personality of the actor. However, personality tests and situation perception are scarce. Here, the B5PS, a test capturing the Big 5 and 42 facets as well as 5 dimensions of situation perception (Situation 5) is used as starting point for the development of a short test. This short test yields scores for the Big 5 and the Situation 5 and was evaluated using a representative sample of 400. During the talk, the construction strategy paying specific attention to the nomological network of the constructs assessed will be explained. Moreover, evidence for the psychometric quality of the short test will be reported and compared with the original test (criterion, convergent, discriminant, and factorial validity as well as construct and test-retest reliability). The mixed-method approach applied here can be generally applied in test construction and can serve as a best practice example. Following the ants: Pros and cons of Ant Colony Optimization (ACO) for short scale development Anne B. Janssen¹, Martin Schultze², Adrian Grötsch³; a.janssen@jacobs-university.de a.janssen@jacobs-university.de ¹Jacobs University Bremen, Germany, ²Freie Universität Berlin, Germany, ³Technische Universität Braunschweig, Germany The present study was aimed at constructing useable, reliable, and valid short scales of two measures assessing proactive personality and supervisor support. For this purpose, we compared Ant Colony Optimization (ACO; Leite et al., 2008) and classical item selection procedures. ACO is algorithm-based, and selects and compares sets of items according to defined criteria. For proactive personality, the two selection procedures (ACO and classical item selection) provided similar results. Both five-item short forms showed a satisfactory reliability and a small, however negligible, loss of criterion validity. For a two-dimensional supervisor support scale, ACO found a reliable and valid short form. Psychometric properties of the short version were in accordance with those of the parent form. A classical short form for supervisor support revealed a rather poor model fit and a serious loss of validity. Benefits and shortcomings of ACO compared to classical item selection procedures and recommendations of ACO application are discussed. Best practices in short scale development: Comparing state-of-the-art methods using simulated and empirical data Peter M. Kruyen¹, Constanze Beierlein², Beatrice Rammstedt²; p.m.kruyen@gmail.com p.m.kruyen@gmail.com ¹Radboud University Nijmegen, The Netherlands, ²GESIS Leibniz Institute for the Social Sciences, Germany Psychological constructs have attracted increasing attention as valuable predictors of social phenomena. However, most psychological measures include too many items to be practically useful in large-scale research. Because of this, researchers often remove items from these long measures. By doing so, many researchers rely on well-known techniques such as maximizing coefficient alpha. Research has shown, however, that these strategies may result in serious deficiencies. Recently, psychometricians have developed sophisticated methods that are believed to result in sound short scales. From the viewpoint of practitioners, there seems to be little guideline on how to choose and apply these new techniques to optimally shorten a scale. Against this background, the aim of our talk is three-fold: First, we explain the limitations of old approaches. Subsequently, we will introduce several state-of-the-art procedures. In this context, we will compare “top-down” and “bottom-up” strategies. Top-down approaches refer to item-selection process which start with a longer version of a scale. “Bottom-up” approaches, in contrast, start with a single item. Here, scale components are gradually added. We also distinguish between manual and automated approaches. We use both simulated and empirical data to evaluate these different methods. Finally, we provide recommendations for using appropriate procedures for constructing short measures.
9:45am - 11:15am	S1: Opportunities and Challenges of Longitudinal Perspectives Session Chair: Grégoire Bollmann Discussant: Martin Tomasik
KOL-G-217 (Ⅳ)
	Opportunities and challenges of longitudinal perspectives Chair(s): Grégoire Bollmann (University of Lausanne, Switzerland) Discussant(s): Martin Tomasik (University of Zurich, Switzerland) Time poses several challenges to longitudinal perspectives, an example of this would be when it comes to ensure measurement invariance of constructs or to assess people evaluations of past events. This symposium brings together researchers from the Swiss National Center of Competence in Research LIVES and those interested in longitudinal perspectives to explore these challenges and discuss the opportunities they also entail. First, introducing the issue of measurement invariance, Brodbeck and colleagues examine standardized inventories of marital satisfaction and psychopathological symptoms. In two 2-waves studies on married individuals and on patients before and after psychotherapy, respectively, this team presents the evolution of these constructs over time. Sarrasin then showcases the invariance of a 5-item self-esteem scale with multigroup confirmatory factor analyses in the tumultuous context of late adolescence and young adulthood. Her results highlight that changes in self-esteem of this vulnerable population are mainly related to changes in their satisfaction with their body image. Finally, Morselli and colleagues present life-history calendars as a means to approach past events. Their work compares respondents’ subjective evaluations of their personal trajectory obtained with graphical representations or a differential scale and pinpoints advantages of life-history calendars. Presentations of the Symposium Longitudinal measurement invariance issues illustrated by examples of marital satisfaction in later life and the structure of psychopathology before and after psychotherapy Jeannette Brodbeck, Hansjörg Znoj, Pasqualina Perrig-Chiello; jeannette.brodbeck@psy.unibe.ch jeannette.brodbeck@psy.unibe.ch University of Bern, Switzerland When questionnaires are administered repeatedly over time, measurement invariance needs to be established in order to determine whether the same construct is measured with similar precision. After a short introduction to cross-sectional and longitudinal measurement invariance, these concepts are illustrated by two examples employing a confirmatory factor analysis framework for categorical data. The first example is a 10-item version of the Marital Satisfaction Inventory (Whisman, Snyder, & Beach, 2009) administered at baseline and two years later to a population sample of 1275 married individuals aged 40+ (NCCR LIVES, IP-12). Measurement invariance held only for a modified one-factor model but not the three factor model which fitted the data best at baseline. The second example is the latent structure of psychopathology assessed with the Brief Symptom Inventory (Franke, 2000) before and after psychotherapy in 526 patients. Configural factorial invariance was not confirmed and the latent structure of psychopathology was simpler after psychotherapy. Implications of these measurement invariance issues for theory, statistical analyses, and adaptation of the measures will be discussed. Measuring self-esteem among young adults in different educational tracks: A longitudinal perspective Oriane Sarrasin; oriane.sarrasin@gmail.com oriane.sarrasin@gmail.com University of Lausanne, Switzerland Self-esteem is subject to strong variations during late adolescence and young adulthood: Not only does it drop and then rise again gradually, but also marked differences across educational groups are often found. To ensure that such within- and between-individuals comparisons are unbiased, it is necessary to verify in preliminary analyses that measurement of self-esteem is invariant. To illustrate this, data of young adults (M = 18.7, SD = 2.61) in academic (N(2013) = 147; N2014 = 115) and vocational (N(2013) = 160; N(2014) = 65) tracks from the two last waves of the Longitudinal Lausanne Youth Study (NCCR LIVES) were selected. Cross-sectional multigroup confirmatory factor analyses (MGCFA) showed no difference in the measurement of the five-item self-esteem scale across the two groups, indicating that in both years unbiased mean comparisons can be conducted. Contrary to previous research, participants in academic and vocational tracks did not differ significantly in their self-esteem. In contrast, longitudinal MGCFAs revealed one within-individuals difference: While all other items remain stable, participants reported being less satisfied with their body image as they grew older. The inclusion of this item in a composite score can lead to the erroneous conclusion that young adults’ general self-esteem decreases over time. The use of Life-History Calendar Methods (LHC) to assess subjective evaluation of the personal life trajectory Davide Morselli, Dario Spini, Nora Dasoki, Elenya Page; Davide.Morselli@unil.ch Davide.Morselli@unil.ch University of Lausanne, Switzerland Life-history calendar (LHC) methods have been increasingly used life-course research as well as other domains that are interested in the timing of events and trajectories. There is indeed a consensus on the fact that the highly structured but flexible approach of the LHC facilitates the memory of past events. Respondent's experience provide a context for retrieval of autobiographical memories and they are used as anchoring points and time landmarks for recollecting events. The literature has shown that this method provides more reliable answers that conventional question lists on biographical retrospective data. If the LHC method has been mainly used to collect data on factual (i.e., objective) events, a few experiences have adapted it for tapping subjective dimensions and assessing the psychological impact of events. In this study we investigate whether the LHC method can be used to asses respondents' evaluation of their own personal trajectory, by comparing two different methods. The first one relies on the use of a differential scale on which respondents indicate their evaluation. The second one maximizes the visual potential of the LHC and respondents are asked to graphically express their evaluation.
9:45am - 11:15am	PA1: Measurement 1 Session Chair: Klaus D. Kubinger
KOL-G-204 (Ⅱ)	PA1: Measurement 1 Session Chair: Klaus D. Kubinger
	On designing data-sampling for Rasch Model calibrating an achievement test Klaus D. Kubinger¹, Dieter Rasch², Takuya Yanagida³ ¹University of Vienna, Austria; ²University of Natural Resources and Applied Life Sciences, Vienna; ³University of Applied Sciences, Austria; klaus.kubinger@univie.ac.at klaus.kubinger@univie.ac.at Though calibration of an achievement test within psychological and educational context is very often carried out by the Rasch model, data sampling is hardly designed according to its statistical foundations. Kubinger, Rasch, and Yanagida (2009) suggested an approach for the determination of sample size according to a given Type I and Type II risk, and a certain effect of model misfit when testing the Rasch model is supported by some new results. The approach uses a three-way analysis of variance design (A > B) x C with mixed classification. There is a (fixed) group factor A, a (random) factor B of testees within A, and a (fixed) factor C of items cross-classified with (A > B). In accordnce with to Andersen’s Likelihood-Ratio test, the testees must be divided into at least two groups according to some criterion suspected of causing differential item functioning (DIF). The Rasch model’s quality of specific objective measurement is in accordance with no interaction effect A x C. The results of simulations studies are: the approach works given several restrictions, and its main aim, the determination of the sample size, is attained. Additionally, our approach's power is consistently higher than Andersen's test. Examining fit in covariance modeling with ordinal data Christine DiStefano¹, Grant Morgan², Phillip Sherlock¹ ¹University of South Carolina, USA; ²Baylor University, USA; distefan@mailbox.sc.edu distefan@mailbox.sc.edu Fit indices are routinely used with covariance modeling to provide information about the goodness of fit between the hypothesized model and the data. These indices include relative fit indices (e.g., Goodness of fit Index, Root Mean Square Error of Approximation, Standardized Root Mean Square Residual) and incremental fit indices (e.g., Tucker Lewis Fit Index (Nonnormed fit index, Comparative Fit Index, Incremental Fit Index). Recommendations and rules of thumb for interpreting various fit indices have been presented in the literature; however these guidelines are largely built from investigations using continuous, multivariate, normal data, and normal theory estimators (maximum likelihood or generalized least squares). As most of the data used in empirical studies is not continuous and may not be normally distributed, these recommendations may not hold when ordered category data are analyzed and/or robust estimators are used. Little is known as to how ad-hoc fit indices behave under non-normal and/or ordinal data. The purpose of this study is to examine the performance of fit indices under situations of categorical data and non-normality. Conditions such as sample size, number of ordered categories, non-normality, and estimation technique will be manipulated to examine the performance of fit indices. Statistical and theoretical reductionism in research on scientific thinking: How much can the Rasch model tell us? Peter Adriaan Edelsbrunner¹, Fabian Dablander² ¹ETH Zurich, Switzerland; ²University of Tübingen, Germany; dostodabsi@gmail.com dostodabsi@gmail.com In recent research on scientific thinking, Rasch modeling was employed to investigate the dimensionality of items that were meant to cover a wide variety of skills. Based on generic fit statistics and model comparisons, it was concluded that scientific thinking represents a unidimensional psychological construct. Using simulations, we argue that generic fit statistics and model comparisons based on the Rasch model merely warrant crude conclusions about the use of composite scores for practical assessments. Without strong prior theory, results from the Rasch model do not warrant theoretical conclusions about the dimensionality of the underlying psychological construct. In the simulations we compare the adequacy of various alternative measurement models for examining structural assumptions about scientific thinking. Based on the simulations, crucial assumptions of the Rasch model and their implications for theory development are discussed, expanding the discussion by drawing parallels to reductionism in intelligence and psychiatry research. We conclude that an undue reliance on Rasch models might not benefit and even hinder theory development in research on scientific thinking. Alternative measurement models and experimental studies might provide more thorough insight into the structure of scientific thinking. Finally, we discuss our study’s implications for other fields with frequent application of Rasch models. Teaching statistical inference and the null hypothesis significance controversy Ernest Kwan, Irene R. R. Lu Carleton University, Canada; ernest.kwan@carleton.ca ernest.kwan@carleton.ca Null hypothesis significance testing (NHST) is the predominant procedure for statistical inference in the social sciences. Quantitative methodologists, however, have debated the legitimacy of NHST, and the American Psychological Association convened a task force to evaluate the role of NHST in quantitative research. We describe an approach to teaching statistical inference that illustrates the problems of NHST and reviews the recommendations of reform made by the task force and other renowned methodologists. This pedagogical approach is designed for a statistics course enrolled by graduate students in a research-oriented doctoral program. Accordingly, our approach also illustrates how NHST should and should not be used to evaluate substantive theories or hypotheses of interest.
9:45am - 11:15am	PA2: Organizational Assessment 1 Session Chair: Bengt Jansson
KOL-G-209 (Ⅲ)
	Prediction of job performance, based on leaders' ratings, with a personality test: A correlational approach Bengt Jansson University of Gothenburg, Sweden; psybenja@gmail.com psybenja@gmail.com Predictive criterion validity was examined in this study based on leaders' ratings of job performance. Employees were recruited with the personality test JobmatchTalent (JMT). A number of 258 recruited individuals was included in the analysis, and the ratings were performed by 95 leaders. The ratings were based on three dimensions (criteria): (1) productivity and motivation, (2) quality and structure of performed work, and (3) interpersonal skills. Nine main scales of JMT were used as predictors of the criteria (i.e. Work Structure, Personnel Drive, Decision characteristics, Activity, Drive, Acting, Tolerance, Social interest, and Communication). The predictive validity was based on pairwise correlations between criteria and predictors. By use of conceptual similarity, the 27 possible correlations were categorized as convergent (expected) or divergent (non-expected). A clear support was found for the assumed relationships. The average convergent correlation was .37 (N = 258). For a subgroup of recruited managers (n = 120), the corresponding value was .46. Convergent values differed significantly from divergent values. The predictive validity of the JMT was rated as good. The values found were compared to an extensive international meta-analysis, in which a correlation of just less than .40 was considered to be a very high value. Predicting job performance of financial representatives based on Harrison Assessment Talent Management System (HATS) Kristie Ann Craigen Blurton Family Development Center, China; kcraigen@blurton-fdc.com kcraigen@blurton-fdc.com The goal of this research was to empirically evaluate the predictive utility of Harrison Assessment Talent Management System (HATS) for use as a selection instrument within organizational settings. This was done by investigating the strength of the relationship between the job fit percentages generated by HATS, and the performance ratings (i.e., number of life insurance policies sold within the first 6 months of employment) of 238 employees (201 men and 37 women) working for a large American insurance company. The resulting correlation between HATS prediction scores and performance ratings yielded significant findings, r (236) = .599, p> .0001, indicating HATS is a useful instrument for predicting employee performance. Further ancillary analysis revealed 7 of the 156 predictor variables significantly correlated with performance ratings. As such, it is highly probable that many traits measured in the workplace (and by HATS) are best conceptualized as sharing a curvilinear relationship with performance. Moreover, the strength of the correlation coefficient between employee performance ratings and HATS prediction scores, in addition to its innovative methodology makes it a strong and rather unique instrument among the psychological tests currently available for use in organizational settings. Predictive validity of an asynchronous video interview assessing leadership competences in a real teamwork situation Anne-Kathrin Schock, Tuulia Ortner, Eva Traut-Mattausch University of Salzburg, Austria; anne-kathrin.schock@sbg.ac.at anne-kathrin.schock@sbg.ac.at Interviews are the most commonly used assessment tool in personnel selection. Nevertheless their application often lacks procedural standards and systematic scoring, which in turn can reduce fairness, comparability, and transparency. We sought to investigate the predictive validity of a new and promising tool in regard to standardisation: asynchronous video interviewing. Asynchronicity implies that candidates and interviewers do not need to be present at the same place and time, as recording and scoring form independent steps. To validate an interview for the assessment of leadership competences, a sample of 120 students completed the interview and held presentations in teams of 10 persons. Afterwards, students were asked to name and rank the team members according to their leadership potential during teamwork. First analyses of the interview revealed that the number of practical competences relevant to leadership success in small newly formed groups specified by participants was significantly associated with the frequency of assigned leadership. Further results regarding the interviews’ predictive validity will be discussed. Measurement invariance and differential item functioning investigation of FVAT-S across gender and age groups in Turkish sample Levent Sevinç, Hayri Eren Suna Assessment Systems; erens@assessment.com.tr erens@assessment.com.tr The main purpose of the study was to investigate the measurement invariance of FVAT-S (FVAT-Short Form) across gender and age groups. FVAT-S is an ability test, which is widely-used in employee election procedures in Turkey. The sample that was used consisted of 2464 applicants (1273 females and 1191 males). The measurement invariance test was performed with Multiple Group Confirmatory Factor Analysis (MG-CFA). To decide the invariance, the change in the Comparative Fit Index (ΔCFI) was considered and additionally RMSEA, SRMR, and TLI were reviewed. At the first stage, FVAT-S showed configural and metric invariance across gender groups and then, partial scalar invariance was attained with the modifications. In terms of age groups (aged between 20-25 and 26-32), FVAT-S showed, and metric, and scalar invariance. Items which disrupting the scalar invariance across gender groups were tested against Differential Item Functioning. All items which decrease the invariance included gender-based DIF. These results showed that FVAT-S is a useful tool to compare age groups; however amounts of error may be changed between age groups. The DIF-included items in FVAT-S are needed to be reviewed to achieve full scalar invariance in gender groups. Test order effects in high stakes assessment: An experiment using objective personality tests and cognitive ability tests in personnel selection Alexander Leonard Schünemann¹, Lale Khorramdel-Ameri^2,3 ¹Schuhfried GmbH, Austria; ²University of Vienna, Austria; ³Educational Testing Service, USA; schuenemann@schuhfried.at schuenemann@schuhfried.at For various reasons, varying test orders within test batteries is a common practice. Different test orders are used for organizational reasons, to avoid cheating, or to prevent a decrease in motivation if test-takers realize that others are performing faster. However, this practice is not well explored or proven to be without consequences for test results and therefore might decrease fairness and comparability in assessment processes. An experiment is presented that aims to extend a study from Khorramdel and Frebort (2011) by examining the influence of test order effects in personnel selection. The data comes from 70 applicants for a management trainee program of an Austrian railway company who had to take part in a computerized assessment of their professional potential as part of the selection process. Objective personality tests and cognitive ability tests were administered within a computer test battery and the administration order was systematically varied. The results are compared to the findings of Khorramdel and Frebort (2011) and are discussed with respect to subjects’ differences in cognitive ability and educational level, achievement motivation, and resilience.
9:45am - 11:15am	PA3: Clinical Assessment 1: Children and Adolescents Session Chair: Nina Krüger
KOL-G-221 (Ⅴ)
	The Anxiety Screening for Kids (ASK): A new self-report instrument to measure trait-anxiety in primary school aged children Nina Krüger University of Hamburg, Germany; Nina.Krueger@uni-hamburg.de Nina.Krueger@uni-hamburg.de Children at primary school age are already able to provide information about their level of trait-anxiety. The self-report is an important source of information for anxieties which are not recognizable on behavioral level. However, there is a lack of instruments available in the German language in order to be able to assess heightened primary school aged children’s anxiety levels so far. This study aims to examine the test quality of the “Anxiety Screening for Kids (ASK)”. In order to define reference values and psychometric properties for the ASK 894, primary school children from northern Germany were tested, some of them repeatedly. Different quantitative methods as well as qualitative methods were applied to examine the validity of the test. Overall, the results verify the high test quality of the ASK. The given objectivity as well as satisfying up to good reliabilities and a satisfactory up to good retest reliability are to be highlighted. Furthermore, good discriminant and convergent validity as well as differential validity are given. The sensitivity to change has also been confirmed. The results characterize the ASK as an objective, reliable and valid test for measuring self-reported anxiety of primary school children. Validity of the TAS-20 and interrelations between alexithymia and psychosocial ill-being among Finnish adolescents Eero Laakkonen, Niina Johanna Junttila, Päivi Niemi University of Turku, Finland; eerlaa@utu.fi eerlaa@utu.fi Alexithymia is defined as a set of cognitive-emotional deficits such as the inability to identify or express emotion and affect, as well as avoidance in coping with conflicts or articulating emotions. Due to poor affective communication, individuals with alexithymia may lack the skills to seek support from individuals around them. The first aim of this study was to validate a measurement scale (based on TAS-20 by Bagby, Parker & Taylor 1994) in order to evaluate Finnish adolescents (n =393) with alexithymia. Secondly, we aimed to study the interrelations between the adolescents' alexithymia and psychosocial ill-being (social and emotional loneliness, social anxiety, social phobia, and depression). This study was a part of a longitudinal research project focusing on the socio-emotinal well-being of Finnish school students in lower secondary school. Data was based on the self evaluations from the grades 7-9 (ages 13-15). Analyses were conducted mostly using structural equation modeling. Based on validity analyses a three-factor model (Difficulty Describing Feelings, Difficulty Identifying Feeling, Externally-Oriented Thinking) of adolescents with alexithymia was confirmed. Results showed that alexithymia was significantly interrelated with the psychosocial ill-being: alexithymic adolescents suffered more from loneliness, anxiety, social phobia, and depression, compared with the non-alexithymic subjects. Gender differences were not significant. Asian Adolescents Depression Scale (AADS): Urdu translation and analysis of psychometric properties Muhammad Rizwan^1,2, Gisela Michel¹, Sumbleen Khan², Mehwish Mursaleen², Saira Qurashi² ¹University of Luzern, Switzerland; ²University of Karachi, Pakistan; Muhammad.Rizwan@unilu.ch Muhammad.Rizwan@unilu.ch Depression is a mental illness that is prevailed in every society of the world regardless of gender, socioeconomic status, and age of habitants. There are lots of scales developed to measure the level of depression among community and clinical samples. One of newly developed measures of depression is Asian Adolescent Depression Scale (AADS; Woo, 2004) that is especially developed to measure level of depression among Asian adolescents. The Present study was designed to translated/adapt the English AADS into Urdu (National Language of Pakistan) by following back translation method. The Urdu translated version of AADS was applied with 292 adolescents in Karachi, Pakistan. The age range of participants was 13-16 (M=14.4) years. Analysis of internal consistency by Cronbach’s alpha α = .86 provided a good evidence of AADS reliability. The positive correlation of AADS was found with Depression (r=.36), Anxiety (r=.31), and Stress (r=.34) subscales of DASS-21, and negative relationship was found with MSPSS (r= -.35). The correlation (r=.77) between original and Urdu translation of the AADS is also strong. Results indicated that Urdu AADS can be confidently utilized with community samples of Pakistani adolescents. Convergent and divergent validity of the Preschool and Kindergarten Behavior Scales – 2nd edition and the Strengths and Difficulties Questionnaire Sofia Major, Maria João Seabra-Santos University of Coimbra, Portugal; smajor@fpce.uc.pt smajor@fpce.uc.pt The development of behavior rating scales specifically intended to assess preschoolers’ social-emotional behaviors has been a concern in the field of psychological assessment for quite some time. This study aims to analyze the correlations between scores on the Portuguese version of the Preschool and Kindergarten Behavior Scales – 2nd Edition (PKBS-2) and on the Strengths and Difficulties Questionnaire (SDQ-Por). The sample was composed of preschool children (n=80; 40 males, 40 femalles) assessed with both rating scales, by parents and teachers. As expected, all the PKBS-2 Social Skills scores had negative correlations with the SDQ-Por problem scores, and all the PKBS-2 Problem Behaviors scores had positive correlations with the SDQ-Por problem scores. Correlations with the SDQ-Por Prosocial Behavior score were in the opposite direction. The correlation for Problem Behaviors total scores was of .53 and .73 for parents and teachers, respectively (p˂.01). The strongest correlation was found between the PKBS-2 Opposition/Explosive subscale and the SDQ-Por Conduct Problems (r=.65, parents' ratings) and between the PKBS-2 Overactivity/Inattention subscale and the SDQ-Por Hyperactivity/Inattention (r=.79, teachers' ratings). Results confrim the constructs assessed by both instruments, reinforcing utility of the Portuguese version of the PKBS-2 as a valid tool available to assess social-emotional behaviors at preschool age.
11:15am - 11:45am	Coffee Break
Lichthof/Atrium	Coffee Break
11:45am - 1:15pm	PD: Potential Impact of the Revised EFPA Review Model for the Description and Evaluation of Psychological and Educational Tests Chair: Dave Bartram Discussants: Johnny Fontaine, Mark Schittekatte, Fons van de Vijver
KO2-F-180 (Ⅵ)
11:45am - 1:15pm	PA4: Organizational Assessment 2 Session Chair: Olaf Ringelband
KOL-G-217 (Ⅳ)
	A different sort of pedigree: Top-managers’ personality structures, career success, and derailment risks Olaf Ringelband md gesellschaft für management-diagnostik, Germany; ringelband@management-diagnostik.de ringelband@management-diagnostik.de Top-managers’ personalities differ significantly from those of other people. Two sample groups of Top-managers (n=1,052 and n=495) filled out two different personality inventories (BIP and CPI, respectively). The results of the survey proved that generally, managers are more assertive, sociable, self-confident, and show stronger performance motivation than other professionals. The ramifications of those personality traits on professional success and career development are discussed. Special attention is paid to the derailment risks associated with the aforementioned traits— and their connection to top-managers’ psychopathic behavior (“The Dark Triad”, Paulhus, 2010). Concluding possible measures for reducing top-managers’ derailment-risks are discussed. The development of an instrument to assess organizational learning in small and medium enterprise in Asia Yu-Lin Wang National Cheng Kung University, Republic of China (Taiwan); ywang@mail.ncku.edu.tw ywang@mail.ncku.edu.tw Organizational learning has been examined since the 1950’s and the base of literature on the topic has expanded conceptually, theoretically, and somewhat empirically during the past decades. However limited instrument in measuring organizational learning processes exists. Scholars have indicated that part of the reason is because it is difficult to develop a quantitative measurement of organizational learning. Such limited instrument in measuring organizational learning processes has hampered the empirical research on organizational learning. In addition, current empirical studies on exploring organizational learning have tended to focus on large firms. Unlike large firms, small and medium enterprises, with limited resource on money and people, usually adopt different approaches in learning and obtaining knowledge. Moreover, existing organizational learning instruments based on Western countries may not fit for Asian context. It is necessary to develop an indigenous organizational learning instrument that captures organizational learning processes may be unique to an Asian country. As a result, the purpose of this study is to develop a valid and reliable instrument to measure organizational learning in small and medium enterprises in Asia in order to understand and explain the organizational learning phenomena. Unmasking ethical leadership: Quantitative research on the characteristics that describe ethical leaders at work Eirini Marina Mitropoulou¹, Ioannis Tsaousis¹, Despoina Xanthopoulou², Konstantinos Petrides³ ¹University of Crete, Greece; ²Aristoteleio University of Thessaloniki, Greece; ³University College London, UK; psyp165@psy.soc.uoc.gr psyp165@psy.soc.uoc.gr The appropriate definition and assessment of ethical leadership has been a source of conceptual confusion in the leadership literature. During the last decade different theories have been evolved, all including different type and number of leadership characteristics. Consequently, none of the existing theories provides a full understanding of the concept of ethical leadership. In this study, all ethical leadership characteristics are evaluated that are present in international literature. In total, twenty seven characteristics derived and their relevancy was tested in a quantitative study, using a multi source sample (both employers and employees in public and private sector in Greece). A new factor structure was investigated for all twenty seven characteristics with CFA testing one-, two-, three-, four-, five-, and by- factor models. Fit indices showed that a four factor model had the most acceptable fit to the data. The four factors that emerged were named Ethical Virtue, Solidarity, Ethical Practices and Fulfillment of Ethical Goals. This new four-factor model will be the groundwork for creating a new psychometric scale that will assess ethical leadership at work. Personality-based Person-Organization (PO) fit: A new direction for personality assessments Punya V. Iyer^1,2, Alec W. Serlie², Janneke K. Oostrom³, Marise Ph. Born¹ ¹Erasmus University Rotterdam, The Netherlands; ²GITP, The Netherlands; ³VU University, The Netherlands; iyer@fsw.eur.nl iyer@fsw.eur.nl This study aims to demonstrate the value of personality assessments from a Person-Organization (PO) fit perspective. Organizations could improve the utility of HR-assessments by using personality questionnaires to assess individual personality as well as personality-based PO fit. We initially hypothesized personality-based PO fit to predict satisfaction and intention-to-stay. Furthermore, we hypothesized the fit relationships to be stable over time (two years apart). In phase-I (T0), 636 employees in Netherlands completed questionnaires on their individual personality, perceived organizational personality, and the criteria (satisfaction and intention-to-stay). In phase-II (T1), 202 of the original respondents completed the same questionnaires. The personality dimensions measured were agreeableness, enterprise, competence, chic, ruthlessness, innovativeness, and stability. Polynomial regression analysis revealed that, at T0 PO fit led to satisfaction for all dimensions except agreeableness. Similarly, PO fit at T0 predicted an intention to stay for all dimensions expect agreeableness and competence. PO fit at T1 led to satisfaction for competence, chic, ruthlessness, and stability whereas PO fit predicted an intention to stay for enterprise and chic. We conclude that personality questionnaires can be used in the form of PO fit and specifically personality-based PO fit is a valuable and stable predictor of an individual’s future attitudes and behaviors. Assessment of consumer heterogeneity: A comparison of two multidimensional latent modeling approaches Irene R. R. Lu, Ernest Kwan, D. Roland Thomas, Louise A. Heslop Carleton University, Canada; irene.lu@carleton.ca irene.lu@carleton.ca The assessment of consumer heterogeneity is essential for marketing segmentation in both profit and nonprofit organizations. We explore two methods that capture consumer heterogeneity within and between groups: latent class modeling and diagnostic classification modeling. The paper also discusses the advantages and limitations of the application of each method in marketing.
11:45am - 1:15pm	PA5: Clinical Assessment 2 Session Chair: Anna Barbara Słysz
KOL-G-204 (Ⅱ)
	The structure of thinking of novices and experienced diagnosticians - The report of the research Anna Barbara Słysz Adam Mickiewicz University, Poland; aslysz@amu.edu.pl aslysz@amu.edu.pl The aim of the presentation is to introduce the detailed analysis of the structure of thinking of novices and experienced diagnosticians. It will be performed on the example of case conceptualisation. The creation of such a conceptualization for psychological diagnostics is a complex thought process, which requires processing a wide range of data, formulating hypotheses about psychological onset and maintenance mechanisms of the client/patient's problem. A group of 30 psychotherapists served as subjects of the research. The presented research was planned in such a way so as to obtain a graphical representation of complex, cause-and-effect diagnostic inference. In order to study the structure of professional thinking, a complex diagnostic task (case conceptualisation) was employed. Footage was prepared showing a 40-minute conversation between a psychotherapist and a client. The diagnostic task of the psychotherapists consisted of categorising the client's statements and presenting relations between the categories. In order to analyse similarities between concept maps visualising the structure of diagnostic thinking, a dedicated software application was developed. The characteristics of the structure of diagnostician knowledge (e.g. coherence, complexity, relations between the individual elements of the structure) vary depending on the factors defining the professional profile of psychotherapists. Psychometric properties of Everyday-life Fatigue Questionnaire (EFQ) Joanna Urbańska Adam Mickiewicz University, Poland; joanna.urbanska@amu.edu.pl joanna.urbanska@amu.edu.pl The aim of this study is to investigate the psychometric properties of the Everyday-life Fatigue Questionnaire (EFQ; Urbańska, 2010). EFQ is a self-report inventory assessing everyday-life fatigue, constructed on the basis of theoretical principles of classical test theory (ERA/APA/NCME, 1999/2007). EFQ is a paper-and -pencil instrument with 24 items and three scales: subjective physical fatigue, subjective mental fatigue, and subjective social fatigue. The total sample consisted of 454 participants (adults), with ages from 24 to 85 (M = 60); 295 females and 159 males. Results of the study indicate that the reliability (the Cronbach's alpha) for the total scale was .89 and the 3 subscales demonstrated high reliability as well. EFQ has been also used in other studies by different researchers, yielding similar results. Good psychometric properties of the EFQ allow for the conclusion that it is a suitable instrument for the assessment of the everyday-life fatigue at adults. Moreover, the EFQ showed interesting statistically significant relationships with WHOQOL-bref and Fatigue Assessment Scale, especially in longitudinal studies. Do psychiatric symptoms diminish response quality to self-rated personality tests? Evidences from the PsyCoLaus study Marc Dupuis¹, Emanuele Meier¹, Caroline Vandeleur², Roland Capel¹ ¹University of Lausanne, Switzerland; ²Lausanne University Hospital, Switzerland; marc.dupuis@unil.ch marc.dupuis@unil.ch Our purpose of this study was to examine the relationships between psychiatric symptoms and the response quality to personality questionnaires. The study sample consisted of 1,981 participants from the Swiss cohort study “CoLaus PsyCoLaus” in Lausanne who completed both the NEO Five-Factor Inventory (NEO-FFI) and the Symptom Checklist 90 revised (SCL-90-R). Based on Gendre’s functional method, different indices measuring the quality of the entire set of responses to the NEO-FFI were calculated: response coherence, reliability, response level, variability, modality, normativity, positivity and negativity. Multiple linear regression analyses were performed to measure how much of the variance of such indices of response quality could be explained by the SCL-90-R factors. Determination coefficients ranging from 2.4% to 37.2% were measured for the response indices, indicating that some aspects of response quality are explained by psychiatric symptoms. Response normativity, positivity, and negativity were the indices most strongly associated with the SCL-90-R factors, while reliability was only related to paranoid and oppositional symptoms. Our findings suggest that an important part of the variance in response quality to self-rated questionnaires can be explained by the presence or absence of psychiatric symptoms. These findings call for further research in identifying populations unable to provide sufficiently valid responses to self-rated questionnaires. The PsicAP Project: A randomized controlled trial to improve psychological assessment and treatment with based-evidence psychological techniques of emotional disorders in Spanish primary care centers Antonio Cano Vindel², Roger Muñoz Navarro¹, Paloma Ruíz Rodriguez², Cristina Mae Wood², Benigna Díaz-Ovejero², Esperanza Dongil¹, Itziar Iruarrizaga², Mar García Moreno¹, Fernando Chacón³, Francisco Santolaya³, María Dolores Gómez Castillo³, Patricia Tomás Tomás¹, PsicAP Research Group³ ¹University of Valencia, Spain; ²University of Madrid, Spain; ³Spanish Council of Psychologists, Spain; roger.munoz@uv.es roger.munoz@uv.es Emotional disorders (ED), such as anxiety, mood, and somatoform disorders overwhelm existing resources in Spanish Primary Care (PC) centers. They are poorly detected and sparsely attended with adequate treatment, generating a higher use of health care services than physical illnesses. Other countries have provided Cognitive-Behavioral Therapy (CBT) programs to treat ED in PC demonstrating a high cost-effectiveness when compared to treatment as usual (TAU). The PsicAP Project is a pilot study that seeks to implement an evidence-based psychological group treatment protocol for ED in PC. A randomized controlled trial with two parallel groups will be conducted with a sample of 1126 participants: an experimental group (CBT) compared to a control group (TAU). Clinical symptoms, level of disability, quality of life, cognitive-emotional factors, treatment satisfaction, as well as data on attendance, drug use and other variables that reflect cost-effectiveness will be measured. Follow-up assessments will be completed at 3, 6, and 12 months. Also, the psychometric properties of the PHQ will be studied to improve the assessment of ED in Spanish PC. As in other countries, this treatment may help improve the mental health of these patients and reduce costs. Changes in personality functioning as a result of group psychotherapy with elements of individual psychotherapy in persons with neurotic and personality disorders – MMPI-2 Katarzyna Cyranka, Krzysztof Rutkowski, Michał Mielimąka, Jerzy A. Sobański, Łukasz Müldner-Nieckowski, Edyta Dembińska, Katarzyna Klasa, Bogna Smiatek-Mazgaj, Paweł Rodziński Jagiellonian University Medical College, Poland; katarzyna.cyranka@interia.pl katarzyna.cyranka@interia.pl This study is an analysis of group psychotherapy influence on the personality functioning of patients on treatment for neurotic disorders and selected personality disorders (F4-F6 under ICD-10). The study concerned 82 patients (61 women and 21 men) who underwent intensive short-term group psychotherapy in a day hospital. A comprehensive assessment of the patients’ personality functioning was carried out at the outset and the end of the psychotherapy utilising the MMPI-2 questionnaire. At the treatment outset the majority of the study patients demonstrated a considerable level of symptoms of disorders in five MMPI-2 clinical scales (Depression, Hysteria, Psychopathic Deviate, Psychastenia, Schizophrenia), and moderate pathology in Hypochondria. In the Mania scale most patients obtained results comparable to the healthy population when the treatment commenced. Having undergone the psychotherapy treatment, the majority of the examined were observed to demonstrate positive changes in those areas of personality functioning which were classified as severe or moderate pathology. Short-term intensive comprehensive group psychotherapy with elements of individual psychotherapy leads to desirable changes in personality functioning.
11:45am - 1:15pm	PA6: Assessment in Children and Adolescents Session Chair: David Gallardo-Pujol
KOL-G-209 (Ⅲ)
	Using formative tools: The validation of the Juvenile Victimization Questionnaire David Gallardo-Pujol, Georgina Guilera, Judit Abad, Noemí Pereda Universitat de Barcelona, Spain; david.gallardo@ub.edu david.gallardo@ub.edu The assessment of children and youth exposure to violence requires comprehensive instruments able to measure negative experiences in order to identify polyvictimization and understand its outcomes. The Juvenile Victimization Questionnaire (JVQ; Finkelhor, Hamby, Ormrod, & Turner, 2005) is a comprehensive assessment tool including five general areas of child and adolescent victimization, in its original version. The JVQ is a formative instrument, where the score of the module is just the sum or the count of the events of each type in its items. Formative instruments need a different validation methodology in comparison with reflective instruments. The purpose of this study is to present a validation of the JVQ. We collected a sample of 1105 adolescents (ranging from 12 to 17 years, mean age was 14.52); of those 590 were male, and 515 were female. We conducted a Principal Component Analysis in order to explore how items cluster together. Different solutions were explored, but one-component and nine-components solutions appear to be more in accordance with victimological theory, explaining between 9% and 39% of variance respectively. These aggregations of items are radically different to those suggested by Finkelhor et al. (2005). Either solution may shed light to explanatory processes in victimization and its relationships with later psychopathology. Explorations in prescholers resilience measurements Moshe Israelashvili Tel Aviv University, Israel; mosheil1@post.tau.ac.il mosheil1@post.tau.ac.il There is a growing awareness that is focused on resilience interventions among preschoolers. However, assessment of preschoolers' resilience, a mandatory step in developing and evaluation the utility of such programs, is complicated. This is due to the common use of either the parents or the kindergarten teaches as source of assessment, rather than direct measurement of the preschooler themselves. The proposed presentation will report on the development and implementation of two ways to directly assess preschooler resilience. One way is related to picture completion while the other way is related to semi-active story listening. These tools were administered and evaluated twice: one group consisted of 135 preschoolers, ages 4-5, 69% Arabs and 31% Jews, 70 boys and 65 girls, and their parents and kindergarten teachers. The second group consisted of 253 preschoolers, learning in four Jewish and four Arabs kindergartens; and their parents and teachers. Results of measurements conducted among preschoolers (Jews and Arabs) and its relationships to measurement conducted among their parents and teachers will be presented. The study findings support the notion that such a direct assessment of preschoolers' resilience is possible. Implications for personality assessment, program development and projects' evaluation will be discussed. Investigation of a structural model between desirable parental attitudes, adolescents’ self-esteem, vocational identity, and self-regulated learning Hyun-Jung Lee, Miri Yoon, Sehee Hong Korea University, South Korea; redredlee@naver.com redredlee@naver.com The objective of study is to verify the relationship between parental attitudes and self-regulated learning deeply through the structural model set self-esteem and vocational identity as mediation variables. The first and third wave of panel data from Korean Children and Youth Panel Survey (KCYPS) 2010 was used to find the relationship, and variables were Perceived Parental Attitudes by adolescents (Huh, 1999), Self-regulated Learning (Yang, 2000), Self-esteem (Rosenberg, 1965), and Vocational Identity (Holland, 1980). Participants were 2,259 students (1140 Males, 1119 Females) in 7th grade in 2010. Normal distribution of the data were proved by estimates of skewness and kurtosis. Model fits of the research model were satisfied. Parental attitudes have a positive impact on self-regulated learning through self-esteem and career-identity, so mediation effects on both related variables have verified. The result of the difference test between mediation effects which mediate self-esteem and vocational identity was not significant. That is, mediation effects of both were similar for the impacts of parental attitudes on self-regulated learning. The relationship between parental attitudes and self-regulated learning, double mediation effect was significant with self-esteem and vocational identity. The implication of this study is to present necessary grounds for school policy settlements by identifying impacts on self-regulated learning in adolescents. Measuring multidimensional parental self-efficacy of mothers and fathers of children aged 1.5 and 3 years Niina Johanna Junttila, Minna Aromaa, Päivi Rautava, Jorma Piha, Hannele Räihä University of Turku, Finland; niina.junttila@utu.fi niina.junttila@utu.fi Mothers' and fathers' parental self-efficacy (PSE) develops during the first years of parenthood. Later PSE has an important role in linking distinct parental, child, and situational factors. Cross-sectional links between parents' loneliness, depression, and PSE has been found, yet no longitudinal research on the early mechanisms of PSE exists.The first aim of this study was to validate a measurement scale (based on Parenting Tasks Index by Coleman & Karraker 2003) in order to evaluate Finnish mothers' (n=765) and fathers' (n=668) PSE. Second, we aimed to study whether parents' psychosocial ill-being (social/emotional loneliness, social phobia, and depression) and marital satisfaction during pregnancy and/or during toddlerhood predicts their PSE at child aged 1,5-years. Analyses were conducted using structural equation modeling. Based on validity analyses a five-factor model (Presence, Emotional support, Routines, Playing, and Teaching) of mothers' and fathers' PSE was confirmed. Evaluating psychosocial factors as a predictors for the PSE factors, we found that parents' ill-being predicted several factors of their PSE. That is, parents' psychosocial ill-being during pregnancy and toddlerhood predicted lower levels of their PSE factors at child aged 1,5- and 3 years. Overall, 25 to 34 percent of PSE was explained by prior (22/18 months) psychosocial ill-being. Skin conductance and sinus arrhythmia as a complement of psychological assessments during parent-infant interactions Catarina Tojal¹, Raquel Costa¹, Iva Tendais² ¹Universidade Europeia \| Laureate International Universities, Portugal; ²Universidade de Trás-os-Montes e Alto Douro, Portugal; catarinatojalrebelo@gmail.com catarinatojalrebelo@gmail.com The impact of the quality of early interactions on infant developmental outcomes is well reported in the literature. It is well established that parental behaviors, such as sensitivity and responsivity, are crucial for the quality of the interaction. Understanding the physiological mechanisms beneath adequate/inadequate parental behaviors during these interactions may be crucial for psychological assessment and early intervention. We intend to examine the underlying physiological mechanisms associated with the quality of parental behaviors during dyadic interaction at 6 weeks and 6 months of age. A sample of 37 parents and infants participated in this study. Parental skin conductance (SC) and respiratory sinus arrhythmia (RSA) were monitored during a face-to-face (FTF) still-face (ST) procedure (FTF1 - SF – FTF2). Interactions were recorded according to the Global Rating Scales9 protocol. During FTF1, parental non-intrusive behavior is associated with lower RSA (r=-.433, p=.039). Parental non-remote behavior is associated with higher SC during the SF episode (r=.364, p=.048). Lower excited engagement is associated with higher SC (r=-.367, p=.046) during FTF2. The physiological reactions during early interaction may be an important complementary tool for the psychological assessment of parental behaviors.
11:45am - 1:15pm	PA7: Meaning in Life and Well-Being Session Chair: Shulamith Kreitler
KOL-G-221 (Ⅴ)
	The Children's Quality of Life Questionnaire Shulamith Kreitler^1,2, Michal M. Kreitler², Yasmin Alkalay² ¹Sheba Medical Center Tel Hashomer, Israel; ²Tel-Aviv University, Israel; krit@netvision.net.il krit@netvision.net.il The purpose of this study was to test a measurement model of a new measure of quality of life for children and adolescents. The sample (n=3574) included males (n=1295) and females (n=1387); four age groups: 6-8 years (n=223), 9-11 years (n=1008), 12-14 years (n=949), 15 -18 years (n=539); inhabitants of small towns (n=1213), medium cities (n=1008) and big cities (n=1122; and ethnic groups (Jewish=2734; Arab=840). The Children's Quality of Life Questionnaire, administered to all participants, included 53 items, each with three response alternatives, assessing 15 scales: Functioning at school, Social functioning, Cognitive functioning, Functioning in the family, Physical state, Worries, Basic needs, Body image, Sense of Mastery, Self image, Negative feelings, Stress, Positive feelings, Fun, and Motivation. The results showed the validity of the measurement model in the Jewish sample: Chi sqare=646.605, df=75, p<.0001, chi square/DF=8.621, CFI=.988, NFI=.986, TLI=.983, RAMSEA=.053. It was confirmed in the ethnic, gender, age and residential subsamples. The scales differed significantly in rank order and standardized regression weights in the subsamples. Accordingly, the measurement model validates the questionnaire structure, and demonstrates its stability across subsamples defined in demographic terms, and is still sensitive enough to present differences in the standardized regression weights of the subsamples. The Meaning in Life Questionnaire: Psychometric properties of the Italian version Luca Negri, Antonella Delle Fave University of Milano, Italy; luca.negri@unimi.it luca.negri@unimi.it Among the instruments developed to assess meaning in life as a major component of well-being, the Meaning in Life Questionnaire (MLQ) measures both perceived presence of and search for meaning. The psychometric properties of the Italian version of MLQ were investigated. The moderating effect of resilient personality on the relationship between search for meaning and subjective well-being was explored. A group of 464 Italian volunteers aged 20-60 (M=39.34; SD=10.86; 254 female and 210 male) were administered MLQ, Big Five Inventory, Satisfaction with Life Scale, Positive and Negative Affect Schedule, and Short Form Health Survey. The MLQ structure was investigated through exploratory and confirmatory factor analyses after sample split. Internal reliability and construct validity were evaluated. Hierarchical regression analyses were employed to test the moderating role of personality. Results replicated the original MLQ two factor structure (S-Bχ2(34)=65.24, p<.01; SRMR=.059; CFI=.969; TLI=.959; RMSEA=.063, 90% CI=.039/.086, p=ns), also highlighting the instrument reliability in the Italian context. Search for meaning was associated with higher levels of negative affect and lower life satisfaction among non-resilient participants, but not among resilient ones. These findings provide a novel contribution to the understanding of the relationship between meaning and well-being. Rasch analysis of the Meaning in Life Questionnaire: Findings from three countries Lusilda Schutte¹, Marie P. Wissing¹, Suria M. Ellis¹, Paul E. Jose², Dianne A. Vella-Brodrick³ ¹North-West University, South Africa; ²Victoria University of Wellington, New Zealand; ³University of Melbourne, Australia; lusilda.schutte@nwu.ac.za lusilda.schutte@nwu.ac.za Meaning in life has recently been positioned as a flagship indicator of well-being. The Meaning in Life Questionnaire operationalizes Steger et al.’s (2009) model of meaning in life, and involves two independent 5-item subscales: Presence and Search for Meaning. Respondents rate their degree of agreement with each item on a 7-point Likert scale, ranging from 1 (absolutely untrue) to 7 (absolutely true). The sample (n =635) contained participants from Australia, New Zealand, and South Africa who completed the English version of the scale. The data were analysed with Winsteps 3.81 and RUMM2030 using the Rasch rating scale model. For the Presence subscale, the only reversed-phrased item in the scale (“My life has no clear purpose”) was highlighted for possible removal, as it displayed misfit, deviation from unidimensionality, and differential item functioning among the countries. Addition of more difficult to endorse items was suggested. The items of the Search subscale was sufficiently unidimensional, locally independent and well-targeted for the sample. For both subscales, participants did not distinguish reliably among the seven response categories and a revised 4- or 5-point rating scale is suggested. Theoretical implications for understanding meaning in life and recommendations for future use of the scale are suggested. Short scales for the assessment of accomplishment and positive relationships: Initial validation and correlative and experimental evidence for their association with well-being Fabian Gander, René T. Proyer, Willibald Ruch University of Zurich, Switzerland; f.gander@psychologie.uzh.ch f.gander@psychologie.uzh.ch In his Authentic Happiness Theory, Seligman (2002) proposed three basic orientations that lead to happiness: The life of pleasure, the life of engagement, and the life of meaning. The Orientations to Happiness Questionnaire (Peterson, Park, & Seligman, 2005) has been developed as a subjective measure for these three orientations. In 2011, Seligman revised his theory and added two new components; positive relationships and accomplishment. In three studies, we examine the construction and initial validation of two short scales for the assessment of the endorsement of positive relationships and accomplishment. Study 1 describes the scale construction and provides evidence for the factorial, convergent, and divergent validity in three samples (n = 233, n = 336, and n = 125). Study 2 showed that the new scales have high test-retest reliabilities over a period of 1, 3, and 6 months (r = .68 - .78). Study 3 examines the malleability of positive relationships and accomplishment in an intervention study that uses the two new scales as dependent measures. Overall, the three studies show that the new scales have satisfactory psychometric properties and possible applications are discussed. Measuring well-being at school: Results of an initial investigation with daily diaries Lisa Wagner, Willibald Ruch University of Zurich, Switzerland; l.wagner@psychologie.uzh.ch l.wagner@psychologie.uzh.ch Recently, there has been an increased focus on measuring childrens's and adolescents’ well-being at school, both as an important outcome as well as a factor that may influence school achievement. One widely accepted conceptualization of well-being at school encompasses both a cognitive (domain-specific satisfaction) and an affective component (domain-specific positive and negative affect, cf. Long, Huebner, Wedell, & Hills, 2012). In the present study, we investigate the psychometric properties of a daily measurement of well-being at school. A sample of 183 secondary school students (mean age = 14.3 years; 100 females, 83 males) completed measures of habitual school satisfaction (respective scale of the MSLSS, Huebner, 1994) and habitual positive (PA) and negative affect (NA) at school (modified from PANAVA-KS, Schallberger, 2005) as well as measures of school satisfaction and PA/NA at school that were adapted for measuring daily experiences. On five consecutive days, PA and NA were assessed each morning and each afternoon, and school satisfaction each afternoon. We will present results on the factor structure of the daily measurements, their reliability, their variability within and between persons, as well as within and between classrooms, and on their relationships with habitual well-being at school. Implications for further research will be discussed.
1:15pm - 2:30pm	Lunch
Cafeteria	Lunch
2:30pm - 3:30pm	K2: Assessment of Personality Disorders in DSM-5 Session Chair: Daniel Leising Robert Krueger (University of Minnesota, USA)
KO2-F-180 (Ⅵ)
3:30pm - 4:30pm	PO1: Poster Session 1 (Including Coffee Break)
Lichthof/Atrium	PO1: Poster Session 1 (Including Coffee Break)
	A qualitative assessment of parents’ perceptions of sexuality education Carina Sobral Parente^1,2, Maria Carmo Cunha², Luisa Ramos Santos² ¹GAF - Community Organization, Portugal; ²Polytechnic Institute of Viana do Castelo, Portugal; psicologa@carinaparente.pt psicologa@carinaparente.pt The aim of this study was to understand pre-school parents’ perspectives on health promotion concerning the affective and sexual education. A focus group was used as a methodology in order to know more about parents’ needs in this area but also to acknowledge their acceptance of this kind of program to their children. Twenty four focus group sessions were conducted within this qualitative study. The sample was 24 parents of pre-school children (23 female, 1 male), and the mean age was 34 years. It used a discourse analysis method to collect data. This analysis presented the following 6 general categories: body’s health and care; family and citizenship; body as a source of pleasure; health and diseases; risk and protection factors; and values, rights, and responsibilities. We can conclude that speeches typify traditional cultural values connected with sexuality. The majority of the participants assumed to have insufficient knowledge and skills in affective and sexual education of their children. They suggested parental and teacher training programs in order to improve their knowledge and skills and to break through some educational and psychological barriers they encounter. The findings of this study will be used to develop a parents’ health education program for parents of pre-school children. Evaluating career adaptability in a sample of Brazilian university students Marucia Patta Bardagi¹, Marco Antonio Teixeira² ¹Federal University of Santa Catarina, Brazil; ²Federal University of Rio Grande do Sul, Brazil; marucia.bardagi@gmail.com marucia.bardagi@gmail.com Career adaptability refers to the ability of individuals to manage their careers considering the changes and uncertainty that characterize the world of work. Mark Savickas' career adaptability model establishes four dimensions: concern, control, curiosity, and confidence. Research on adaptability in different international population groups has shown that adaptability is useful and important to describe individual differences in career development and outcomes. This study investigated career adaptability in a sample of 467 Brazilian university students (mean age of 23.3 years; 325 women, and 142 men). There was a statistically significant gender difference in concern, with women having higher scores than men. Worker students obtained statistically significant higher scores than others in control and confidence. These results suggest that women tend to anticipate and plan their career paths more than men, which may reflect a more careful attitude in career managing or a possible perception of barriers that require coping strategies. The results also indicate that concrete experience in the workplace is associated with proactive attitudes and confidence. Those are aspects to be more specifically investigated in future research and to be considered in career interventions. Assessing the relationships of leisure activities’ characteristics with personality and subjective well-being in European adolescents Lisa Wagner¹, Daniela Conrad², Najdana Gajić³, Ondřej Kácha⁴, Katarina Martinović⁵, Aleksandrina Skvortsova⁶, Lotte van Doeselaar⁷, Darja Voitenko⁸ ¹University of Zurich, Switzerland; ²Ulm University, Germany; ³University of Belgrade, Serbia; ⁴Masaryk University, Czech Republic; ⁵University of Zagreb, Croatia; ⁶Leiden University, The Netherlands; ⁷Utrecht University, The Netherlands; ⁸Vilnius University, Lithuania; l.wagner@psychologie.uzh.ch l.wagner@psychologie.uzh.ch Research on associations between leisure activities and adolescents’ subjective well-being has yielded mixed results. Whereas previous studies mainly focused on specific activities and ignored their strong dependence on culture, the present study examined global characteristics of leisure activities (structure, effort, social contact, cf. Bradley & Inglis, 2012) and their relationship with personality and subjective well-being in a cross-cultural study. A sample of around 600 adolescents from six European countries (Czech Republic, Croatia, Estonia, Lithuania, Switzerland, and the Netherlands; mean age = 16.3 years) completed online questionnaires measuring the characteristics of their four preferred leisure activities, subjective well-being (life satisfaction, positive, and negative affect), and personality. Results showed that (1) social contact was positively associated with subjective well-being, (2) structure and effort were positively associated with positive affect, (3) relationships with personality traits were mostly as expected (e.g., social contact was positively related to extraversion and agreeableness), and (4) social contact predicted higher life satisfaction and structure predicted higher positive affect beyond the influences of demographic variables (age, gender, and parents’ education) and personality traits. We conclude that assessing social contact, structure, and effort of leisure activities, instead of focusing on one dimension or on specific activities, proved to be a valuable approach, especially in cross-cultural research. Longitudinal changes in infants’ difficult temperament: The effects of gender and maternal psychological characteristics Soyoung Kim, Sehee Hong Korea University, South Korea; soyoungkim.ems@gmail.com soyoungkim.ems@gmail.com The purposes of this study were to investigate longitudinal changes in difficult temperament during infancy and to test the effects of predictors of difficult temperament. To accomplish these purposes, a multi level growth model was applied to Panel Study on Korean Children (PSKC) data. To predict longitudinal changes, five predictors were considered: maternal self esteem, parenting stress, reactive parenting style, postpartum depression, and infant’s gender. The results showed that infants' difficult temperament increased significantly. In addition, individual differences were also significant. Postpartum depression had a positive effect on the initial status of newborn baby, and a negative effect on the rate of change. Self esteem had a negative effect on the initial status, and a positive effect on rate of change. The interaction of mother's reactive parenting style and self esteem had a significant effect on each time point of the infant's difficult temperament. These findings have implications on the importance of managing postpartum depression, and can emphasis on the reactive parenting style in the parent education program. Adaptation of the Frost’s Mutidimensional Perfectionism Scale (FMPS) in Spanish children and adolescents Antonio Godoy, Raquel Nogueira, Sara Reyes, Aurora Gavino University of Malaga, Spain; godoy@uma.es godoy@uma.es There is ample empirical evidence that perfectionism plays an important role in the emergence and maintenance of various types of psychological problems-including depression and anxiety-in children, adolescents, and adults. One of the instruments most used to evaluate perfectionism is the Frost’s Multidimensional Perfectionism Scale (FMPS). However this scale has been scarcely used in children and adolescents. The present study examines the structural invariance of FMPS scores among girls and boys, and among children and adolescents using structural equations modeling. FMPS scores reliability and validity are also explored. Participants were 1648 students (791 male and 857 female) of primary and secondary education 10-17 year old (mean = 13.24; Standard deviation = 1.87). FMPS scores showed measurement and structural equivalence in boys and girls, as well as in children and adolescents. All six scales presented good internal consistency and test-retest reliability. FMPS scores significantly correlated with the scores of theoretically associated constructs, such as obsessional dysfunctional beliefs, and, moderately, with measures of anxiety and depression symptoms. Normative data of the six subscales and the total score for Spanish children and adolescents are presented. The FMPS is a reliable and valid scale for the assessment of several dimensions of perfectionism in young people. The Brixton Spacial Anticipation Test: A normative study in a schizophrenia sample Micaela Moro¹, María Sigrid Gallego¹, Esther Lorente-Rovira² ¹Universitat Jaume I, Spain; ²Hospital Clinico de Valencia, Spain; moro@psb.uji.es moro@psb.uji.es The Brixton Spacial Anticipation Test measures the capacity of inducing rules and shifting them, considered as a component of the executive functions. Recently it has been used as a component of the neuropsychological assessment of disorders as stroke, brain injury, and eating disorders, among others. However, its use is rare in schizophrenia, a disorder in which executive functioning are considered as a core symptom of the disorder and having a high impact in individual functioning and social participation. In this study, normative data for the Brixton Spacial Anticipation Test are presented for 110 patients with schizophrenia (58 female and 52 male) and 60 controls (33 female and 27 male). Performance on the Brixton Test was significantly lower in the clinical simple and in male controls. Based on these results, the norms were presented separately by sex and sample. The normative data provided in this study could be useful both in descriptive studies of executive function in schizophrenia and for clinicians, as a mesure of the results of cognitive remediation interventions. A hierarchical factor analysis of the Five Facet Mindfulness Questionnaire Dandan Pang, Willibald Ruch University of Zurich, Switzerland; d.pang@psychologie.uzh.ch d.pang@psychologie.uzh.ch Despite several validated questionnaires, the factor structure of mindfulness remains unclear (i.e., solutions from one to five facets have been reported). Derived from the items of several scales, the Five Facet Mindfulness Questionnaire (FFMQ; Baer et al., 2006) offers a good way to study the components of mindfulness. The present study aims to investigate the structure of the FFMQ in a German-speaking sample. Altogether 166 psychology students (age: 19-58 years, M = 23.0, SD = 4.2; 30 male, 136 female) completed a paper and pencil version of the German language questionnaire (Zarbock et al., 2010). A hierarchical factor analysis (Goldberg, 2006) was employed to show how the factors unfold between one and six factors. The first unrotated principal component split up into two and then three factors. While the second (“describing”) and third factor (“non-judging”) stayed unchanged in following factor solutions, the first factor split into “observing” (stayed unchanged) and “non-reactively attending”. The latter factor then split up into “non-reacting” and “awareness” (both stayed unchanged). Thus, a five-factor-solution seemed most preferable. These results will be compared with one replication sample of adults with and without meditation experience, for which data collection is still ongoing. Reliability and further validity information will be provided. New perspectives in Mindfulness self-report assessment: Factor structure and Item Response Theory (IRT) analysis of the Five Facets of Mindfulness Questionnaire (FFMQ) Oscar Lecuona, Raquel Rodríguez-Carvajal, Eduardo García-Garzón, Alberto Angosto, Carlos García-Rubio Universidad Autónoma de Madrid, Spain; oscar.lecuona@hotmail.com oscar.lecuona@hotmail.com Self-report measures for Mindfulness have been recently flourished within the increase of mindfulness-related contributions. One of the most popular and recent ones is the Five Facets Mindfulness Questionnaire (FFMQ). Despite its popularity, literature contributions point out improvement areas for the FFMQ: (1) EFA and CFA based on item parceling, which has been shown to be a misleading technique, (2) uncertain model fit, (3) item redundancy, or (4) alternative factor structures. These aspects, along with the author’s interpretation, demand deeper psychometric research on this issue. In this study, a summary of factor structures proposed for the FFMQ is provided, along with a replication on a Spanish sample with a 7 factor solution (using EFA and ESEM). Then, a model comparison was developed to choose the best model, based on which one provided a better fit. When the best factor structure was selected, an item response theory (IRT) model is proposed, based on Graded Response Model (GRM). Recommendations for future research include the need for developing new theoretical and empirical models of mindfulness in order to improve the FFMQ’s factor structure prior to performing more advanced psychometric models and analysis. Emotional Intelligence – Validation of a new ability-based measure Dalit Lev-Arey Margalit¹, Yossi Hasson² ¹Tel Aviv-Yaffo Academic College, Israel; ²The Hebrew University of Jerusalem, Israel; Dalitlev@mta.ac.il Dalitlev@mta.ac.il During the last two decades, there has been a surge of interest in the construct of the concept of Emotional Intelligence (EI) and in ways of measuring it. The construct, defined as the ability to monitor one's own and other peoples' emotions and to use emotional information to guide thinking and behavior, is said to influence various important aspects on peoples' life (e.g., job performance, social relationships, and learning). The two most popular measures of the construct, the MSCEIT (Mayer, Salovey & Caruso, 2002) and EQ-i (Bar-On, 1997), have received criticism for using a self-report format (i.e., EQ-i), focusing on theoretical knowledge of emotions (i.e., MSCEIT, EQ-i), and ignoring the social context of the questions (i.e., MSCEIT, EQ-i). The current study presents a new ability-based measure of the EI, aimed at overcoming some of the aforementioned challenges of the MSCEIT and EQ-i. This measure, Emotional Social Intelligence Test (ESIT), presents people with short, 30-second open-ended video clips of social and emotional dilemmas, and ask for their cognitive, emotional, and behavioral response to the situations. Additionally, this paper presents results from lab and field studies, which show an incremental validity of the ESIT beyond cognitive ability, personality, and other EI measures. Investigating Big-Fish-Little-Pond Effect on students’ self-concept of learning mathematics with eighth-grade Taiwanese students Liang-Ting Tsai, Chih-Chien Yang National Taichung University of Education, Republic of China (Taiwan); liangting.tsai@gmail.com liangting.tsai@gmail.com The purposes of this study were examine the Big-Fish-Little-Pond Effect (BFLPE) of mathematics achievement on the scores in the Students Like Learning Mathematics (SLM), Students Value Mathematics (SVM), and Students Confident in Mathematics (SCM) scales of Taiwanese eighth-grade students. This will be done by using the Trends in International Mathematics and Science Study (TIMSS) 2011 data. The sample comprised 5,042 Taiwanese eighth-grade students from 150 schools that participated in the TIMSS 2011. Results from a 2-level hierarchical linear modeling analysis showed that the students’ individual mathematics achievement was significantly positively associated with SLM, SVM, and SCM scores. In contrast, school-average mathematics achievement was a significant negative predictor of these three variables (SLM,SVM, and SCM). Furthermore, the results indicated that students high in achievement experienced a greater BFLPE on SLM and SCM scores than did students with low achievement. Implications of the findings and suggestions for future study are discussed. Psychometric properties of a Russian version of the Alabama Parenting Questionnaire-Brief Form Svetlana V. Loginova¹, Helena R. Slobodskaya¹, Helena A. Kozlova¹, Natalia A. Fedorova² ¹Federal State Budgetary Scientific Institution “Scientific Research Institute of Physiology and Basic Medicine”, Russia; ²Novosibirsk State Medical University, Russia; loginovasvv@gmail.com loginovasvv@gmail.com The study proposes validation of the Russian version of the Alabama Parenting Questionnaire - Brief Form (APQ- BF) for parent report in a sample of 298 children aged between 6 and 17 years. Exploratory and confirmatory factor analysis confirmed the five factor model, the scales demonstrated satisfactory internal consistency. Correlations with lower and higher order personality traits measured by parent reported Inventory of Child Individual Differences-Short version (ICID-S) and mental health problems measured by the Strengths and Difficulties Questionnaire (SDQ) supported validity of the Russian version of the APQ- BF. Gender and age differences supported the discriminative validity of the APQ- BF scales. Russian version of the APQ-BF has been shown to be a reliable and valid measure and is recommended for studying parenting practices in families of school age Russian children. Development of Depressive Symptoms Intensity Survey (LIS-D): Preliminary analysis Vinícius Renato Thomé Ferreira IMED/Faculdade Meridional, Brazil; vthome2@gmail.com vthome2@gmail.com Depressive symptoms produce great personal losses, and an accurate evaluation of the symptoms is critical. There are few instruments constructed and adapted for the assessment of depressive symptoms in Brazil, and virtually none to consider the cognitive triad of depression. The objective was to build an instrument to assess depressive symptoms and the cognitive triad, called Depressive Symptoms Intensity Survey (in Portuguese, LIS-D). It was built and is being validated considering International Test Commission (ITC) quality criteria. LIS-D consists of 42 questions on a Likert 5-point scale, where the highest score is the more severe depressive symptoms. Initial studies with the participation of 449 respondents from clinical and non-clinical population aged between 18 and 84 years showed moderate correlation with the Beck Depression Inventory (BDI) adapted for the Brazilian population (Spearman ρ=0.67, p≤0.0001), and a Cronbach's alpha=0.957. In addition, there was identified seven factors (depressive feelings, depressive behavior, loss of energy, concentration, somatic symptoms, family relationship, and suicidal ideation) by factor analysis using Varimax rotation (KMO=0.956, p≤0.001). The next step is to assess LIS-D items using IRT, which will allow evaluating each question answered by the patient to obtain a more accurate design of the symptoms scores. Esteem towards adolescents: Psychometric characteristics of the Esteem Scale Angela Sorgente¹, Semira Tagliabue², Margherita Lanz¹ ¹Catholic University of Milan, Italy; ²Catholic University of Brescia, Italy; angela.sorgente@unicatt.it angela.sorgente@unicatt.it A modified version of the Self-Esteem Scale (Rosenberg, 1965) was applied to measure how much adults esteem adolescents. Participants were adolescents ( n= 1332; 786 females, 546 males; age M=17.00; SD =1.46) who were asked to fill in the Esteem Scale perceived both from parents and one other significant adult. Two equivalent samples were selected from the whole sample (explorative and validation samples). They did not differ regarding gender, age, and geographical region. On the explorative sample, EFA on polychoric matrix and WLS extraction method showed a two factors structure. The same structure was tested on both the parents (χ2(34)=129.10, p<.001; CFI = .97; RMSEA = .06 (.05-.08)), and the significant adult (χ2(34)=129.55, p<.001; CFI = .96; RMSEA = .06 (.05-.08)) versions, showing acceptable fit indexes. Multigroup analyses on non-independent groups conducted on both explorative and validation samples showed a partial measurement invariance of the two versions (Δ χ2(baseline models;partial invariance model)(3)= 1.52 - 4.56, p>.05): three items were found to be invariant while seven resulted non-invariant. The partial invariance was due to higher factor loadings in parents’ version than in significant adults’ version, although they were all higher than .50. In conclusion, the instrument is suitable to be used for measuring esteem in adult-adolescent relationship. Large-scale assessment of infants' competencies: Validity of the SUF-provided data of the German National Educational Panel Study (NEPS) Jan-David Freund Otto-Friedrich-University Bamberg, Germany; jan-david.freund@uni-bamberg.de jan-david.freund@uni-bamberg.de Even though there is agreed upon existence and importance of predictive indicators of the later development of competencies in infancy, there is a lack of representative panel studies which deal with causalities and the development of interindividual differences. Field inquiries ensure a strong external validity, but on the other hand merely controllable test conditions are a challenge for assessments designed for laboratories. Because of funding and sample representativity, the measurement of infant competencies in existing birth cohort studies is usually based on information provided by the parents. The birth cohort of the German National Educational Panel Study (NEPS) additionally gathers data from laptop-driven direct measurement in the home environment of 3.500 representatively sampled children. For example, in the first panel wave, when infants where 6 to 8 months old, the NEPS-assessment contained two habituation-based categorization-tasks and a semi-structured toy-play situation. The data is provided to the scientific community via scientific use file (SUF). On the one hand the poster will provide background information and an overview on the rich available data; on the other hand it will deal with the results of an evaluation study about the effects of early temperament on the assessments and their validity. Development of scales for assessing well-being of preschool and primary school children Irina V. Toporkova¹, Helena R. Slobodskaya¹, Evgeniya N. Petrenko¹, Svetlana V. Loginova¹, Olga S. Kornienko² ¹Federal State Budgetary Scientific Institution “Scientific Research Institute of Physiology and Basic Medicine”, Russia; ²Novosibirsk State University, Russia; irina.toporkova@gmail.com irina.toporkova@gmail.com This questionnaire for parents was based on recent research on the assessment of child well-being. It includes measures of objective child well-being: affluence, physical health, and mental health. These factors were measured by the Strengths and Difficulties Questionnaire, family relationships, and two scales to measure the structure of child activities and involvement of family members in the child's life. Subjective child well-being was evaluated by the Huebner Students’ Life Satisfaction Scale for 8-10 year-olds. The scale ‘Structure of Child Activities’ asked to indicate the number of hours during the day/week that the child spends at the following activites; playing or walking outdoors, communicating with children, playing games on computer, watching TV, studying or playing alone, playing with pets, video/audio communication, sport sections, hobby clubs, individual developing classes, cinema, and concerts. The scale describes the variety of the child's chores and leisure activities. The scale ‘Time together with family members’ asked about the following activities: playing, reading, watching TV together, doing housework, having mealtime, doing sports, going to shops and/or other places, walking outdoors, going to cinema, and going on vacation. This scale reflects involvement of parents and other family members in the child's life and supplements the scale about the structure of the child’s activities. Assessment of parents’ implicit theories of intelligence: Evidence from Classic Test Theory and Rasch Modelling approaches Michelle Hood, Peter A. Creed Griffith University, Australia; michelle.hood@griffith.edu.au michelle.hood@griffith.edu.au An individual’s implicit theories of intelligence refers to their “implicit conception about the nature of ability” (Dweck & Legget, 1988; p. 262), which can be incremental (belief that intelligence is malleable) or entity (belief that intelligence is fixed). Dweck et al. (1995) argued that this is a unitary construct with incremental and entity beliefs as polar opposites. Therefore, scales typically use 3-4 items and a 6-point Likert-type response format to assess one direction (e.g., incremental belief). However, confirmatory factor analytic (CFA) studies support a 2-factor model, with the factors weakly negatively correlated (r = -.18; Ilhan & Çetin, 2013). We used CFA (1- versus 2-factor model fit) and Rasch modelling (via RUMM 2030, Andrich et al., 2010; Rasch, 1960) to assess overall model and individual item fit of 2 measures of parents’ incremental and entity beliefs (Dweck, 2000; Lynott & Woolfolk’s, 1994, Nature of Intelligence scale), and provide evidence for construct validity by examining correlations with learning and performance goal orientations. Participants were 371 parents (335% mothers) of primary-school children (aged 5-12 years; 163 girls) from South-east Queensland, Australia. Results support the assessment of separate, but related, sub-constructs of entity and incremental beliefs. Work orientations in adolescence as moderators of the influence of financial satisfaction on overall satisfaction in middle adulthood Kai Schudel University of Zurich, Switzerland; kai.schudel@ife.uzh.ch kai.schudel@ife.uzh.ch The submitted paper examines the long-term interactional effect of intrinsic and extrinsic work orientations on the regression of overall satisfaction on financial satisfaction. The question whether income buys happiness, has driven happiness researchfor some time. Various approaches – such as domain importance weighting (Hsieh, 2014; Cummins, 1997; Ferrans & Powers, 1985), the Aspiration Index (Kasser & Ryan, 1993) and the values as moderators perspective (Oishi, Diener, Suh, & Luca, 1999) - claim that life domains can individually differ in their relevance. Furthermore, intrinsic and extrinsic work orientations value different aspects of the work domain: work itself as fulfilment and work as a means to financial reward (Malka & Chatman, 2003). Therefore, the paper examines whether the influence - thus, the relevance - of financial satisfaction on overall satisfaction differs depending on the degree of intrinsic and extrinsic work orientation. A longitudinal moderator analysis was conducted on the unprecedented data of a representative German sample of 1600 participants accompanied over 30 years within the LifE study (Fend et al., 2012). No moderating effects concerning extrinsic work orientation have been found for women or men. However, intrinsic work orientation in adolescence diminishes the influence of financial satisfaction on overall satisfaction 30 years later (ΔR2=.036***) for men. Changes in ego strength in patients with neurotic and personality disorders treated with a short-term comprehensive psychodynamic psychotherapy Katarzyna Cyranka, Krzysztof Rutkowski, Michał Mielimąka, Jerzy A. Sobański, Łukasz Müldner-Nieckowski, Edyta Dembińska, Katarzyna Klasa, Bogna Smiatek-Mazgaj, Paweł Rodziński Jagiellonian University Medical College, Poland; katarzyna.cyranka@interia.pl katarzyna.cyranka@interia.pl This study involves the analysis of changes in ego strength in the course of group psychotherapy in patients treated with neurotic and selected personality disorders (F40-F61, ICD-10). 82 patients (61 women and 21 men) participated in the study. They underwent intensive short-term group psychotherapy treatment in a Day Hospital for the neurotic and behavioural disorders. The assessment of the patients’ personality functioning was carried out at the onset and the end of the psychotherapy. The assessment was reported as a value on the ego strength scale by means of the MMPI-2 questionnaire.The majority of the patients demonstrated a considerable increase in the ego strength as a result of the psychotherapy treatment. Short-term intensive comprehensive group psychotherapy with elements of individual psychotherapy results in obtaining the desired changes in the personality functioning manifested through the increase in ego strength. Students’ well-being at school: A review of existing measures Charlotte Larson¹, Lisa Wagner², Willibald Ruch² ¹Rice University, USA; ²University of Zurich, Switzerland; crl6@rice.edu crl6@rice.edu In the past 30 years, the field of psychology has enjoyed a notable increase in interest in the area of well-being at school. Recent research supports the notion that students’ well-being at school not only has important implications for educational outcomes, but also for children’s and adolescents’ overall well-being and social functioning. Several scales have been constructed and validated that specifically measure the individual student’s well-being in the context of schooling. However, these measures have not been systematically compared yet in terms of content and psychometrics. For several constructs related to well-being at school (e.g., school engagement), it has been noted that different measures assess very different contents. As a consequence, this study aims at reviewing the content areas and theoretical models that are covered in the scales targeting students in primary and secondary education, as well as the scales’ psychometric properties. Implications for the use of the measures and future research will additionally be discussed. Psychometric properties of video games involving thinking Félix Cuneo University of Lausanne, Switzerland; felix.cuneo@unil.ch felix.cuneo@unil.ch The purpose of this poster is to present the theoretical points regarding our research project. According to the commonly used CHC model (Carroll, 1993), one can conclude that intelligence is stratified. The general intelligence factor is constructed by the correlation between second order broad abilities, which are also constructed by the correlation between narrow abilities. Regarding video games involving thinking, one could argue that there is correlation between different games. Therefore, a latent factor should exist that characterize the performance on different types of games. If there is a ‘model of games’ which is stratified as in the CHC, video games could be used to assess aptitudes. More importantly, since video games can be programmed, it is possible to implement some unique measures, such as the strategy to solve a problem. Applicability of Signal Detection Theory (SDT) models to multiple-choice exams with a polytomous response format Sören Much Martin-Luther-Universität Halle-Wittenberg, Germany; soeren.much@psych.uni-halle.de soeren.much@psych.uni-halle.de Multiple-choice (MC) tests are widely used for educational assessment. It is common practice to use the number of correct answers as a measure of students' ability although psychometric requirements are often not met or not testable. In an alternative approach, a 4-point confidence rating scale for multiple true-false questions was used in a real-life exam for 55 students. Their performance was scored with dichotomized sum scores (number correct), polytomous sum scores, and Signal Detection Theory (SDT) measures. These performance estimates were compared regarding their correlation with scores from constructed-response (CR) and single-choice (SC) questions from the same exam. Furthermore, three methods of dealing with omissions (ignore, score as incorrect, and assign a random response) were examined. For all performance estimates, scoring omissions as incorrect yielded highest, but medium-sized correlations with scores from CR and SC questions. Validated with CR scores, SDT estimates yielded slightly higher correlations than both sum scoring methods. Validated with SC scores, polytomous scoring yielded slightly higher correlations than both other methods. SDT models showed very good fit and are a reasonable alternative for scoring MC exams, providing valid results. Nevertheless, all presented methods make strong assumptions that need to be carefully considered by examiners. Evaluation of the therapeutic outcome through psychological assessment Katerina Kaliakatsou, Christina Terlidou, Hara Haritaki, Ioannis K. Tsegos Open Psychotherapy Centre, Greece; info@opc.gr info@opc.gr This study attempts to evaluate personality changes after patients' successful completion of long-term group analytic psychotherapy. Test-retest method was applied in 163 patients, who were assessed by the MMPI test and the Rorschach projective technique before the beginning of their therapy and six months after the completion of their therapy. The analysis of data indicates that group analytic treatment appears to have an impact on functional and structural dimensions of personality. More specifically, a significant decrease of clinical symptomatology, improved social adaptation, more controlled and adjusted emotional expressions, maturity of internalized representations, and ability to establish and maintain personal relationships are observed. Psychometric properties of the Expectancy Questionnaire (EQ) about alcohol effects in Spanish adults Laura Mezquita, Laura Camacho, Micaela Moro, María Sigrid Gallego, Manuel Ignacio Ibañez, Generos Ortet Universitat Jaume I, Spain; moro@psb.uji.es moro@psb.uji.es Expectancies about the effects of alcohol are specific cognitive variables related to alcohol use and misuse. Among the available questionnaires to assess alcohol expectancies, the EQ is one of the most widely used. However, the psychometric properties of the Spanish EQ have only been studied in adolescents. For this reason, the aim of the present research was to study the structure of the EQ in a sample of 738 participants aged 18–53 years (470 females, 268 males; mean age = 23.27, SD = 3.75). The results of the Confirmatory Factor Analysis showed the adequacy of grouping the 34 items into 8 scales, which were also grouped into two second-order factors: positive expectancies (social positive, fun, sex, and tension reduction) and negative expectancies (social negative, emotional negative, physical negative, and cognitive negative). Positive expectancies were related to higher alcohol consumption at the weekend, rather than during the week, and also with higher alcohol-related problems (AP). Negative expectancies were mainly positively related to AP. These results, and previous findings, suggest that negative expectancies are the consequence of both bad experiences with alcohol consumption and AP, rather than their cause. The present research supports the use of the EQ in Spanish adults Assessing relations between parental acceptance and children´s behavioral problems from a multi-informant perspective Miguel A. Carrasco, Eva Izquierdo-Sotorrío, F. Pablo Holgado-Tello Universidad Nacional de Educación a Distancia, Spain; macarrasco@psi.uned.es macarrasco@psi.uned.es This study examines relations between the perceived parental acceptance and children's behavioral problems (externalizing and internalizing) using parents and children as sources of information. The sample was composed of 270 participants (90 children, 90 fathers, and 90 mothers). Children (34 male and 56 female) aged from 9 to 16 years (M=11.91, SD= 2.03). Parents and children completed both the Parental Acceptance Rejection Questionnaire (PARQ; Ronher & Khaleque, 2008) and the Achenbach System of Empirically Based Assessment (ASEBA). Intercorrelations between children and parents were moderated, and these were high between fathers and mothers. Relations, between perceived parental acceptance and behavioral problems tend to be higher when children were the source of information. Accordingly, the parental acceptance perceived by children was the only significant predictor of the children's externalizing (not internalizing) problems. This was true when children or mothers (not fathers) were informed about the behavioral problems. Results are discussed in terms of predictive and incremental validity of the parental acceptance on children´s behavioral problems. The use of MMPI-2 to study late sequelae of trauma Krzysztof Rutkowski, Edyta Dembińska, Katarzyna Cyranka, Jolanta Walczewska, Michał Mielimąka Jagiellonian University Medical College, Poland; katarzyna.cyranka@interia.pl katarzyna.cyranka@interia.pl MMPI-2 is the most widely used personality questionnaire in the diagnosis of PTSD, however most of the research comes from the English-speaking population of veterans. The aim of this study was to investigate the MMPI-2 profiles of a group of politically persecuted Poles diagnosed with chronic untreated PTSD. The MMPI-2 personality questionnaire results of 327 Poles persecuted for political reasons during the years 1939-1968 were analyzed. A detailed analysis of the results of the validation scales F, L, and K, and clinical scales was conducted.The obtained profile, similarly to the ones obtained in studies on other populations with PTSD, is characterized by increased validation scale F and 8 out of 10 clinical scales (Hs,D,Hy,Pd,Pa,Pt,Sc, and Si). The highest means were obtained in scales D and Hs, and the overall analysis of the obtained MMPI-2 profile indicates the dominance of symptoms of anxiety, depressed mood, somatic complaints, dissatisfaction over the symptoms of social withdrawal, social isolation, suspicion, and hostility in the study group. The observed configuration of symptoms may be related to the chronic course of PTSD, age of respondents, as well as not receiving adequate treatment for many years after the traumatic factor activation. Experience sampling method to assess mindfulness in children Tanya Lecchi, Paola Di Blasio Catholic University of the Sacred Heart, Italy; tanya.lecchi@unicatt.it tanya.lecchi@unicatt.it Mindfulness has been conceptualized as “awareness that emerges through paying attention on purpose, in the present moment, and nonjudgmentally to the unfolding of experience moment by moment” (Kabat-Zinn, 1994, p. 4). Several tools for assessing mindfulness in adults have been created, for example the Mindful Attention Awareness Scale (MAAS; Brown & Ryan, 2003) and the Five Facet Mindfulness Questionnaire (FFMQ; Baer, Smith, Hopkins, Krietemeyer, & Toney, 2006), but there is only one scale for youths; the Child and Adolescent Mindfulness Measure (CAMM; Greco, Baer, & Smith, 2011). These instruments use self-report methods to assess some elements of mindfulness, such as the capacity to observe the present-moment and to avoid automatic reactions. Assessing mindfulness, the ability to be in the here and now, using retrospective self-report measures appears contradictory: people who are not aware of their mental states probably do not recognize their moments of distraction and so their answers are not reliable. In an ambulatory assessment study, 40 elementary school children (7-10 years old) completed items about their thoughts, moods, and feelings on tablets 5 times per day for 5 days, describing also the situations they where living. We identified different levels of mindfulness skills, correlated to parents’ observations. Latent classes of child behavior in preschool Christine DiStefano, Fred Greer University of South Carolina, USA; distefan@mailbox.sc.edu distefan@mailbox.sc.edu Preschool students (3-5 years of age) must make important behavioral and emotional transitions as they adjust to the school environment. For many children, this is a time where they are learning how to interact with peers and adults, pay attention, and monitor behavioral impulses. Recently, responses to intervention techniques have been incorporated with behavior and emotion to provide early assistance and intervention to students who may exhibit problems. Universal screening information is often collected early in the school year to examine functioning of all preschoolers to then assist those children with problems. However, instead of treating students individually, it would assist teachers if students with similar problems could be treated in the same manner. Using subscale information from approximately 2000 preschoolers assessed with the Behavioral and Emotional Screening System, this study will use latent profile analysis to uncover different groups of students, when student characteristics (e.g., gender, race/ethnicity) are used as covariates. Differences among outcomes (e.g. referrals to special education, teacher perceptions of kindergarten readiness) can be examined across classes. Information about latent classes help school psychologists and teachers provide interventions for groups of students. Psychometric properties of the 20-item IPIP scale in career counseling assessment practice Jana Turzáková, Tomáš Sollár, Katarína Baňasová Constantine the Philospher University, Slovak Republic; janaturzak@gmail.com janaturzak@gmail.com The aim of the study is to provide evidence of validity and reliability of the Slovak version of 20-item IPIP scale, a short inventory based on the International Personality Item Pool five-factor model. To address the objectives internal consistency and aspects of convergent and discriminant validity were examined. The presumptions about expected relationship or lack of relationship are theoretically grounded in the concepts of vocational identity, decision making, motivation and values. The 20-item IPIP scale and self-report instruments measuring vocational identity status, career indecisiveness, career decision making, career motivation, values and career anchors were administered in a sample of N = 124 high school students (63 males and 59 females) aged 16-18 (M(age) = 17.9, SD(age) = 0.68). The results suggest that all five scales have acceptable internal consistency despite only four items measuring each personality factor. Expected relationships of personality factors and facets of career-related variables were found, e.g. emotional stability was found to correlate positively with self-doubt and negatively with career commitment, self-determination, and intrinsic motivation. As expected, only few significant correlations were found for personality factors and work-related values and career anchors. Brief test of attention: Normative data for the Latin American adult population Diego Rivera¹, Melina Longoni², Patricia Saracho³, Marco Tulio Garza⁴, Javier Galarza⁵, Claudia Martínez⁶, María Fernanda Luna⁷, Paul Perrin⁸, Juan Carlos Arango-Lasprilla^1,9 ¹University of Deusto, Spain; ²Clinica de rehabilitacion Las Araucarias, Argentina; ³Universidad Cetys, Mexico; ⁴Universidad Autónoma de Nuevo León, Mexico; ⁵Universidad Autónoma de Baja California, Mexico; ⁶Universidad Nacional Autónoma de Honduras, Honduras; ⁷Universidad Jose Matias Delgado, Salvador; ⁸Virginia Commonwealth University, USA; ⁹IKERBASQUE. Basque Foundation for Science, Spain; dfriverac@deusto.es dfriverac@deusto.es The objective of this study was to generate the first and most comprehensive demographic-adjusted norms for the Brief Test of Attention (BTA) in Latin-American adult population. The sample consisted of 4856 healthy adult participants from 10 countries in Latin-American representing the demographic distribution of the population. Inclusion criteria were to have a Mini-Mental State Examination (MMSE) score of≥23, have a Patient Health Questionnaire–9 (depression) score of≤4, and have a Barthel Index of≥90. 2967 were women and 1889 were male, the average age was 53.8±19.9 years (range 18-90), and the average length of education was 10.0±5.0 years. Participants completed the BTA. Pearson correlation coefficient and coefficient of determination showed significant effect between BTA scores and MMSE, age, and education (r-squared -0.324 to 0.431; p<.001). Multinomial logistic regression yielded main effects for age and education groups on the BTA scores (r-square Cox & Snell>.173, p<.001). Consequently, correction tables were created in order to adjust the raw scores based on age and education. The percentiles tables were calculated based on the distribution of adjusted scores. This study is the first to create BTA norms in the Latin-American population and include appropriate adjustments for age and education. These data represent a critical advancement in the assessment in Latin-American adults. Measuring decision making behavior with KETO - A short decision making test online Daniel Hausmann, Julia Stoll University of Zurich, Switzerland; d.hausmann@psychologie.uzh.ch d.hausmann@psychologie.uzh.ch At present there is no questionnaire that is capable of systematically measuring the variety of decision making strategies in a decision making situation under uncertainty. We developed the KETO (Kurzer Entscheidungs-Test Online) which includes four important aspects of decision making behavior: risk behavior, decision making strategies, demand for certainty, and consistency of the behaviour. KETO was implemented as a short, attractive online game with 22 main trials and lasts about 10 minutes (see http://keto.dah-media.ch/?xt=ECPA13). The test subjects are introduced to a scenario (commercial shipping) and have the opportunity to select one of four ships (options) that will bring their cargo with more or less certainty (i.e. with or without a profit) into the port of destination. More than 1,000 individuals participated in KETO. The most frequently behavior (60%) was a satisficing strategy using a specific and consistent individually desired level of confidence (DLC) between 33% and 97%. Retests and validation revealed satisfactory results. KETO was first implemented as a multiligual basic version, but can be applied for measuring adaptive and intelligent behaviour too while comparing individual behavior in the basic version with several modified scenarios. Consequently, KETO is ready to be implemented as an attractive and diversified tool in assessment centers. Assessment of career choice readiness: Application of different multitrait-multimethod approaches Stefan Hoeft, Matthias Ruebner, Michael Boesinger-Schmidt, Stephanie Sauer University of Applied Labour Studies, Germany; stefan.hoeft@hdba.de stefan.hoeft@hdba.de A self-report career choice readiness (CCR) inventory had been constructed as a tool for the career counseling for job starters (under the age of 25). It is called "BET-U25," and has 50 items (short version: 16 items). The underlying concept differentiates five dimensions that can often be identified in career choice readiness models: problem awareness, vocational self-evaluation, level of vocational information, decision behavior, and realization activity. Utilizing a self-other design, the analyses concentrate on the convergence of the CCR self-reports collected prior to the counseling session (in total n=643) with two other sources: career counselor assessment of candidate’s CCR (n=266) and ratings of independent observer (n=83). Basic correlations of ratings belonging to corresponding dimensions range between r=.3 and r=.6. In general, the “other”-ratings show higher convergences with each other. Relevant moderators are: the level of education of the counselee, the primary topics discussed in the career counseling session, and the specific CCR dimensions. The poster presents results of different multitrait-multimethod (MTMM) approaches to analyze the existing empirical interrelations. The focus is on structural equation models using a confirmatory factor analysis framework. Beside goodness-of-fit and MTMM criteria according to Campbell & Fiske (1959) a special emphasis lies on the content-related implications of different model variants. Psychometric and operative properties of the Portuguese short version of the Postpartum Depression Screening Scale at the sixth month postpartum Ana Telma Pereira¹, Vera Freitas¹, Sandra Bos¹, Mariana Marques¹, Maria João Soares¹, Berta Maia², José Valente¹, António Macedo² ¹Universitity of Coimbra, Portugal; ²The Catholic University of Portugal, Portugal; apereira@gmail.com apereira@gmail.com The Portuguese short version of the Postpartum Depression Screening Scale (PDSS-21; Pereira et al. 2013), proved to be superior to the 35-items PDSS, when validated in pregnancy and in the third month postpartum. As depression is the most prevalent affective disorder that can occur from conception to the first year postpartum, the aim of this study was to analyse the PDSS-21 psychometric and operative properties at the sixth month postpartum. Participants were 327 women (mean age=30.31±6.154 years) in the sixth month post-partum (M=6.154±.503). All women completed PDSS and were interviewed using the Mood disorders Section/Diagnostic Interview for Genetic Studies. ROC analysis was applied (using MedCalc) to determine cut-off points and associated conditional probabilities adjusted to the prevalence. PDSS-21 Cronbach alphas was .93. PDSS-21 total score highly and significantly correlated with the BDI-II total score (r=.68). For major depression(DSM-IV), PDSS-21 cut-off point of 36 (prevalence 4.3%), resulted in sensitivity 85.71%, specificity 87.50%, positive predictive value(PPV) 23.55% and negative predictive value(NPV) 99.27%; for depressive disorder(ICD-10) the cut-off of 33 (prevalence 4.6%) determined sensitivity 86.67%, specificity 82.96%, PPV 19.69% and NPV 99.23%. At the sixth month postpartum PDSS-21 revealed adequate reliability, concurrent validity and screening ability. Perseverative negative thinking mediates the relationship between perfectionism cognitions and OC symptoms Ana Telma Pereira¹, Tiago Ferreira¹, Antonio Pissara², Vasco Nogueira¹, Joana Andrade³, David Mota³, António Macedo¹ ¹Universitity of Coimbra, Portugal; ²Mental Health Department, Portugal; ³Coimbra Hospital and University Centre, Portugal; apereira@gmail.com apereira@gmail.com The objective of the study is to investigate if Perseverative Negative Thinking (PNT), measured with a new transdiagnostic measure, mediates the relationship between perfectionism cognitions and obsessive compulsive (OC) symptoms. 464 Students (mean age 22.5±4.5) participated in an online survey including the Portuguese validated versions of Maudsley OC Inventory (MOCI; Nogueira et al. 2012; to assess Doubting/Rumination, Checking and Cleaning), Perseverative Thinking Questionnaire (PTQ-15; Chaves et al. 2013; a content independent questionnaire to assesses the core characteristics of PNT - Repetitive thought and Cognitive interference and unproductiveness), and the Multidimensional Perfectionism Cognitions Inventory (MPCI; Macedo et al., 2013; to assesses the frequency of cognitions associated with dispositional perfectionism with three subscales - Concern over mistakes, Personal standards, and the Pursuit of perfection). Negative affect (NA) was assessed with a short version of the Profile of Mood States (Amaral et al. 2013). Both PTQ dimensions were partial mediators of the relationships between: Pursuit of Perfection and MOCI_Total; the three perfectionism subscales and Doubting/Rumination; Pursuit of Perfection, and Checking. PTQ-15_Total and IMCP_Total were mediators of the association of MOCI_Total with NA. PNT mediates the relationship between perfectionist cognitions and OC symptoms. PNT and perfectionist cognitions increases the strength of the relationship between OC symptoms and NA. Personality traits assessment in Alzheimer's dementia Joana Henriques-Calado, Maria Eugénia Duarte-Silva, Ana Sousa Ferreira University of Lisbon, Portugal; joana.calado@netcabo.pt joana.calado@netcabo.pt This research study is essentially geared towards the assessment of current and pre-morbid personality characteristics in Alzheimer's Dementia. The Five-Factor Personality Model is taken as a reference. The study was conducted with four groups, which were assessed using the NEO-Five Factor Inventory (NEO-FFI), mainly in the form of individual interview sessions. The group assessing current personality characteristics for induviduals with Alzheimer's Dementia consists of 44 female participants (M = 81.36 years) and well as a control group, that consists of 80 female participants from the population at large (M = 75.84 years). Additionally, the group of informants assessing the pre-morbid personality characteristics for induviduals suffering from Alzheimer's Dementia consists of 40 informants with a control group of 42 informants. The results are in line with the literature review and provide new research data. Some findings are analyzed as accentuations of previously existing characteristics, as high Neuroticism, and reflecting a possible continuum from pre-morbid to current personality, as low Openness to Experience and low Agreeableness. In order to increase sensitivity towards an early diagnosis with practical implications for current clinical diagnosis, future empirically-based research should take a set of personality variables into account in the evaluation of Alzheimer's Dementia diagnosis. Assessment and diagnosis of general anxiety disorder in primary care using the GAD-7. A validation with the CIDI in the PsicAP Project (psychology in primary care) Roger Muñoz Navarro¹, Antonio Cano Vindel², Paloma Ruíz Rodriguez², Cristina Mae Wood², Benigna Díaz-Ovejero², Esperanza Dongil¹, Itziar Iruarrizaga², Mar García Moreno¹, Fernando Chacón³, Francisco Santolaya³, Antonio Capafons Bonet¹, Patricia Tomás Tomás¹, PsicAP Research Group³ ¹University of Valencia, Spain; ²University of Madrid, Spain; ³Spanish Council of Psychologists, Spain; roger.munoz@uv.es roger.munoz@uv.es Anxiety disorders, along with depression and somatization, are the most prevalent Emotional Disorders (ED) in Spanish Primary Care (PC) settings. According to international clinical guidelines, the GAD-7 is a highly recommended self-report screening test for detecting the presence of generalized anxiety disorder (GAD) in PC. The CIDI is the structured diagnostic interview of the World Health Organization. The aim of this study was to determine the psychometric properties of the GAD-7 as an effective instrument in detecting GAD in PC centers. Trained psychologists applied the CIDI to 178 patients, who had also completed the GAD-7. Statistical analyses were performed to find out the psychometric properties of the PHQ, including sensitivity, specificity, positive and negative predictive values, and positive and negative likelihood ratios. The GAD-7 showed a high sensitivity (.87) and specificity (.78) with a cut-off of 10 in detecting GAD, taking the CIDI as standard criteria. Positive and negative predictive values of .93 and .64, and positive and negative likelihood ratios of 3.96 and 0.17 were also obtained. The GAD-7 is a very reliable tool in screening for GAD in Spanish PC centers that can improve diagnosis and referral of patients before they receive the appropriate psychological treatment. Assessment and diagnosis of the Panic Disorder in Spanish primary care centers. A study of the test characteristics of the PHQ-‐PD with the SCID-‐I in the PsicAP Project (psychology in primary care). Roger Muñoz Navarro¹, Antonio Cano Vindel², Paloma Ruíz Rodriguez², Cristina Mae Wood², Benigna Díaz-Ovejero², Esperanza Dongil¹, Itziar Iruarrizaga², Mar García Moreno¹, Fernando Chacón³, Francisco Santolaya³, Antonio Capafons Bonet¹, María Dolores Gómez Castillo³, Patricia Tomás Tomás¹, PsicAP Research Group³ ¹University of Valencia, Spain; ²University of Madrid, Spain; ³Spanish Council of Psychologists, Spain; roger.munoz@uv.es roger.munoz@uv.es Panic Disorder (PD) with or without agoraphobia is one of the most prevalent anxiety disorders and it is commonly misdiagnosed. Simple screening tests are required to reduce these errors in Spanish Primary Care (PC) settings. The section of the Patient Health Questionnaire (PHQ) for detecting Panic Disorder (PHQ-PD) is a self-screening test. The SCID-I is a structured diagnostic interview based on DSM-IV Axis I diagnoses. In this work, we studied the capacity of the PHQ-PD to detect effectively Panic Disorder. 178 Patients filled in the PHQ-PD and trained psychologists applied the SCID-I to them. Statistical analyses were performed to study the psychometric properties of the PHQ-PD. Using the original algorithm of the PHQ-PD, the sensitivity and specificity were .50 and .89, respectively. Indeed, after changing the algorithm using DSM-IV criteria we reached a sensitivity of .77 with a decrease in the specificity (.72). In addition, the screening question was analyzed finding the best sensitivity levels at .83 but with a specificity of .66. Using the modified algorithm, the PHQ-PD presented an optimal level of sensitivity and specificity in identifying PD; the screening question may be used by general practitioners to detect probable PD in a fast and simple way. Do playfulness questionnaires measure playful behavior? Findings from a diary study Noemi Marti, René T. Proyer University of Zurich, Switzerland; noemi.marti@uzh.ch noemi.marti@uzh.ch Adult playfulness is an understudied topic. Accordingly, the assessment of playfulness in adults is also somewhat under-developed. While many authors report convergent validity with other playfulness scales (and sometimes data on divergent validity), the basic question of whether playfulness scales assess playful behavior in daily life is rather open. In an effort to narrow this gap, we asked participants to report on their playful activities in 14 consecutive days. Each day, they completed a rating for 22 activities. These were derived from prevouis studies in which students reported on their playful activities, from in-depth interviews, a literature review, and expert judgments. The sample size will comprise > 300 psychology students; data collection is still ongoing. Additionally, participants completed four frequently used playfulness measures; i.e., (a) Playfulness Scale for Young Adults (Barnett, 2007); (b) the Need for play-scale of the Personality Research form (Jackson, 1997); (c) the Short Measure of Adult Playfulness (Proyer, 2012); and (d) the OLIW (Proyer, 2014). The data will allow testing to what degree the scores in the playfulness measures reflect daily playful behavior. Evaluating the rationale of a novel irony performance test Richard Bruntsch, Willibald Ruch Universität Zürich, Switzerland; r.bruntsch@psychologie.uzh.ch r.bruntsch@psychologie.uzh.ch The newly developed test at hand employs 30 scenarios with ironic target utterances and 10 distractors including no verbal irony. Irony detection performance is assessed via the judgment of the situations along indicative statements. In order to gain consensus scores for item polarity and judgments, the test was administered to 72 subjects. Four groups received different instructions. Participants in two groups were devised to consider all target utterances either as ironic (TIN1) or as literal (TIN2) regardless of their own appraisal. A third group (TIN3) was instructed to watch out for verbal irony and fill in the judgment according to their own interpretation. The fourth group (TIN4) filled in the judgments without being pointed to the occurrence of verbal irony. Differences in means between TIN1 and TIN2 indicated that the judgment statements are suitable to reflect irony detection. Comparing TIN1 and TIN2 on the one hand with TIN3 and TIN4 on the other hand illustrated that utterances designed to be ironic were de facto considered as ironic and literal utterances were consented to be literal. Outlined results suggest that the rationale of the test and the designed polarity are valid. Further steps of test construction are discussed. Rorschach changes after long-term group psychotherapy. A review Theano-Erifyli Moschona¹, Katerina Kaliakatsou², Dimitra Vekiari², Ioannis K. Tsegos³ ¹Piraeus University of Applied Sciences, Greece; ²Institute of Diagnostic Psychology-OPC, Greece; ³Open Psychotherapy Centre, Greece; tmoschona@gmail.com tmoschona@gmail.com This study presents a review of the empirical studies on the personality changes and outcomes following long-term group psychodynamic psychotherapy, assessed with Rorschach. Systematic literature searches of the studies, using Rorschach, were undertaken through Pubmed, PsychInfo, EmBase. The empirical studies comprised randomized controlled trials, observational studies, and meta-analyses, from 1990 to 2014. The studies of the review have been classified according to the design, the points of measurement, the diagnosis, the duration and therapy, the changes in Rorschach variables and indices, and the conclusions. All studies reported improvement in functional and structural dimensions of personality. To sense the humour: how to measure sensitivity to humour Jakub Grzegorz Górecki John Paul II Catholic University of Lublin, Poland; j_gorecki@o2.pl j_gorecki@o2.pl The experimental version of the cognitive sensitivity to humour was prepered. Theoretical inspiration for it was drawn from Koestler's assumption that humour and creativity arise from two formly discrepant frames of reference. Similarly to the traditional cognitive humour theories of Suls (1972), Shultz (1973), motivational - cognitive like Apter's (1982), or linguistic aproaches (Raskin & Attardo, 1991), there is the idea of joining merging of different scripts or meanings. Examining the cognitive sensitivity to humour should consider the ease of seeing subtle relations, or joining different or opposingscripts or ideas. Similar idea can be found in the field of creativity research: it is called sensitivity to problems. Pioneers of the psychometric research of creativity such as Guilford or Torrance appreciated this variable and they operationalized it in their tests. Getzels i Csiszkentmihalyi (1971) brought light to the fact that discovery oriented behavior was a good predictor of the artistic success in later life in art school students. Researchers nowadays assume that finding and constructing a problem precedes it's solving (Mumford, Mobley, Uhlman, Reiter-Palmon, & Doares, 1991).In my theoretical presentation I will show how those assumptions are possible to use for the preparation of the method which will make easier understanding individual differences in the sense of humour. Adolescents’ aggressiveness and attachment: Differences across populations Bárbara Torres-Gomez, Miriam Gallarin, Iñigo Ochoa de Alda, Itziar Alonso-Arbiol University of the Basque Country, Spain; itziar.alonso@ehu.eus itziar.alonso@ehu.eus Attachment insecurity to parents has been frequently pointed out as an important risk factor of aggressiveness in adolescence. Although attachment has been often called into question in the case of adopted children, it has not been confirmed that adopted adolescents show lower levels of attachment security or of aggressiveness than other populations. We aimed at examining adopted adolescents’ levels of attachment and aggressiveness as compared to those of biological adolescents and of adolescents with behavioural problems. A total of 204 adolescents (38.2 % adopted, 27.9 % biological, and 33.8 % with behavioral problems) took part in the study. Attachment security was assessed with the Inventory of Parent and Peer Attachment-Spanish (IPPA-S; Gallarin & Alonso-Arbiol, 2013), while aggressiveness was assessed with the Multifacet Aggressiveness Scale (MAS; Gallarin, Alonso-Arbiol, & Ittel, 2015). ANOVAs across the three groups revealed that only adolescents with behavioral problems statistically differed from the other two groups, showing higher scores in all dimensions of aggressiveness, and lower levels of attachment security. In a new analysis computed only with boys, similar results emerged: biological and adopted adolescents showed no differences, whereas the clinical group showed lower levels of attachment and higher levels of aggressiveness. Adopted adolescents do not differ substantively from the general population in aggressiveness and attachment security. Rasch rating scale modeling of Internet Game Addiction Scale (IGAS) for adolescents Unkyung No, Miri Yoon, Sehee Hong Korea University, South Korea; miriyoon816@gmail.com miriyoon816@gmail.com The purpose of this study was to re-evaluate fit and item difficulty of the Internet Game Addiction Scale (IGAS) developed through factor analysis, and to verify the adequacy of the response categories. For this purpose, we used Rasch rating scale modeling on Korean version of IGAS with 6,499 Korean adolescents (3,493 male and 3,006 female). IGAS has 20 items and each item was recorded on a 4-point scale (1=strongly disagree, 2= disagree, 3=agree, 4=strongly agree). This scale contains three factors which are; game addiction, lack of immunity and control, and experience of withdrawal and emotion. Unidimensionality is founded in each sub-dimension. So, Rasch modeling is applied to each sub-dimension. As a result, all items showed adequate mean square infit and outfit statistics within a 0.7 and 1.3 range. For comparison, the distribution of the trait ability estimates and item difficulty estimates are on a common metric. Estimates of item difficulty were placed at higher part, but trait ability estimates were at lower part. The response categories of this scale showed suitable in category probability curve. Therefore, it is necessary to modify for easy and diverse items in terms of item difficulty. Measuring the propensity to perceive good things and testing it as a mediator in interventions: The case of the three “good things”-intervention Sara Wellenzohn, René Proyer, Fabian Gander, Willibald Ruch University of Zurich, Switzerland; s.wellenzohn@psychologie.uzh.ch s.wellenzohn@psychologie.uzh.ch We argue that individual differences exist in the propensity to perceive positive aspects in one’s daily life (positivity propensity) and that this variable is a mediator in the effectiveness of positive psychology interventions such as the three “good things” intervention. The main goal of the present study is developing and evaluate a measurement paradigm for the propensity to perceive good things in one’s daily life as an objective personality test sensu R. B. Cattell. For this purpose, we collected 30 everyday situations/experiences that are ambiguous with regard to their valence (determined in a pre-study). Participants are asked to indicate whether a situation would be positive for them. It is expected that the number of situations that are perceived as positive are indicative of an individual’s propensity to perceive positive aspects. For evaluating the newly developed instrument, data from cross-sectional (i.e., convergence with subjective measures of the propensity to perceive positive aspects in daily life, and other measures of well-being), and experimental studies (i.e., whether the positive propensity increases following a well-established intervention for increasing well-being) are collected. Data collection is still ongoing and results will be presented at the conference.
4:30pm - 6:00pm	IS2: On the Effect of Item Positions in Tests Session Chair: Karl Schweizer Session Chair: Siegbert Reiß
KO2-F-180 (Ⅵ)
	On the effect of item positions in tests Chair(s): Karl Schweizer (Goethe University Frankfurt, Germany), Siegbert Reiß (Goethe University Frankfurt, Germany) The item-position effect is usually observable if test takers complete a homogeneous set of items that constitute a psychological scale because successively completing a number of items that are demanding to the same ability or trait is modifying performance. The repeated call of the same cognitive processes can involve automation, facilitation, clustering, maintenance of information and learning. The consequence is an increasing degree of dependency among the responses to the successively presented items. It means an increasing degree of consistency in responding from the first to last items. Although this effect has been known for quite a time, the major models of measurement do not take it into consideration. The presentations will provide further evidence of the item-position effect regarding different psychological scales and inform about new developments in improving the representation and investigation of it. There will be reports of the item-position effect in Advanced Progressive Matrices, Cattell’s Culture Fair Test and Viennese Matrices Test. The new developments will encompass the IRT and CFA approaches. These new developments aim to enable more appropriate representations of the item-position effect and better ways of separating what is due to the effect and what is a pure representation of the construct. Presentations of the Symposium The impact of the position effect on the factorial structure of the Culture Fair Test (CFT) Stefan J. Troche¹, Felicitas L. Wagner², Karl Schweizer³, Thomas H. Rammsayer²; Stefan.Troche@uni-wh.de Stefan.Troche@uni-wh.de ¹Private Universität Witten/Herdecke, Germany, ²University of Bern, Switzerland, ³Goethe-University Frankfurt, Germany The Culture Fair Test (CFT) is a psychometric test of fluid intelligence consisting of four subtests; Series, Classification, Matrices, and Topographies. The four subtests are only moderately intercorrelated, doubting the notion that they assess the same construct (i.e., fluid intelligence). As an explanation of these low correlations, we investigated the position effect. This effect is assumed to reflect implicit learning during testing. By applying fixed-links modeling to analyze the CFT data of 206 participants, we identified position effects as latent variables in the subtests; Classification, Matrices, and Topographies. These position effects were disentangled from a second set of latent variables representing fluid intelligence inherent in the four subtests. After this separation of position effect and basic fluid intelligence, the latent variables representing basic fluid intelligence in the subtests Series, Matrices, and Topographies could be combined to one common latent variable which was highly correlated with fluid intelligence derived from the subtest Classification (r=.72). Correlations between the three latent variables representing the position effects in the Classification, Matrices, and Topographies subtests ranged from r=.38 to r=.59. The results indicate that all four CFT subtests measure the same construct (i.e., fluid intelligence) but that the position effect confounds the factorial structure. The position effect in a Rasch-homogenous test: A fixed-links modeling approach Philipp Thomas¹, Thomas H. Rammsayer¹, Karl Schweizer², Stefan J. Troche³; philipp.thomas@psy.unibe.ch philipp.thomas@psy.unibe.ch ¹University of Bern, Switzerland, ²Goethe University Frankfurt, Germany, ³Private Universität Witten/Herdecke, Germany The position effect describes the influence of just-completed items in a psychological scale on subsequent items. This effect has been repeatedly reported for psychometric reasoning scales and is assumed to reflect implicit learning during testing. One way to identify the position effect is fixed-links modeling. With this approach, two latent variables are derived from the test items. Factor loadings of one latent variable are fixed to 1 for all items to represent ability-related variance. Factor loadings on the second latent variable increase from the first to the last item describing the position effect. Previous studies using fixed-links modeling on the position effect investigated reasoning scales constructed in accordance with classical test theory (e.g., Raven’s Progressive Matrices) but, to the best of our knowledge, no Rasch-scaled tests. These tests, however, meet stronger requirements on item homogeneity. In the present study, therefore, we will analyze data from 239 participants who have completed the Rasch-scaled Viennese Matrices Test (VMT). Applying a fixed-links modeling approach, we will test whether a position effect can be depicted as a latent variable and separated from a latent variable representing basic reasoning ability. The results have implications for the assumption of homogeneity in Rasch-homogeneous tests. Predictors of an individual decrease in test performance during the PISA assessments Johannes Hartig¹, Janine Buchholz¹, Dries Debeer², Rianne Janssen²; hartig@dipf.de hartig@dipf.de ¹DIPF, Germany, ²KU Leuven, Belgium Item position effects have been shown repeatedly in large-scale assessments of student achievement. In addition to a fixed effect of items becoming more difficult during the test, there are individual differences related to this effect, meaning that students differ in the extent to which their performance declines during the test. These interindividual differences have been labelled as “persistence” in previous studies. The present study aims at gaining a better understanding of the nature of these differences by relating them to student characteristics. The analyses make use of the the PISA 2006 and 2009 assessments on science and reading, respectively, using data from several European countries. Gender, the language spoken at home, the socio-economic status, the motivational scales “effort thermometer” (2006 assessment), and the “joy of reading” (2009 assessment) were used as predictors for persistence. Position effects and persistence are modelled by a logistic multilevel regression model which is equivalent to an extension of the Rasch model. Effects of gender, language, and reported test effort are inconsistent across countries, e.g. girls have a higher persistence only in some countries. The effect of the reported joy of reading is small but consistent across all countries, indicating that at least part of the individual differences is caused by individual differences in subject-specific motivation. Modeling response omissions in tests using a tree-based IRT approach Dries Debeer, Rianne Janssen; Rianne.Janssen@ppw.kuleuven.be Rianne.Janssen@ppw.kuleuven.be KU Leuven, Belgium Reported item position effects in large-scale assessments often pertain to an increased item difficulty towards the end of the test and to respondents differing in their level of persistence completing the test. Both phenomena may be partly due to the increased occurrence of missing responses towards the end of the test and individual differences therein. In fact, two types of missing responses are possible, respondents may omit certain items well before reaching their last answered item, leading to “skipped items” and respondents may not complete the entire test and drop out before the end of the assessment, leading to “not-reached items”. Both types of missing responses may be related to the proficiency of the respondent, and therefore, cause non-ignorable missingness. Several studies have proposed ways to deal with these missing responses. In the present paper, an IRTree-based approach will be presented in which both types of missing responses are modeled together with the proficiency process. The IRTree models can be applied to both power and speed tests and are modeled fairly easily. Apart from results of several simulation studies, the analyses of a speed test on mental arithmetic from a Flemish national assessment will be discussed. On the search for the best possible representation of the item-position effect: A simulation study based on APM Florian Zeller, Siegbert Reiss, Karl Schweizer; Florian.zeller@outlook.com Florian.zeller@outlook.com Goethe University Frankfurt, Germany The item-position effect describes the impact of prior completed items on the following items. In previous studies the item-position effect was represented by constraints reflecting functions, for example a linear function. This kind of representation was inflexible regarding the specificities of the items, and therefore, there was the question whether this is the best possible way of representing the effect. Accordingly, our aim was to optimize the representation of the item-position effect in considering the items of Raven’s Advanced Progressive Matrices (APM). We disassembled the 36 APM items into two, three, four, and six same–sized subsets of neighboring items for separate investigations. Analyses were conducted by means of data that were simulated according to the covariance matrix of the APM items based on the data of 530 participants. Similar to former studies we used fixed-links models for testing different representations of the item-position effect. Besides the standard model with only one latent variable we analyzed linear, quadratic and logarithmic trends of the item-position effect. The results revealed an increase of true variance from the first to last items, just as expected. But the course of increase varied in slope.
4:30pm - 6:00pm	S2: Online Assessment and Internet-Based Research Session Chair: Ulf-Dietrich Reips Session Chair: Stefan Stieger
KOL-G-217 (Ⅳ)
	Online assessment and internet-based research Chair(s): Ulf-Dietrich Reips (University of Konstanz, Germany), Stefan Stieger (University of Konstanz, Germany; University of Vienna, Austria) During the last decades, online assesment and testing became an indispensable data source for research, and not only in the fields of personality psychology, intelligence, and achievement. The Internet provides a powerful infrastructure for data collection and many researchers have been taking advantage of it to conduct basic and applied research. This symposium will cover new developments and present tools and examples of how to use the Internet for online assessment and research. This session intends to give an overview of the online assessment/testing expansion in recent years, including topics such as the rise of mobile computing (smartphones) in research, relationships between self-reported executive problems, personality, and cognitive performance, self-ratings versus observers' ratings of personality on Facebook profiles, and how to handle and analyze dropout in Internet-based research. The session will also explore the technical and ethical issues in the use of data generated by online assessment/testing as well as the added value and benefits of such data. Special attention will be given to the development of the International Personality Item Pool (IPIP) as a possible blueprint for online cross-cultural personality assessment in the public domain. Presentations of the Symposium Smartphone apps in psychological science: Results from an experience sampling method study Stefan Stieger¹, Ulf-Dietrich Reips²; stefan.stieger@uni-konstanz.de stefan.stieger@uni-konstanz.de ¹University of Konstanz, University of Vienna, Germany, Austria, ²University of Konstanz, Germany Data collection methods in the social and behavioral sciences have always been inspired by new technologies. The introduction of the Internet had a major impact in advancing the methodological repertoire of researchers, with Internet-based experiments, online questionnaires, and non-reactive online data collection methods, to name just a few. Meanwhile, the next major impact from technology is hitting research – smartphones. The penetration rate of these small mobile devices is increasing rapidly, and they offer a multitude of new sensors that can be used for scientific research (e.g., GPS, gyroscope, accelerometer, temperature sensors). We report a smartphone app field study about well-being conducted in German-speaking countries (n =219). It took place for 14 days with three measurements per day (8000+ well-being judgments). Based on this study, we discuss important aspects of the planning of a smartphone study (e.g., programming, implementation, pitfalls, and recruitment strategies). The presentation aims not only to present empirical data about an exemplary smartphone study, but also to present the unique aspects of smartphone studies compared to traditional research methods of data collection. What do self-report measures of problems with executive function actually measure? Data from internet and laboratory studies Tom Buchanan; T.Buchanan@westminster.ac.uk T.Buchanan@westminster.ac.uk University of Westminster, United Kingdom Measuring executive function interests researchers and practitioners in a number of psychological fields. Self-report measures of executive problems may have considerable value, especially for research conducted via the internet. They are easier to implement online than traditional cognitive tests, and arguably have greater ecological validity as indices of everyday problems. However, there are questions about whether they actually measure executive function, or other constructs such as personality. Relationships between self-reported executive problems, personality, and cognitive performance were assessed in three correlational studies using non-clinical samples. In Study 1, 49398 participants completed online measures of personality and self-reported executive problems. In Study 2, 345 participants additionally completed an online Digit Span task. In Study 3, 103 participants in a traditional laboratory setting completed multiple measures of personality, self-reported executive problems, and objective cognitive tests. Across all three studies, self-reported problems correlated with neuroticism and with low conscientiousness, with medium to large effect sizes. However self-reported problems did not correlate with performance on Trail Making, Phonemic Fluency, Semantic Fluency or Digit Span tests tapping aspects of executive function. These findings raise questions about self-report measures of executive problems, both on the Internet and offline. Self-ratings of personality and observers' ratings based on Facebook profiles Boris Mlačić, Goran Milas, Ivna Sladić; Boris.Mlacic@pilar.hr Boris.Mlacic@pilar.hr Institute of Social Sciences Ivo Pilar, Croatia The aim of the study was to investigate the relationship between self-ratings of personality and expert observers’ ratings of personality based on Facebook profiles of target persons. The self-rating sample consisted of 177 participants with active Facebook profiles between March and June 2014. Expert observers were students in the final year of masters’ course in psychology with training in personality psychology. Personality traits from the Big-Five model were assessed by the IPIP50 (Goldberg, 1999; Mlačić & Goldberg, 2007) while Facebook usage was assessed by the Questionnaire of Facebook use (Ross et al., 2009). Observers’ ratings of personality were based on data from Facebook profiles where each of the five personality dimensions (Extraversion, Agreeableness, Conscientiousness, Emotional Stability, and Intellect) was briefly defined. The results showed significant relations between Facebook usage and Agreeableness and Extraversion, respectively. Observers’ personality ratings correlated significantly with self-ratings of Conscientiousness and Intellect while the ratings between observers were the highest for the dimensions of Extraversion, Conscientiousness, and Emotional Stability. Dropout analysis with DropR: An R-based web app to analyze and visualize dropout Ulf-Dietrich Reips¹, Matthias Bannert²; reips@uni-konstanz.de reips@uni-konstanz.de ¹University of Konstanz, Germany, ²ETH Zurich, Switzerland With Internet-based research non-response such as lack of responses to particular items and dropout have become interesting dependent variables, due to highly voluntary participation and large numbers of participants (Reips, 2000, 2002). In this paper we develop and discuss the methodology of using and analyzing dropout in Internet-based research, and we present DropR, a Web App to analyze and visualize dropout. The Web App was written in R, a free software environment for statistical computing and graphics. Among other features, DropR turns input from datasets in various formats into visual displays of dropout curves. It calculates parameters relevant to dropout analysis, such as Chi Square values and odds ratios for points of difference, initial drop, and percent remaining in stable states. With automated inferential components, it identifies critical points in dropout and critical differences between dropout curves for different experimental conditions and produces related statistical copy. The visual displays are interactive, users can use mouse over and mouse drag and click to identify regions within a display for further analysis. DropR is provided as a free R package (http://cran.r-project.org/web/licenses/GPL-2) and Web service (http://dropr.eu) from researchers for researchers. Measuring narcissism online: Development and validation of a brief web‐based instrument Tim Kuhlmann, Michael Dantlgraber, Ulf-Dietrich Reips; tim.kuhlmann@uni-konstanz.de tim.kuhlmann@uni-konstanz.de University of Konstanz, Germany Narcissism continues to be a widely researched topic in psychology, and the scientific community is in need of validated online instruments. The present paper describes the development and validation of a questionnaire for the web-based assessment of sub‐clinical narcissism. Several versions were developed, including items from the original NPI-40 (Raskin & Terry, 1988) and from the open item database IPIP. Using the multiple-site-entry technique (Reips, 2000), a sample of 1972 participants was recruited. They answered the original 40 items of the NPI‐40 in either choice or Likert-type format as well as 80 items from the IPIP with a Likert-type answer format. The NPI‐40 in original choice format showed unsatisfactory fit-characteristics in a CFA. After factor analysis of all Likert-type items, an 18-item questionnaire for narcissism with three intercorrelated subscales emerged. These were labeled importance, manipulation, and vanity. The overall narcissism score had good internal consistency (α = .91), with the subscales showing acceptable reliabilities (α = .78 - .83). The final scale was validated in a separate sample with 549 participants. The three-factor-structure was replicated and similar psychometric properties were shown. The questionnaire provides researchers with a brief and validated instrument for the web-based assessment of narcissism and its sub‐facets.
4:30pm - 6:00pm	PA8: Educational Assessment 1 Session Chair: Anne-Kathrin Mayer
KOL-G-204 (Ⅱ)
	Equivalence of computerized versus paper-and-pencil testing of information literacy under controlled versus uncontrolled conditions: An experimental study Anne-Kathrin Mayer¹, Günter Krampen^1,2 ¹ZPID - Leibniz Institute for Psychology Information, Germany; ²University of Trier, Germany; mayer@zpid.de mayer@zpid.de Achievement tests, as well as self-report questionnaires, may provide reliable and valid results regardless of medium (e.g.paper-and-pencil vs. computerized testing) or mode (e.g. supervised vs. unsupervised testing) of test administration. However, because evidence is inconsistent and test-specific, it is recommended to review the equivalence of each assessment tool before applying it in various formats. Thus, the present study examines the equivalence of two information literacy measures by comparing their a) psychometric properties (internal consistencies, item-total correlations), b) means and standard deviations, and c) intercorrelations under different conditions. In an experimental study, educational students (n=141) completed a knowledge test which aims to assess individuals’ ability to find and evaluate scholarly information, and a questionnaire assessing information literacy related self-efficacy beliefs. Medium and mode of test administration were varied in a 2 x 2 between subjects design. Testing was conducted in a paper based or a computer based format either individually under supervision, or under uncontrolled conditions. While the self-efficacy scale yielded comparable results under the different experimental conditions, the knowledge test appeared to be more susceptible to variations of test administration. Results are discussed with respect to general differences in measurement equivalence of test versus questionnaire data. The predictive validity and stability of standardized assessment in early childhood education Niek Frans¹, Wendy J. Post¹, Mark Huisman¹, C.E. {Ineke} Oenema-Mostert^1,2, Anne L. Keegstra³, Alexander E.M.G. Minnaert¹ ¹University of Groningen, The Netherlands; ²Stenden University of Applied Sciences, The Netherlands; ³University Medical Center Groningen, The Netherlands; N.Frans@rug.nl N.Frans@rug.nl There has been a lot of controversy surrounding the use of standardized achievement tests in preschool. Several researchers claim that the performance of young children is too fickle to be reliably and validly tested. The goal of this study was to examine the predictive validity for future performance and the score-stability of two widely administered Dutch preschool tests. Language and arithmetic scores of 431 children were collected retrospectively over a four-year period. First, percentile scores of low scoring children were plotted to assess the stability of scores over time. Second, predictive validity of arithmetic and language scores was assessed by means of a multilevel model. Both the language and arithmetic tests were poor identifiers for low scoring children in first and second grade. The majority of low scoring first and second graders achieved above average in preschool, or fluctuated between top and bottom range scores. A small group did not show large fluctuations in scores. Low correlations (r= .09 to .30) between the preschool tests and subsequent tests indicated that both tests are weakly to moderately associated with first and second grade performance. The results are discussed in light of practical applications of these tests. A cross-cultural study of Curriculum-Based Measurement in Brazil Maria Cristina Joly¹, Suzanne Bamonto² ¹University of Brasilia, Brazil; ²Rochester Institute of Technology, USA; mcrisjoly@gmail.com mcrisjoly@gmail.com Curriculum-Based Measurement (CBM) is a set of assessment procedures designed to provide accurate and reliable, yet efficient, indicators of student performance in the basic skill areas of reading, writing, and mathematics. Educators use CBM for universal screening and progress monitoring supported by a number of studies establishing good reliability and validity and linking performance on the measures, particularly the oral reading measure, to performance on state-administered high stakes tests. Such a measurement system currently does not exist in Brazil, therefore teachers often rely on their informal measures of student performance to guide instruction, and school administrators and policymakers to guide programmatic decisions. The purpose of this session is to describe a cross-cultural project aimed at investigating the suitability of CBM for schools in Brazil. The paper presentation will include an overview of the education systems in the U.S. and Brazil and how CBM fits in. Results of some initial reliability and validity studies using two mathematics probes administered to a group of third-grade students will be presented, including test-retest and alternate-form reliability coefficients and exploratory factor analysis. Preliminary implications for implementation and plans for follow-up studies will be discussed. Longitudinal factorial invariance of a Childhood Career Exploration measure Iris Martins Oliveira¹, Maria do Ceu Taveira¹, Erik J. Porfeli² ¹University of Minho, Portugal; ²Northeast Ohio Medical University, USA; ioliveira@psi.uminho.pt ioliveira@psi.uminho.pt Career exploration is a central process of childhood career development, sustaining an emerging sense of self and learning about life-roles and work. Self-report measures have been used with middle-school children, but often present theoretical and psychometric limitations, lacking evidence of temporal validity. This study examines the longitudinal factorial invariance of the Childhood Career Exploration Inventory (CCEI) over a 14-month period spinning fifth- and sixth-grades. The CCEI is a self-report measure of middle-school children’s career exploration, yielding scores for three subscales and total career exploration. Attrition did not rely on gender, region, or previous CCEI scores. Analyses were derived from a final data set of 437 Portuguese children of both genders (M(age) first wave = 10.23). A hierarchical factor model constituted the baseline model. Results suggested configural and metric invariance of the first- and second-order factors over time for genders. The CCEI presented acceptable reliability at each time and relative construct stability, except from the third to the fourth occurrences of measurement. These results support the use of the CCEI with girls and boys to investigate change in career exploration over fifth- and sixth-grades. Possible reasons for the relative construct instability from the third to the fourth occurrences of measurement are discussed.
4:30pm - 6:00pm	PA9: Humor and Playfulness Session Chair: René Proyer
KOL-G-209 (Ⅲ)
	Assessing four facets of adult playfulness: The OLIW René Proyer University of Zurich, Switzerland; r.proyer@psychologie.uzh.ch r.proyer@psychologie.uzh.ch Based on a series of studies a new facet model of adult playfulness has been derived; it is argued that playfulness consists of (a) an other-directed; (b) a lighthearted; (c) an intellectual; and (d) a whimsical component (OLIW). The proposed structure was supported in exploratory and confirmatory factor analyses using several samples between n= 238 and n= 814. Correlations with other playfulness instruments and measures of broader personality traits were in the expected direction. Self- and peer-ratings of homologous scales were in the expected range (all ≥ .35; n = 120, ongoing data collection). The four scales were also positively related with averaged ratings of daily playful activities (over 14 days) of 364 adults; coefficients were between .33 and .35. Overall, the data are encouraging and may break the ground for a stronger consideration of research in playfulness as a personality trait in adults. Do others know my sense of humor? The self-peer convergence of comic and humor styles Sonja Heintz, Willibald Ruch University of Zurich, Switzerland; s.heintz@psychologie.uzh.ch s.heintz@psychologie.uzh.ch Besides aiming to measure a general “sense of humor”, specific tests have been developed to measure eight comic styles (sensu Schmidt-Hidding) and four humor styles (representing everyday functions of humor). The former comprises the eight comic styles of fun, humor, nonsense, wit, irony, satire, sarcasm, and cynicism. The latter measures two presumably adaptive humor styles (affiliative and self-enhancing) and two presumably maladaptive ones (aggressive and self-defeating). The present study investigates the self-peer convergence of these 12 styles in multitrait-multimethod analyses (i.e., 12 traits × 2 methods), employing one to two peer raters per participant. The findings revealed a good convergent validity for all comic and humor styles, except for the self-defeating humor style (< .30). Discriminant validity was high with two exceptions: Self-reported satire and sarcasm were also strongly related to peer-rated cynicism. Thus, the self-defeating humor style did show a low self-peer convergence, and the peers could hardly distinguish between the three rather criticizing comic styles of satire, sarcasm, and cynicism. Implications of these findings will be discussed. Psychometric properties of the Slovak version of BENCOR (preliminary findings) Robert Durka¹, Willibald Ruch² ¹Catholic University Ruzomberok, Slovakia; ²University of Zurich, Switzerland; robert.durka@ku.sk robert.durka@ku.sk The BENCOR (Ruch, 2012) is a 12-item self-report instrument for measuring benevolent and corrective humor. Benevolent humor describes a humorous outlook on life that entails the realistic observations of human weaknesses (and the imperfection of the world), but also their benevolent and forgiving humorous treatment. In contrast, corrective humor involves moral based ridicule; i.e., the use of mockery to fight badness and mediocrity. The English version of the scale was translated to Slovak language according to standard procedures including an independent back translation. Altogether 194 Slovak university students filled out a paper-and-pencil version of the questionnaire. Principal component analysis with direct oblimin rotation identified two factors that explained 45.84 % of the variance. The benevolent humor items had loadings ranging between .46 and .73, the corrective humor items had loadings ranging between .52 and .78 on the defining factors. The two scale were internally consistent (Cronbach`s α = .70/.77). Two items has somewhat lower item total correlations and were reworded. The scale with the refined items was administered to another sample of university students together with some validation criteria. The results and further applications will be discussed. Personality characteristics of witty students: Histrionic self-presentation style and (active) katagelasticism Filip Sulejmanov¹, Karl-Heinz Renner² ¹Ss. Cyril and Methodius University, Macedonia; ²Universität der Bundeswehr München, Germany; filip.sulejmanov@gmail.com filip.sulejmanov@gmail.com The objectives of this study were three fold. First, evaluation of the psychometric properties of the Macedonian version of the As-If-Scale (Renner et. al., 2008), next, comparison of the self-assessed wittiness with the ratings of peers, and finally, prediction of peer-rated wittiness by histrionic self- presentation style and (active) katagelasticism (Renner & Heydasch 2010, Ruch & Proyer 2009). MAP test and the parallel analysis led to an estimate of 1 component to retain for the Macedonian version of the As-If-Scale (103 participants; M = 18.51 years, SD = 1.01; 67 female, 36 male). To address the remaining objectives, a sample of 83 subjects (M = 17.70 years, SD = .49; 30 male, 53 female) from a high school in Macedonia was used. Students in each class rated all other students from their class how witty they are on a 4-point scale. There was only low positive correlation between self-assessed wittiness and peer-ratings of wittiness (r = .25, p < .05). Two hierarchical regression analyses revealed that gender (male), low (active) katagelasticism, and high histrionic self-presentation were all predictors of peer-rated wittiness scores. The discussion is mainly focused on the strengths and weaknesses of different methodologies used to assess who is humorous in the classroom.
4:30pm - 6:00pm	PA10: Cross-Cultural Assessment and Adaptation Session Chair: Isabel Benítez Baena
KOL-G-221 (Ⅴ)
	The assessment of bias between groups from different cultures speaking the same language Isabel Benítez Baena^1,2, José-Luis Padilla García¹, Fons van de Vijver^2,3,4 ¹University of Granada, Spain; ²Tilburg University, The Netherlands; ³University of Queensland, Australia; ⁴North-West University, South Africa; ibenitez@ugr.es ibenitez@ugr.es Numerous studies have been conducted aimed at discovering sources of bias in cross-cultural research. Translation issues have been targeted as one of the most impacting sources when developing different linguistic versions of tests and questionnaires. However, little is not known about sources of bias when groups involved are sharing the language. The aim of the study is to identify and investigate sources of bias when comparing groups from different cultures speaking the same language. Spanish speakers from Spain, Colombia and Mexico responded to “Quality of Life” (QoL) scales in their respective countries. In addition, immigrants with Spanish as mother tongue living in Spain also took part in the study. Responses to QoL scales were compared and inquired by conducting qualitative procedures with some of participants from those different groups. Results were integrated and conclusions about sources of bias are formulated in terms of cultural issues provoking differences between groups and elements of bias maintained and neutralized when living or not in the same country. Exploring the relationship between the Greek taxonomic study and the HEXACO personality structure in a Greek population sample Christoforos Thomadakis¹, Ioannis Tsaousis² ¹Hellenic National Defence General Staff, Greece,; ²University of Crete, Greece; christthomad@gmail.com christthomad@gmail.com The study of personality has received an increased research discussion within the Big Five domain and how personality domains can be summarized and explained within this era. However, recent studies have brought up a newer, more representative structure called the HEXACO, which consists of six rather than five dimensions. Under this perspective, the aim of this study was to explore the emergence of the HEXACO personality model in the Greek culture. Using the initial Greek taxonomic work by Saucier et al., (2005), we attempted to identify whether the HEXACO model could be replicated. The sample consisted of 3400 individuals candidates (ages 18-21 years old) for the Greek Military academies, also refered to as young cadets. Adopting modern psychometric techniques (i.e. EFA and CFA) we found that a six-factor solution best represents the data from the Greek sample. The results from this analysis suggest that this six-dimensional structure of personality is similar to the HEXACO model verified in other cultures. The results from this analysis could help both practitioners and researchers to assess personality characteristics in a variety of organizational settings using a comprehensive theoretical framework. Psychological distress among low-income U.S.- and foreign-born women of Mexican descent: Impact of acculturation Venera Bekteshi¹, Qingwen XU², Thanh Tran³ ¹University of Illinois, USA; ²Tulane University, USA; ³Boston College, USA; venerab@illinois.edu venerab@illinois.edu After testing the capacity of Kessler’s psychological distress (K6) scale to measure equally across low-income Mexican-born women and U.S.-born women of Mexican descent, this study assessed the impact of acculturation on this group’s psychological distress. Using the California Health Interview Survey (CHIS) 2007, we involved 1,138 low-income Mexican women, 319 born in U.S. and 819 born in Mexico. We employed descriptive and confirmatory factor analyses to test the cross-cultural equivalence of K6. Multivariate and logistic regression were used to test the association between acculturation and psychological distress among all participants. The key statistics which determined the cross-cultural equivalence of K6 were χ2 (>0.05), Root Mean Square Error of Approximation (RMSEA) (<0.05), Tucker Lewis Index (TLI) and Comparative Fit Index (> .90). The cross-cultural equivalence analysis demonstrated that some of the scale’s items had the capacity to measure psychological distress equally among participants. Regression indicated that the more acculturated these women became, the greater their psychological distress.The study recommends that researchers emphasize the cross-cultural equivalence of their measures and suggests a heightened awareness among practitioners of the multidimensional impact of acculturation on well-being of their clients.
6:15pm - 7:45pm	Members Meeting
KO2-F-180 (Ⅵ)	Members Meeting