KRITIK JURNAL NASIONAL Judul Volume : Proses Berpikir Kreatif berdasarkan Wallas Model dalam Memecahkan Masalah Matematika : 1. Hal. 177-184 Penulis : Hevy Risqi Maharani, Sukestiyarno, Budi Waluya Sumber : International Journal on Emerging Mathematics Education (IJEME) Pendahuluan Jurnal yang berjudul “Proses Berpikir Kreatif berdasarkan Wallas Model dalam Memecahkan Masalah Matematika” merupakan penelitian dari Hevy Risqi Maharani, Sukestiyarno, dan Budi Waluya pada tahun 2017. Isi jurnal terdiri atas judul, nama penulis, abstrak dalam Bahasa inggris, kata kunci, pendahuluan, perangkat penelitian, metode yang di gunakan, teori dan contoh, penutup dan daftar pustaka. Secara garis besar jurnal tersebut membahas tentang mengembangkan proses berfikir kreatif, masalah matematika dan metodenya. The Problem (masalah ) 1. Latar belakang masalah dalam jurnal ini yaitu Suatu hal yang sulit bagi guru SMP adalah memposisikan dan mengembangkan anak didiknya yang masih pada masa transisi dalam berpikir kreatif. 2. Masalah dalam jurnal ini Pada kenyataannya, yang sering terjadi dalam pembelajaran matematika adalah siswa diberi masalah tertutup untuk dipecahkan. Siswa tidak diajar untuk menggunakan pemikiran dan penalaran yang berbeda yang penting bagi mereka untuk memahami konteksnya sehingga mereka memberikan respons positif dan berpartisipasi aktif dalam proses pembelajaran. The Design 1. Pembahasan maupun isi dalam jurnal ini sudah sesuai dalam menjawab permasalahan yang disampaikan serta tujuannya yaitu memperkenalkan Salah satu model proses berpikir kreatif disajikan oleh Wallas (2014). Model ini terdiri dari empat tahap berbeda: persiapan, inkubasi, iluminasi, dan verifikasi.. The Prosedure 1. Pada jurnal sudah di sebutkan beberapa bahan, tahap dan pengujiannya. Meskipun peneliti hanya menampilkan gambar/foto dari hasil tes. 2. Kemudian terdapat tabel yang mencakup kegiatan-kegiatan siswa selama menggunakan model wallas ini. The Measurement Pada bagian teori memang sudah dijelaskan secara teorinya, namun saya lihat tidak ada sesuatu angka atau apapun yang dihitung. Mungkin saya kira karena penelitian ini adalah deskriptif kualitatif. The Interpretation 1. Kesimpulan dan hasil penelitian dalam jurnal ini konsisten sehingga pembaca dapat memahaminya secara jelas 2. Namun dengan menggunakan model Wallas masih tetap tidak bisa mengatasi dalam kasus siswa pada tingkat berpikir kreatif rendah dan sedang. . KRITIK JURNAL NASIONAL Judul : Development of Student Worksheets to Improve the Ability of Mathematical Problem Posing Volume : 1. Hal. 1-10 Penulis : Harry Dwi Putra, Tatang Herman, Utari Sumarmo Sumber : International Journal on Emerging Mathematics Education (IJEME) Pendahuluan Jurnal yang berjudul “Pengembangan Lembar Kerja Siswa untuk Meningkatkan Kemampuan Posing Masalah Matematika” merupakan penelitian dari Harry Dwi Putra, Tatang Herman, Utari Sumarmo (2017). Isi jurnal terdiri atas judul, nama penulis, abstrak dalam Bahasa inggris dan Bahasa Indonesia, kata kunci, pendahuluan, perangkat penelitian, metode yang di gunakan, teori dan contoh, penutup dan daftar pustaka. Secara garis besar jurnal tersebut membahas tentang lembar kerja siswa, pendekatan ilmiah, bagaimana jika bukan strategi, masalah matematika. The Problem (Masalah) 3. Latar belakang masalah dalam jurnal ini yaitu Bagaimana mengembangkan lembar kerja siswa melalui pendekatan ilmiah dengan bagaimana jika tidak strategi untuk meningkatkan kemampuan siswa dalam masalah matematika 4. Namun qpa dampak dari penggunaan lembar kerja siswa melalui pendekatan ilmiah dengan bagaimana jika tidak strategi pada kemampuan siswa dalam masalah matematika The Design 3. Pembahasan maupun isi dalam jurnal ini sudah sesuai dalam menjawab permasalahan yang disampaikan yaitu dengan literature lembar kerja siswa untuk dikembangkan agar dapat menaikkan tingkat kemampuan posing siswa dalam menyelesaikan masalah matematika. The procedure 1. Pada jurnal sudah di sebutkan beberapa bahan, tahap dan pengujiannya. 2. Meskipun akan lebih rapih bila diurutkan dengan angka/huruf untuk menjdabarkan perlangkahnya. The Measurment 1. Pada bagian perhitungan terlihat cukup jelas, namun ada beberapa bagian yang agak sulit dimengerti pada rumus yang digunakan. 2. Namun tabel yang digunakan sudah cukup jelas untuk melihat hasil angkanya. The interpretation 1. Kesimpulan dan hasil penelitian dalam jurnal ini cukup jelas, namun pada bagian ini hanya sedikit yang dibahas dari hasil penelitian yaitu langsung ke point/inti hasil dilapangan. 2. Hasil penelitian pada bagian diskusi sangat jelas untuk diargumentasikan. KRITIK JURNAL NASIONAL Judul : Penilaian Kapasitas Berpikir Logis pada Siswa Kelas Tujuh (ASSESSMENT OF SEVENTH GRADE STUDENTS’ CAPACITY OF LOGICAL THINKING) Volume : 1. Hal. 75-80 Penulis : M. Fadiana, S. M. Amin , A. Lukito , A. Wardhono , S. Aishah Sumber : Jurnal Pendidikan IPA Indonesia Pendahuluan Jurnal yang berjudul “Penilaian Kapasitas Berpikir Logis pada Siswa Kelas Tujuh” merupakan penelitian dari M. Fadiana, S. M. Amin , A. Lukito , A. Wardhono , S. Aishah (2019). Isi jurnal terdiri atas judul, nama penulis, abstrak dalam Bahasa inggris, kata kunci, pendahuluan, perangkat penelitian, metode yang di gunakan, teori dan contoh, penutup dan daftar pustaka. Secara garis besar jurnal tersebut membahas tentang berpikir logis, operasional konkret, operasional formal dan perilaku kognitif siswa sekolah menengah. The Problem (masalah ) 4. Latar belakang masalah dalam jurnal ini yaitu peneliti sering menemukan bahwa guru jarang mengukur keterampilan berpikir logis siswa sebelum merancang strategi pembelajaran yang akan dilakukan. 5. Apa perbedaan mendasar yang ditemukan antara siswa operasional formal dan siswa transisi. The Design 2. Pembahasan maupun isi dalam jurnal ini sudah sesuai dalam menjawab permasalahan yang disampaikan serta tujuannya memperkenalkan Instrumen yang digunakan dalam penelitian ini adalah Logical Thinking Test (LTT) yang direvisi dari Group Assessment of Logical Thinking (GALT). GALT diciptakan untuk mengukur enam jenis penalaran yang berbeda; yaitu penalaran konservasional, penalaran proporsional, variabel kontrol, penalaran kombinatorial, penalaran probabilistik, dan penalaran korelasional (Roadrangka et al., 1983). The Prosedure 5. Pada jurnal sudah di sebutkan beberapa bahan, tahap dan pengujiannya. Meskipun peneliti hanya menampilkan gambar/foto dari hasil tes. 6. Kemudian terdapat tabel yang mencakup beberapa hal terkait hasil tes dan perbandingan siswa. The Measurement Pada bagian perhitungan terlihat sangat baik dan rapih untuk pembaca. Namun tak ada gambar atau lampiran/foto contoh survei yang digunakan, agar menjadi referensi juga bagi peneliti lain. The Interpretation 3. Kesimpulan dan hasil penelitian dalam jurnal ini cukup jelas, namun kata-kata yang disajikan masih kurang rapih dan agak sulit dipahami sehingga pembaca kurang memahami jelas 4. Paragraph artikel akan mudah dibaca bila tidak dibagi jadi 2 kolom perhalaman. KRITIK JURNAL NASIONAL Judul : Using the 5E Learning Cycle with Metacognitive Technique to Enhance Students’ Mathematical Critical Thinking Skills Volume : 1 No. 1 Hal. 87-98 Penulis : Runisah, Tatang Herman, Jarnawi Afgani Dahlan Sumber : International Journal on Emerging Mathematics Education (IJEME) Pendahuluan Jurnal yang berjudul “Menggunakan Siklus Belajar 5E dengan Teknik Metakognitif untuk Meningkatkan Keterampilan Berpikir Kritis Matematis Siswa” merupakan penelitian dari Runisah, Tatang Herman, Jarnawi Afgani Dahlan (2017). Isi jurnal terdiri atas judul, nama penulis, abstrak dalam Bahasa inggris, kata kunci, pendahuluan, perangkat penelitian, metode yang di gunakan, teori dan contoh, penutup dan daftar pustaka. Secara garis besar jurnal tersebut membahas tentang keterampilan berpikir kritis matematika, siklus belajar 5E, teknik metakognitif. The Problem Latar belakang masalah dalam jurnal ini yaitu Hasil Tren dalam Studi Matematika dan Sains Internasional (TIMSS) pada tahun 2007 dan 2011 menunjukkan bahwa skor rata-rata pencapaian mata pelajaran matematika pada tahun 2011 adalah 38 th peringkat dari 42 negara yang berpartisipasi. Salah satu faktor yang mempengaruhi kurangnya kemampuan berpikir kritis matematis siswa adalah proses pembelajaran. Di Indonesia, praktik mengajar difokuskan pada konten materi dan mengabaikan pengembangan kemampuan berpikir siswa (Rohaeti, 2010). Dari hasil inkuiri Balitbang Depdiknas pada 2007, implementasi pembelajaran di Indonesia umumnya masih menggunakan metode ceramah dan tanya jawab (Balitbang Depdiknas, 2007). Berbagai masalah dalam aspek kehidupan terjadi pada 21 st abad. Untuk mengatasi masalah ini, keterampilan berpikir kritis diperlukan. The Design Pembahasan maupun isi dalam jurnal ini sudah sesuai dalam menjawab permasalahan yang disampaikan serta tujuannya mempekenalkan 5E Learning Cycle (LC) dengan Metacognitive technique (LCM) adalah model pembelajaran yang mengintegrasikan langsung teknik Metacognitive di setiap tahap LC. LC memiliki lima tahap yaitu: terlibat, mengeksplorasi, menjelaskan, menguraikan dan mengevaluasi. metakognisi adalah kesadaran seseorang tentang proses kognitif dan kemandirian untuk mencapai hasil. The Prosedure Pada jurnal sudah di sebutkan beberapa bahan, tahap dan pengujiannya. Secara keseluruhan sudah cukup bagus.. The Interpretation Kesimpulan dan hasil penelitian dalam jurnal ini cukup jelas. Hasil data yang diberikan sudah memuat jawaban dari masalah-masalah yang dcari. KRITIK JURNAL NASIONAL Judul : PROSPECTIVE MATHEMATICS TEACHERS’ COGNITIVE COMPETENCIES ON REALISTIC MATHEMATICS EDUCATION Volume : 11. No. 1 Hal. 17-44 Penulis : Rezan Yilmaz Sumber : Journal on Mathematics Education (JME) Pendahuluan Jurnal yang berjudul “KOMPETENSI KOGNITIF GURU MATEMATIKA PROSPEKTIF DALAM PENDIDIKAN MATEMATIKA REALISTIS” merupakan penelitian dari Rezan Yilmaz (2020). Isi jurnal terdiri atas judul, nama penulis, abstrak dalam Bahasa inggris, kata kunci, pendahuluan, perangkat penelitian, metode yang di gunakan, teori dan contoh, penutup dan daftar pustaka. Secara garis besar jurnal tersebut membahas tentang Pendidikan Matematika Realistik, Calon Guru Matematika, Kompetensi Kognitif, Masalah Kontekstual. The Problem Dalam hal ini, studi terutama berfokus pada RME dalam hal pendekatan dan jawaban untuk pertanyaan "Bagaimana kognitifnya kompetensi calon guru matematika terkait dengan RME? "telah dicari. Dengan demikian, submasalah "Bagaimana calon guru matematika menjelaskan RME dan implementasinya? Bagaimana Apakah mereka menafsirkan persamaan dan perbedaan dengan pendekatan lain? Bagaimana masalah kontekstual lakukan mereka berpose sesuai dengan struktur teoritis RME? ". The Design Pembahasan maupun isi dalam jurnal ini sudah sesuai dalam menjawab permasalahan yang disampaikan serta tujuannya. Desain penelitian adalah studi kasus, yang telah dievaluasi sebagai kasus RME The Prosedure Pada jurnal sudah di sebutkan beberapa bahan, tahap dan pengujiannya. Secara keseluruhan sudah cukup bagus. Tahap-tahap teknis dijalankan secara beurut dan rapih, namun untuk RME yang digunakan masih pada tahap yang biasa atau mainstream. The Interpretation Kesimpulan dan hasil penelitian dalam jurnal ini cukup jelas. Hasil data yang diberikan sudah memuat jawaban dari masalah-masalah yang dcari. Namun, masih diperlukan RME dikoneksikan ke internet atau sosmed yang relevan untuk mencapai pembelajaran yang bermakna dalam matematika. KRITIK JURNAL INTERNASIONAL Judul : ELEMENTARY PRESERVICE TEACHERS’ KNOWLEDGE, PERCEPTIONS AND ATTITUDES TOWARDS FRACTIONS: A MIXED-ANALYSIS Volume : 11 No. 1 Hal. 59-76 Penulis : Roslinda Rosli, Dianne Goldsby , Anthony J. Onwuegbuzie , Mary Margaret Capraro , Robert M. Capraro , Elsa Gonzalez Y. Gonzalez Sumber : Journal on Mathematics Education (JME) Pendahuluan Jurnal yang berjudul “PENGETAHUAN PENGAJARAN, PERSEPSI DAN SIKAP GURU SD TERHADAP PECAHAN: ANALISIS CAMPURAN” merupakan penelitian dari Roslinda Rosli, Dianne Goldsby , Anthony J. Onwuegbuzie , Mary Margaret Capraro , Robert M. Capraro , Elsa Gonzalez Y. Gonzalez (2020). Isi jurnal terdiri atas judul, nama penulis, abstrak dalam Bahasa inggris, kata kunci, pendahuluan, perangkat penelitian, metode yang di gunakan, teori dan contoh, penutup dan daftar pustaka. Secara garis besar jurnal tersebut membahas tentang Sekolah Dasar, Masalah posing, Program Persiapan Guru, Calon Guru, Metode Campuran. The Problem (masalah ) Latar belakang masalah dalam jurnal ini yaitu Sejumlah literatur telah mendokumentasikan bahwa mayoritas guru dalam jabatan dan guru pra-jabatan memiliki pengetahuan mendalam yang terbatas untuk mengajar matematika. Terlepas dari bukti ini, sedikit yang diketahui tentang sifat pengetahuan untuk mengajar fraksi, persepsi dan sikap yang harus dimiliki guru dipelihara selama persiapan guru, tempat di mana guru seharusnya mendapatkan pengajaran mereka. The Design Pembahasan maupun isi dalam jurnal ini sudah sesuai dalam menjawab permasalahan yang disampaikan serta tujuannya. Dalam studi ini, peneliti fokus pada mengukur integrasi modul Koneksi Matematika TEKS dengan model pembelajaran beton praktik dalam memfasilitasi konstruksi pengetahuan guru pra-layanan, persepsi dan sikap terhadap pecahan selama program persiapan guru. The Prosedure Pada jurnal sudah di sebutkan beberapa bahan, tahap dan pengujiannya. Meskipun peneliti hanya menampilkan data secara penuh tulisan, tanpa ada bantuan gambar/foto maupun diagram. The Measurement Pada bagian perhitungan terlihat sangat baik, namun tidak rapih untuk pembaca. Tak ada gambar atau lampiran/foto contoh survei yang digunakan, agar menjadi referensi juga bagi peneliti lain. The Interpretation Kesimpulan dan hasil penelitian dalam jurnal ini cukup jelas, namun kata-kata yang disajikan masih kurang rapih dan agak sulit dipahami sehingga pembaca kurang memahami jelas. Masih butuh penambahan data dari beberapa instansi terkait subjek penelitian untuk melengkapi hasil data yang disimpulkan. KRITIK JURNAL INTERNASIONAL Judul : The Relation Between Reinforcement Learning Parameters And The Influence Of Reinforcement History On Choice Behavior Volume : 66. Hal. 59-69 Penulis : Kentaro Katahira Sumber : Journal of Mathematical Psychology Pendahuluan Jurnal yang berjudul “Hubungan Antara Parameter Pembelajaran Penguatan Dan Pengaruh Sejarah Penguatan Pada Perilaku Pilihan” merupakan penelitian dari Kentaro Katahira (2015). Isi jurnal terdiri atas judul, nama penulis, abstrak dalam Bahasa inggris, kata kunci, pendahuluan, perangkat penelitian, metode yang di gunakan, teori dan contoh, penutup dan daftar pustaka. Secara garis besar jurnal tersebut membahas tentang Penguatan pembelajaran, Sejarah ketergantungan, Model regresi, dan Analisis berbasis model. The Problem Dalam hal ini, studi terutama berfokus pada menyelidiki hubungan antara parameter model RL dan model regresi akan memberikan informasi berharga tentang faktor perilaku mana yang mungkin mendasari perbedaan dalam parameter model. Sebaliknya, dengan menggunakan relasi, orang dapat memprediksi jenis urutan perilaku mana yang dapat diharapkan diberikan seperangkat parameter model tertentu. The Design Pembahasan maupun isi dalam jurnal ini sudah sesuai dalam menjawab permasalahan yang disampaikan serta tujuannya memperkenalkan Model Reinforcement learning (RL) yang telah banyak digunakan untuk menganalisis perilaku pilihan manusia dan hewan lain dalam berbagai bidang, termasuk psikologi dan ilmu saraf. Model berbasis regresi linier yang secara eksplisit mewakili bagaimana hadiah dan sejarah pilihan mempengaruhi pilihan masa depan juga telah digunakan untuk memodelkan perilaku pilihan The Prosedure Pada jurnal sudah di sebutkan beberapa bahan, tahap dan pengujiannya. Secara keseluruhan sudah cukup bagus. Tahap-tahap teknis dijalankan secara beurut dan rapih, namun untuk Untuk mencapai tujuan ini, diperlukan perhitungan analitik dan simulasi numerik karena terdapat materi tugas-tugas probalitas. The Interpretation Kesimpulan dan hasil penelitian dalam jurnal ini cukup jelas. Hasil data yang diberikan sudah memuat jawaban dari masalah-masalah yang dicari. Namun, masih belum bisa menentukan kebiasaan seseorang atau siswa dalam membuat pilihan. KRITIK JURNAL INTERNASIONAL Judul : An Analysis for the Qualitative Improvement of Education and Learning based on the Way of Learner Errors in Descriptive Questions Volume : 15. No. 3 Penulis : Michiko Tsubaki, Wataru Ogawara, Kenta Tanaka Sumber : INTERNATIONAL ELECTRONIC JOURNAL OF MATHEMATICS EDUCATION Pendahuluan Jurnal yang berjudul “Suatu Analisis untuk Peningkatan Kualitatif Pendidikan dan Pembelajaran berdasarkan Kesalahan Pelajar dalam Pertanyaan Deskriptif” merupakan penelitian dari Michiko Tsubaki, Wataru Ogawara, dan Kenta Tanaka (2020). Isi jurnal terdiri atas judul, nama penulis, abstrak dalam Bahasa inggris, kata kunci, pendahuluan, perangkat penelitian, metode yang di gunakan, teori dan contoh, penutup dan daftar pustaka. Secara garis besar jurnal tersebut membahas tentang pembelajaran dan pendidikan dalam probabilitas dan statistik, pertanyaan deskriptif, ekstraksi karakteristik jawaban, analisis kesalahan, analisis jaringan Bayesian, dan penalaran probabilistic. The Problem Dalam hal ini, menyelidiki hal yang telah disimpulkan oleh Kahneman dan Tversky (1982), Sulistyani (2019) menunjukkan bahwa ada 4 tahap kesalahan siswa dalam statistik inferensial. 1) Kesalahan dalam pemahaman terjadi karena siswa tidak dapat membaca tabel statistik atau membaca output dalam pertanyaan. 2) Kesalahan transformasi terjadi karena siswa tidak tepat dalam menerapkan / memilih jenis statistik uji yang digunakan atau menulis hipotesis. 3) Kesalahan keterampilan proses terjadi karena siswa kurang berhati-hati dalam menghitung dan ketidakmampuan untuk menginterpretasikan hasil perhitungan. 4) Kesalahan dalam tahap pengkodean terjadi karena siswa tidak menjawab dengan benar atau tidak tepat dalam menarik kesimpulan dalam pengujian hipotesis. Hasil-hasil ini menunjuk pada sifat-sifat yang khas pada isi pembelajaran di bidang probabilitas dan statistik, menunjukkan kecenderungan yang mungkin dalam cara para pelajar probabilitas dan statistik membuat kesalahan. The Design Pembahasan maupun isi dalam jurnal ini sudah sesuai dalam menjawab permasalahan yang disampaikan serta tujuannya. Penelitian ini mengusulkan ekstraksi karakteristik jawaban sebagai variabel, berdasarkan pada bentuk kesalahan yang menjadi ciri perbedaan dalam tingkat pemahaman peserta didik menggunakan pertanyaan deskriptif lengkap terkait dengan probabilitas dan statistik. Kemudian, dengan menggunakan variabel-variabel ini, kami melakukan analisis pohon regresi pada pemahaman peserta didik dari konten penelitian untuk setiap masalah, serta analisis bagan karakteristik item untuk memperkirakan pemahaman umum, dan analisis relasional antara pemahaman dan strategi belajar menggunakan Jaringan Bayesian. Dengan demikian, selanjutnya kami mengusulkan metode untuk mengekstraksi kebijakan pembelajaran yang efektif atau strategi pembelajaran untuk peserta didik secara keseluruhan, serta peserta didik individu, melalui penalaran probabilistik berdasarkan jaringan Bayesian. The Prosedure Pada jurnal sudah di sebutkan beberapa bahan, tahap dan pengujiannya. Secara keseluruhan sudah cukup bagus. Tahap-tahap teknis dijalankan secara beurut dan rapih, namun untuk Untuk mencapai tujuan ini, diperlukan perhitungan analitik dan simulasi numerik karena terdapat materi tugas-tugas probalitas. The Interpretation Kesimpulan dan hasil penelitian dalam jurnal ini cukup jelas. Hasil data yang diberikan sudah memuat jawaban dari masalah-masalah yang dcari. Namun, disarankan adanya strategi pembelajaran yang efektif secara terpisah untuk setiap konsep dalam setiap pertanyaan, serta untuk setiap tingkat keahlian, apakah itu dasar (memahami definisi, dll.) Atau diterapkan (Sadar objek derivasi, deskripsi lengkap, dll. ), dalam karakteristik jawaban. KRITIK JURNAL INTERNASIONAL Judul : A SPARSE LATENT CLASS MODEL FOR COGNITIVE DIAGNOSIS Volume : 85. No. 1 Hal. 121-153 Penulis : Yinyin Chen, Steven Culpepper, dan Feng Liang Sumber : Psychometrika Pendahuluan Jurnal yang berjudul “MODEL KELAS LATEN YANG JARANG UNTUK DIAGNOSIS KOGNITIF” merupakan penelitian dari Yinyin Chen, Steven Culpepper, dan Feng Liang (2020). Isi jurnal terdiri atas judul, nama penulis, abstrak dalam Bahasa inggris, kata kunci, pendahuluan, perangkat penelitian, metode yang di gunakan, teori dan contoh, penutup dan daftar pustaka. Secara garis besar jurnal tersebut membahas tentang model kelas laten jarang, pemilihan variabel Bayesian, dan identifikasi. The Problem Dalam hal ini, menyelidiki hal yang telah disimpulkan pra-menentukan CDM yang tepat dapat sulit dalam praktiknya, terutama ketika tidak ada pengetahuan sebelumnya tentang tes yang tersedia. Lebih lanjut, ada kemungkinan bahwa berbagai pertanyaan (item) dalam satu pengujian perlu dimodelkan oleh CDM yang berbeda. Misalnya, tes matematika dapat mencakup pertanyaan yang dapat diselesaikan dengan menggunakan keterampilan yang berbeda, yang menyiratkan model disjungtif The Design Pembahasan maupun isi dalam jurnal ini sudah sesuai dalam menjawab permasalahan yang disampaikan serta tujuannya memperkenalkan Cognitive diagnostic models (CDMs) adalah variabel laten yang dikembangkan untuk menyimpulkan keterampilan laten, pengetahuan, atau kepribadian yang mendasari respons terhadap tes dan tindakan pendidikan, psikologis, dan ilmu sosial. Penelitian terbaru berfokus pada teori dan metode untuk menggunakan model kelas laten jarang (SLCMs) dalam cara eksplorasi untuk menyimpulkan proses laten dan struktur respon yang mendasarinya. The Prosedure Pada jurnal sudah di sebutkan beberapa bahan, tahap dan pengujiannya. Secara keseluruhan sudah cukup bagus. Tahap-tahap teknis dijalankan secara beurut dan rapih, namun Kontribusi penting untuk praktik adalah bahwa kondisi pengidentifikasian generik baru kami lebih cenderung terpenuhi dalam aplikasi empiris daripada kondisi yang ada yang memastikan pengidentifikasian yang ketat. Mempelajari struktur laten yang mendasarinya dapat dirumuskan sebagai masalah pemilihan variabel. The Interpretation Kesimpulan dan hasil penelitian dalam jurnal ini cukup jelas. Hasil data yang diberikan sudah memuat jawaban dari masalah-masalah yang dicari. Namun, pada bagian diskusi belum diberikan kesimpulan yang lebih kongkret lagi. KRITIK JURNAL INTERNASIONAL Judul : Teachers’ Real and Perceived of ICTs Supported-Situation for Mathematics Teaching and Learning Volume : 1. No. 1 Hal. 11-24 Penulis : Maman Fathurrohman, Anne L. Porter, Annette L. Worthy Sumber : International Journal on Emerging Mathematics Education (IJEME) Pendahuluan Jurnal yang berjudul “Keadaan Sebenarnya dan Persepsi Guru pada dukungan TIK-Situasi untuk Pengajaran dan Pembelajaran Matematika” merupakan penelitian dari Maman Fathurrohman, Anne L. Porter, Annette L. Worthy (2017). Isi jurnal terdiri atas judul, nama penulis, abstrak dalam Bahasa inggris, kata kunci, pendahuluan, perangkat penelitian, metode yang di gunakan, teori dan contoh, penutup dan daftar pustaka. Secara garis besar jurnal tersebut membahas tentang survai, pembelajaran berbasis teknologi, TIK, infrastruktur, fasilitas, dan sumber daya. The Problem Di bidang pendidikan, teknologi telah diadopsi sebagaimana dapat dilihat dalam kegiatan akademik sehari-hari, tidak hanya di universitas tetapi juga sekolah menengah dan dasar di Indonesia. Mengajar dan belajar dengan teknologi telah terjadi di banyak tempat karena manfaatnya memberikan keuntungan dalam membuat proses belajar mengajar lebih efektif dan efisien. Satu hal penting yang perlu dipertimbangkan adalah perspektif guru terkait komponen-komponen TIK. The Design Pembahasan maupun isi dalam jurnal ini sudah sesuai dalam menjawab permasalahan yang disampaikan serta tujuanny. Metode yang digunakan adalah survai. Instrumen yang digunakan adalah kuesioner, pedoman wawancara tidak terstruktur, dan handycam. The Prosedure Pada jurnal sudah di sebutkan beberapa bahan, tahap dan pengujiannya. Secara keseluruhan sudah cukup bagus. Meskipun latar belakang pendidikan resmi guru baik, pelatihan mereka, menghadiri atau berpartisipasi dalam pelatihan terkait TIK masih rendah. The Interpretation Kesimpulan dan hasil penelitian dalam jurnal ini cukup jelas. Hasil data yang diberikan sudah memuat jawaban dari masalah-masalah yang dicari. Namun, mungkin bisa disebutkan komponen yang paling dibutuhkan dan lebih efektif pada kesimpulan. psychometrika https://doi.org/10.1007/s11336-019-09693-2 A SPARSE LATENT CLASS MODEL FOR COGNITIVE DIAGNOSIS Yinyin Chen UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN Steven Culpepper UNIVERSITY OF ILLINOIS AT URBANA–CHAMPAIGN Feng Liang UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN Cognitive diagnostic models (CDMs) are latent variable models developed to infer latent skills, knowledge, or personalities that underlie responses to educational, psychological, and social science tests and measures. Recent research focused on theory and methods for using sparse latent class models (SLCMs) in an exploratory fashion to infer the latent processes and structure underlying responses. We report new theoretical results about sufficient conditions for generic identifiability of SLCM parameters. An important contribution for practice is that our new generic identifiability conditions are more likely to be satisfied in empirical applications than existing conditions that ensure strict identifiability. Learning the underlying latent structure can be formulated as a variable selection problem. We develop a new Bayesian variable selection algorithm that explicitly enforces generic identifiability conditions and monotonicity of item response functions to ensure valid posterior inference. We present Monte Carlo simulation results to support accurate inferences and discuss the implications of our findings for future SLCM research and educational testing. Key words: sparse latent class models, Bayesian variable selection, identifiability. 1. Introduction Cognitive diagnostic models (CDMs) are latent class models developed for the inference of educational, psychological, and social science tests. In CDMs, the latent variables are often defined as skills, knowledge, or personalities needed by a subject to solve a given test item. Consider a test that consists of J items and involves K skills. The observable response Y = (Y1 , . . . , Y J ) for a subject is a binary random vector, indicating the correctness of the subject’s answers to the J items. The latent class of a subject is indexed by a K -dimensional binary vector α, which are referred to as an attribute profile, which suggests the mastery of each skill. Given α, Y is modeled by a product of J independent Bernoulli random variables with parameter θ j,α = P(Y j = 1|α). In many CDMs, θ j,α depends on a K -dimensional binary vector q j , where element q jk = 1 if attribute k is relevant to item j and zero otherwise. The relevant skills to all items are usually presented by a J × K matrix, Q = [q 1 . . . , q J ]T , called the Q-matrix. CDMs provide a statistical framework to identify relevant attributes of test items and to classify the mastery/non-mastery of test subjects on those attributes, which provide useful insights for researchers and educators. Various CDMs have been proposed in the literature and they differ from each other in their assumptions on how θ j,α depends on the Q-matrix. For example, the Correspondence should be made to Steven Culpepper, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana–Champaign, 725 South Wright Street, Champaign, IL 61820, USA. Email: [email protected] © 2020 The Psychometric Society PSYCHOMETRIKA DINA (Deterministic Input, Noisy ‘And’ gate) model (Haertel 1989; Junker and Sijtsma 2001), the generalized DINA model (de la Torre 2011) and the reduced reparameterized unified (rRUM) model (Hartz 2002; Rupp et al. 2010) are conjunctive models where all relevant skills are needed to have the highest positive response probability. On the other hand, under disjunctive models, such as the DINO (Deterministic Input, Noisy ‘Or’ gate) model (Templin and Henson 2006), at least one relevant skill is needed, whereas compensatory models (e.g., see Davier (2005), for a special case of the general diagnostic model) allow students to compensate for missing some skills by having others. However, pre-specifying an appropriate CDM can be difficult in practice, especially when no prior knowledge of the test is available. Further, it is likely that different questions (items) in a single test need to be modeled by different CDMs. For instance, a mathematics test may include questions that can be solved using different skills, which implies a disjunctive model. The same test may also include questions that involve multiple steps and require students to master all relevant skills in order to get the final correct answer, which implies a conjunctive model. For such tests, the specification of a single CDM would fail to capture the real latent patterns. To address this issue, a novel, model-free approach was proposed by Chen et al. (2015), which is based on an alternative representation of CDMs via a mixture of generalized linear models (GLMs). In particular, the jth Bernoulli parameter for latent class α is modeled as K K β j,k αk + β j,kk αk αk + · · · + β j,12...K αk P Y j = 1 α, β j = β j,0 + k=1 k>k k=1 (1) where (·) is an arbitrary cumulative distribution function (CDF) and β j is a coefficient vector to be estimated. It can be shown that all of the aforementioned CDMs are special cases of (1) with particular sparsity patterns of β j . In addition, the sparsity pattern of β j provides information about which attributes are relevant. Consequently, the estimation of Q can be reformulated as a variable selection problem involving GLMs. A fundamental issue with latent mixture models is model identifiability. Throughout, identifiability is defined up to a permutation of the K attributes. That is, we do not discuss the trivial identifiability issue due to label switching, since it is a well-understood issue and we know how to handle it in practice. The first rigorous study on the identifiability of the Q-matrix was given by Liu et al. (2013) with a focus on DINA models with known guessing parameters. Chen et al. (2015) extended the result of Liu et al. (2013) to DINA and DINO models with unknown model parameters. Xu (2017) established identifiability conditions for general CDMs when Q is known; Xu and Shang (2017) later provided identifiability conditions for general CDMs when Q is unknown. Although the results from Xu (2017) and Xu and Shang (2017) are applicable to the general model based on latent mixture of GLMs, their identifiability conditions, which require two identity matrices embedded in Q, are too strong to be satisfied in practice. Recently, Fang et al. (2019) proposed identifiability conditions in terms of the distribution of Y rather than Q, but their conditions are still stronger than the ones needed for our result. In this paper, we provide a new set of generic identifiability conditions for general CDMs. Our conditions are weaker than the ones in the aforementioned papers since those papers studied conditions for strict identifiability. As stated by Allman et al. (2009), “…generic identifiability implies that the set of points for which identifiability does not hold has measure zero," which is enough for practical data analysis. For example, Bernoulli mixtures are not strictly identifiable (Goodman 1974; Gyllenberg et al. 1994a), but given Bernoulli mixtures are generically identifiable, they often lead to valid statistical inference in practice (Carreira-Perpiñán and Renals 2000; Allman et al. 2009). Results for generic identifiability are established in Allman et al. (2009) for YINYIN CHEN ET AL. latent class models, which cannot be directly applied to CDMs, since CDMs are restricted latent class models. However, we can extend their proof technique, which is based on the tensor product framework in Kruskal (1976, 1977), to handle CDMs. For a feasible implementation, we adopt a Bayesian approach to the estimation and variable selection of (1), and develop a Gibbs sampling algorithm for computation. Different from the vanilla Gibbs algorithm for Bayesian variable selection, our Gibbs algorithm is specially designed to ensure that each posterior draw of the sparse model parameters β j ’s is from the identifiable space, while the algorithms from Chen et al. (2015), Xu and Shang (2017) and Fang et al. (2019) cannot ensure that their estimation of Q or other model parameters satisfies their identifiability conditions. In a recent work, Chen et al. (2018) also used MCMC method to stochastically search over identifiable parameter space. However, Chen et al. (2018) focus on DINA and DINO models with strict identifiability conditions, whereas we develop an MCMC sampling algorithm for general CDMs with weaker, generic identifiability conditions. The remainder of this paper is organized as follows. Section 2 introduces the setup of the sparse latent class models. Section 3 addresses identifiability issues and discusses the generic identifiability conditions. Section 4 introduces the Gibbs sampler for posterior inference. We report results from a simulation study in Sect. 5 and results from two real applications in Sect. 6, and close with a discussion in Sect. 7. Proofs and other technical details are included in “Appendix”. 2. Model and Applications 2.1. Model Setup Consider a test consisting of J items and involving K latent skills. Let Y = (Y1 , . . . , Y J ) denote the J binary responses from a subject. Based on the mastery of the K skills, each subject has an attribute profile α ∈ {0, 1} K , where αk = 1 indicates that the subject masters skill k and zero otherwise. In SLCM, given the attribute profile α, the J binary variablesY1 , . .. , Y J are modeled as independent Bernoulli variables with Bernoulli parameter θ j,α = aTα β j where aα = 1, α1 , . . . , α K , α1 α2 , . . . , α K −1 α K , . . . , K T αk (2) k=1 is a 2 K -dimensional alternative representation of the binary vector α with aTα β j = β j,0 + K k=1 β j,k αk + k>k β j,kk αk αk + · · · + β j,12...K K αk . (3) k=1 The regression coefficients β j form a sparse vector, in which the nonzero elements represent the effects of skills or combinations of skills on the response of item j. In particular, where coefficients in (3) are nonzero imply the dependence between item j and the K skills. To identify which main effects and/or interactions of the K skills are relevant to item j, we introduce a 2 K -dimensional binary structure vector K δ j = (δ j,0 , δ j,1 , . . . , δ j,K , δ j,12 , . . . , δ j,(K −1)K , . . . , δ j,1...K )T ∈ {0, 1}2 , PSYCHOMETRIKA Table 1. Sparse patterns of various CDMs DINA DINO G-DINA NC-RUM C-RUM δ j,0 δ j,1 δ j,2 1 1 1 1 1 1 1 1 1 1 1 1 1 δ j,3 δ j,12 1 1 1 δ j,13 δ j,23 δ j,123 Note β j,1 = β j,2 = −β j,12 (x) = x (x) = exp(x) (x) = logit−1 (x) with 1 indicating that the corresponding coefficient is active, i.e., its β value is nonzero, and 0 indicating that the coefficient is inactive, i.e., its β value is zero. The intercept β j,0 is usually assumed to be active, so δ j,0 is fixed at 1. From now on, we will use B J ×2 K , referred to as the coefficient matrix, to denote the collection of β j ’s, and J ×2 K , referred to as the sparsity matrix, to denote the collection of δ j ’s. 2.2. Sparsity Patterns Many popular CDMs can be reparameterized as special cases of SLCM with particular sparsity patterns, including the DINA model, the DINO model, the G-DINA (Generalized DINA) model, the NC-RUM (reduced noncompensatory reparameterized unified model) (DiBello et al. 1995; Rupp et al. 2010), and the C-RUM (Compensatory-RUM) (Hagenaars 1993; Maris 1999). In Table 1, we present the sparsity patterns of these aforementioned CDMs, if being reparameterized as SLCMs. For simplicity, we assume there are K = 3 latent skills, and the first two skills are relevant to each item. The detailed derivation for reparameterization of CDMs is provided in “Appendix A”. The DINA model is a conjunctive model with θ j,α taking only two possible values: one for students who have all the relevant skills, and the other for students who miss any of the relevant skills. If reparameterized as an SLCM, the DINA model is a sparse model with only one active coefficient, the highest interaction term that involves all the relevant skills, in addition to the intercept. In contrast, the DINO model is a disjunctive model, in which one value of θ j,α is for students who have at least one of the relevant skills, and the other is for students who miss all of the relevant skills. If reparameterized as an SLCM, the DINO model turns out to be a dense model with all the main effects and interactions among the relevant skills being active. In addition, there are some constraints on the coefficients: the active coefficients in odd orders are all equal and positive, while the values of those in even orders are the additive inverse of the odd ones. The G-DINA model is a generalization of the DINA model, which assumes θ j,α can be decomposed into the sum of the effects from relevant skills and their interactions. So similar to the DINO model, the G-DINA model is also a dense model with all the main effects and interactions among the relevant skills being active. The NC-RUM model assumes that missing any of the relevant skills reduces the positive response probability by a multiplicative penalty term. Xu (2017) has shown that the NC-RUM model is equivalent to a log-link additive model with just the main effects. Similar to the NC-RUM model, the C-RUM model is also an additive model involving only the main effects, but with a logit link function. An advantage of SLCM is that it contains all CDMs that admit a K -dimensional binary attribute profile, including the ones that do not fall into any of the aforementioned CDMs. Consider two sparsity patterns shown in Table 2 and assume the active coefficients are all positive. In the first case, mastering skill 1 alone increases the positive response probability, while mastering YINYIN CHEN ET AL. Table 2. Other sparse patterns of q = (1, 1, 0) δ j,0 1 1 δ j,1 δ j,2 1 1 δ j,3 δ j,12 δ j,13 δ j,23 δ j,123 1 1 skill 2 alone does not; mastering skill 2, however, increases the positive response probability conditioning on the mastery of skill 1. The second case has a similar interpretation. These two cases do not belong to any of the CDMs mentioned above, but can be modeled by SLCM. 2.3. From Q-matrix to -matrix Although the Q-matrix has been widely used for cognitive diagnostic modeling, it only provides partial information regarding the item–attributes relationship (Fang et al. 2019). Given q j , we know the relevant attributes, but it is not clear how they interact with each other to affect the positive response probability without specifying a particular CDM. In practice, it is challenging to pre-specify a CDM when no prior information of the test is available. Furthermore, it is possible that different items in a test follow different CDMs, so there is no single CDM that is appropriate for all the items in the test. In SLCM, the sparsity matrix J ×2 K , which does not require any pre-specified CDM, provides a more general and informative description of the item–attributes relationship. For example, all the CDMs listed in Tables 1 and 2 have the same q j = (1, 1, 0) but their structure vectors δ j ’s could be different. In cases, the Q-matrix is preferred as a summary of the relevant skills, we extract q j based on δ j as follows: for any attribute k, if there exists a subset of the K attributes, {k1 , k2 , . . . , kl } ⊆ {1, . . . , K }, such that, k ∈ {k1 , k2 , . . . , kl } and δ j,k1 k2 ...,kl = 1, then q jk = 1, otherwise, q jk = 0. That is, q jk = 1 if any element of δ j that is relevant to skill k is nonzero. 3. Identifiability Model identifiability is of great importance in the study of CDMs. In Statistics, a model is identifiable if it is theoretically possible to learn the true values of its underlying parameters after obtaining an infinite number of observations. Mathematically, this is equivalent to saying that different values of the parameters must correspond to different probability distributions of the observable variables. Identifiability conditions are technical restrictions, under which the model is identifiable. In this section, we establish a set of identifiability conditions of SLCM in terms of . We first introduce the identifiability issue encountered in CDMs in Sect. 3.1 and review prior research in Sect. 3.2. Then, we introduce the generic identifiability in the context of SLCM in Sect. 3.3 and propose a set of generic identifiability conditions in Sect. 3.4. The proof sketch of generic identifiability is provided in Sect. 3.5, and detailed proofs are given in “Appendix B.” 3.1. Identifiability Issue In CDMs, the observable variables are Y = (Y1 , . . . , Y J ),, and the parameters of interest are the latent class proportion vector π and the coefficient matrix B. Denote the parameter space of (π , B) by (π , B) = {(π, B) : π ∈ (π ), B ∈ (B)}, PSYCHOMETRIKA K where (π ) = {x ∈ R2 : x1 + · · · + x2 K = 1, xi > 0} is a (2 K − 1)-dimensional simplex and K (B) is the parameter space of the coefficient matrix B, which could be the whole space R J ×2 K or a subset of R J ×2 if we constrain the Q-matrix or the -matrix. Given an attribute profile/class α, the joint distribution of Y = {Y1 , . . . , Y J } is a product of Bernoulli distributions, which can be described by a J -dimensional 2 × · · · × 2 table J Pα (B) = (θ j,α , 1 − θ j,α ) j=1 where denotes the Kronecker product. The marginal distribution of Y over different classes is given by P(π , B) = πα Pα (B). α Definition 1. (Identifiable). A parameter set (π , B) ∈ (π , B) is identifiable, if P(π , B) = P(π̄ , B̄) ⇔ (π , B) ∼ (π̄, B̄), where (π̄, B̄) is another parameter set from (π , B) and “∼" means the two sets of parameters are identical up to label switching. 3.2. Prior Work CDMs are mixtures of Bernoulli distributions, which are known to be non-identifiable even if we ignore the non-identifiability due to label switching (Teicher et al. 1961; Yakowitz and Spragins,1968; Goodman 1974; Gyllenberg et al. 1994b). It is, however, possible for CDMs to be identifiable if certain restrictions on the Q-matrix and/or {θ j,α } are satisfied. For instance, as discussed in Chiu et al. (2009) and Liu et al. (2013), the completeness of Q-matrix, a condition that requires the Q-matrix to contain an identity matrix after row permutation, is believed to be a necessary condition for the identifiability of DINA models. Different identifiability conditions have been proposed for different CDMs in the literature. For example, Liu et al. (2013) studied the DINA model with complete knowledge of guessing parameters, and Chen et al. (2015) studied the DINA/DINO models. For general CDMs, Xu (2017) and Xu and Shang (2017) proposed a set of identifiability conditions in terms of the Q-matrix and the monotonicity constraints. But their conditions are too strong in practice, especially when K is large, which will be discussed in the next paragraph. Another set of identifiability conditions for general CDMs was proposed by Fang et al. (2019), which are still stronger than ours and also difficult to be incorporated in algorithms since they are expressed in terms of {θ j,α }. The monotonicity constraints imposed by Xu (2017) and Xu and Shang (2017) are as follows: min θ j,α ≥ θ j,0 , α∈S0 max θ j,α < min θ j,α = max θ j,α , α∈S0 α∈S1 (4) α∈S1 where S0 = {α|α q j , α 0} is the set of classes with at least one skill but not mastering all the relevant skills and S1 = {α|α q j } is the set of classes mastering all the relevant skills, where the YINYIN CHEN ET AL. notation “α α̃" means αk ≥ α̃k for all k = 1, . . . , K , and “α α̃" means αk ≥ α̃k for all k and there exists at least one k0 , such that αk0 > α̃k0 . Constraints (4) imply that students without any skill (i.e., α = 0) should have the lowest probability to give a positive answer to item j, students with all the relevant skills (α ∈ S1 ) should have the highest probability, and additional skills beyond the relevant skills are not expected to increase the probability. Under such constraints, the model identifiability is guaranteed (Xu 2017; Xu and Shang 2017) when the Q-matrix (after row swapping) takes the form of ⎛ ⎞ IK Q = ⎝ IK ⎠, Q (5) where I K is an identity matrix of size K and there are additional constraints on Q . However, condition (5) implies that the first 2K items degenerate into items in a DINA/DINO model with only one relevant skill. When K is relatively large, e.g., close to J/2, such a Q-matrix essentially forces the model to be a DINA/DINO model, which is contrary to the original intention of general CDMs. 3.3. Generic Identifiability Generic identifiability, which is less stringent than Definition 1, is introduced in Allman et al. (2009). Generic identifiability allows some parameter values to be non-identifiable as long as these exceptional values are of measure zero with respect to the parameter space. As pointed out in Allman et al. (2009), generic identifiability of a model is generally sufficient in practice, since one is unlikely to face the non-identifiability problem when almost all parameters (except a measure zero set) are identifiable. Allman et al. (2009) has proved that a general mixture of Bernoulli products is generically identifiable, provided that the number of items is larger than twice the number of classes. However, there is a gap between the result established in Allman et al. (2009) and SLCM K studied in this paper. The parameter space of the former is fixed, corresponding to (B) = R J ×2 in the context of SLCM, whereas the parameter space (B) of an SLCM varies with and is usually of dimension less than J × 2 K . In other words, unless = 1 J ×2 K (i.e., each coefficient in B is active), for any other , the whole parameter space of B is a measure zero subspace of K R J ×2 . So the result from Allman et al. (2009) is not applicable to SLCM, as it is meaningless to discuss measure zero subspace without a fixed parameter space. To discuss generic identifiability for SLCM, we need to first define the parameter space by taking into consideration of their sparsity structures. For a given sparsity structure , we denote its corresponding parameter space by (π , B) = {(π, B) : π ∈ (π ), B ∈ (B)}, where (B) consists of the coefficient matrices that can only have nonzero entries at positions where the corresponding elements in is 1. So it suffices to think (B) = R|| , where || denotes the total sum of the entries in . Let C denote the set of non-identifiable parameters from (π, B): C = {(π , B) : P(π , B) = P(π̃ , B̃), (π, B) ∼ (π̄, B̄), (π, B) ∈ (π , B), (π̃, B̃) ∈ ˜ (π , B)}. PSYCHOMETRIKA Note that the non-identifiability of a parameter set (π , B) ∈ C could be due to another parameter ˜ set (π̃, B̃) with a different sparsity structure . If the non-identifiable set C is of measure zero with respect to (π, B), we say (π, B) is a generically identifiable parameter space. Definition 2. (Generically identifiable). The parameter space (π , B) is generically identifiable, if the Lebesgue measure of C with respect to (π, B) is zero. To distinguish the two definitions of identifiability, in the following text, we refer to the identifiability defined in Definition 1 as strict identifiability. 3.4. Identifiability Conditions In this section, we discuss two sets of conditions for generic identifiability and strict identifiability. We start with the following two conditions needed for generic identifiability. ⎞ ⎛ D1 (G1) The true sparsity matrix takes the form of = ⎝ D2 ⎠ after row swapping, where K is a (J − 2K ) × 2 binary matrix and D1 , D2 ∈ Dg with Dg = ⎧ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎩ D ∈ {0, 1} K ×2 K ⎡ ∗ 1 ∗ ... ∗ ... ⎢∗ ∗ 1 . . . ∗ . . . ⎢ : D = ⎢. . . .. ⎣ .. .. ∗ ∗ ∗ ... 1 ... ⎤⎫ ∗ ⎪ ⎪ ⎪ ⎬ ∗⎥ ⎥ .. ⎥ , . ⎦⎪ ⎪ ⎪ ⎭ ∗ where ∗ can be either 0 or 1. (G2) For any k = 1, 2, . . . , K , there exists a jk > 2K , such that δ jk ,k = 1 . Theorem 1 The parameter space (π , B) is generically identifiable, if conditions (G1) and (G2) are satisfied. Remark 1 Theorem 1 does not require the monotonicity constraints (4), but it remains valid if, in addition, the monotonicity constraints are imposed on B. Remark 2 Theorem 1 applies to any CDM that has a real analytic link function, including but not limited to the probit link, the logit link, the log link, and the identity link. Remark 3 If the ∗ entries in condition (G1) are all 1’s, i.e., = 1 J ×2 K , the corresponding model is a general mixture of Bernoulli products. Theorem 1 implies that a mixture of Bernoulli products is generically identifiable up to label switching, provided that J ≥ 2K + 1, which is consistent with Corollary 5 in Allman et al. (2009). If the ∗ entries in condition (G1) are all 0’s except the intercept, the corresponding Q-matrix is similar to (5), the Q-matrix from Xu (2017) and Xu and Shang (2017) for strict identifiability. In fact, the technique we use to prove generic identifiability (Theorem 1) can be easily extended to show strict identifiability (Theorem 2) under the monotonicity constraints (4) and the following two conditions. YINYIN CHEN ET AL. ⎛ ⎞ D1 (S1) The true sparsity matrix takes the form of = ⎝ D2 ⎠ after row swapping, where K is a (J − 2K ) × 2 binary matrix and D1 = D2 = Ds with ⎡ 1 0 ... 0 ... 0 1 ... 0 ... .. . . . . 1 0 0 ... 1 ... 1 ⎢1 ⎢ Ds = ⎢ . ⎣ .. ⎤ 0 0⎥ ⎥ .. ⎥ . .⎦ 0 (S2) For any two classes of subjects, there exists at least one item in such that they have different positive response probabilities. Theorem 2 (Strict identifiability) Any parameter from (π, B) is strictly identifiable, if conditions (S1) and (S2) and the monotonicity constraints (4) hold. Theorem 2 is similar to the strict identifiability results in Xu (2017) and Xu and Shang (2017), although our proof technique, which is based on tensor product, is different from theirs. The proof of Theorem 2 is given in “Appendix B.2.” 3.5. Proof of Generic Identifiability In this section, we describe the core of our proof for Theorem 1, which is based on the results in Kruskal (1976, 1977) for the uniqueness of tensor decomposition and the tripartition approach in Allman et al. (2009). First, we introduce some notation. Definition 3 The class-response matrix M(, B) is defined as a matrix of size 2 K × 2 J , where the entries are indexed by a row index α and a column index y. The αth row and yth column element of M(, B) is the probability that a subject with attribute profile α gives response y, i.e., P(Y = y|α, B) = J y θ j,αj (1 − θ j,α )1−y j . j=1 Definition 4 For a class-response matrix M, the Kruskal rank of M is the largest number I such that every I rows of M are independent. Remark 4 If M is full row rank, the Kruskal rank of M is its row rank. Next, we reformulate a result in Allman et al. (2009) as follows. Theorem 3 (Allman et al. 2009) Consider a general latent class model with r classes and J features, where J ≥ 3. Suppose all entries of π are positive. If there exists a tripartition of the set J = {1, 2, · · · , J } that divides J into three disjoint, nonempty subsets J1 , J2 , J3 such that the Kruskal ranks of the three class-response matrices M 1 , M 2 , and M 3 satisfy I1 + I2 + I3 ≥ 2r + 2, (6) where Is denote the Kruskal rank of M s for features in Js , then the parameters of the model are uniquely determined, up to label switching. PSYCHOMETRIKA We can view Theorem 3 from the perspective of tensor decomposition. The distribution of Y can be represented as a 2|J1 | × 2|J2 | × 2|J3 | -dimensional three-way tensor T according to the tripartition J1 , J2 , and J3 defined in Theorem 3. The ( y1 , y2 , y3 )th element is the probability of observing ( y1 , y2 , y3 ), namely, T ( y1 , y2 , y3 ) = P(Y J1 = y1 , YJ2 = y2 , Y J3 = y3 |π, B) πα P(Y J1 = y1 , Y J2 = y2 , Y J3 = y3 |B, α) = α = πα P(Y J1 = y1 |B, α)P(Y J2 = y2 |B, α)P(Y J3 = y3 |B, α), α where the last equation is due to the fact that given attribute profile α, components of Y are independent. Therefore, identifiability is equivalent to the uniqueness of the following tensor decomposition: T= πα M 1,α ⊗ M 2,α ⊗ M 3,α α = M̃ 1,α ⊗ M 2,α ⊗ M 3,α , α where M s,α is the αth row of M s , and M̃ 1,α = πα M 1,α . Results from Kruskal (1976, 1977) state that if the sum of the Kruskal ranks of M̃ 1 , M 2 , M 3 is larger than or equal to 2r + 2, then the tensor decomposition is unique up to simultaneous permutation and rescaling of the rows. Since M 1 , M 2 , M 3 are all class-response matrices, of which every row sums to 1, the uniqueness of tensor decomposition implies model identifiability. Now, we are ready to give our proof. We divide into D1 , D2 , , which correspond to J1 , J2 , J3 , respectively. For this tripartition, both J1 and J2 contain K items and their sparsity matrices are from Dg , respectively; the remaining J − 2K items are included in J3 corresponding to satisfying condition (G2). Accordingly, we decompose the parameter space into three parts, (B) = D1 ⊗ D2 ⊗ . To check inequality (6), we show that under conditions (G1) and (G2), we have I1 = 2 K , I2 = 2 K and I3 ≥ 2 hold almost everywhere in D1 , D2 , , respectively. Consequently, identifiability holds almost everywhere in (B). In Theorem 4, we show that for any D ∈ Dg , the class-response matrix M( D, B) with size 2 K × 2 K , is of rank 2 K (full rank) almost everywhere in D . Therefore, with condition (G1), we have I1 + I2 = 2·2 K hold (almost everywhere in D1 ⊗ D2 ). To prove Theorem 1, it then suffices to show that with condition (G2), we have I3 ≥ 2 (almost everywhere in ). In fact, under condition (G2), the following statement holds almost everywhere in : for any two different classes, α 1 , α 2 , there must exist one j0 > 2K , such that θ j0 ,α 1 = θ j0 ,α 2 . The exceptional case is that when β jk ,k = 0 holds for some k, which is of Lebesgue measure zero with respect to . J Note that the αth row of M 3 is given by M 3,α = j=2K +1 (θ j,α , 1 − θ j,α ). So θ j0 ,α 1 = θ j0 ,α 2 implies M 3,α 1 = M 3,α 2 . Therefore, rows of M 3 are unique, which implies that the Kruskal rank of M 3 is at least 2, i.e., I3 ≥ 2, (almost everywhere in ). Theorem 4 Given D ∈ Dg , the corresponding class-response matrix, M( D, B), is of full rank except some values of B from a measure zero set with respect to D , i.e., λ D {B ∈ D : det[M( D, B)] = 0} = 0, where λ D {A} denotes the Lebesgue measure of set A with respect to D . (7) YINYIN CHEN ET AL. The proof of Theorem 4 for general D ∈ Dg is given in “Appendix B.1.” Next, we look at two special cases. Example 1 Da = 1 K ×2 K . Proof Da implies a general mixture of K -dimensional Bernoulli products with 2 K classes. There K exists a one-to-one mapping between the probability matrix θ = {θ j,α } ∈ [0, 1] K ×2 and the K coefficient matrix B ∈ Da (B) = R K ×2 . Equation (7) holds if the solution set det[M( D, B)] = K det[M(θ )] = 0 in terms of θ is of measure zero with respect to [0, 1] K ×2 , i.e., λ[0,1] K ×2 K {θ ∈ [0, 1] K ×2 : det[M(θ)] = 0} = 0. K Note that det[M(θ )] is a polynomial function of θ with finite degrees and is not constantly zero K for any θ ∈ [0, 1] K ×2 , the solution set forms a proper subvariety which must be of dimension K less than K × 2 (Cox et al. 1994; Allman et al. 2009), hence, of Lebesgue measure zero. Example 2. Ds = (1 K , I K , 0). Proof. Ds implies a DINA model with K items, K skills and Q = I K . For any item, β j can be reparameterized by the guessing parameter g j = (β j,0 ) and the slipping parameter s j = 1 − (β j,0 + β j, j ). We can rewrite the class-response matrix as the Kronecker product of K of 2 × 2 sub-matrices, K M( Ds , B) = j=1 gj 1 − gj := 1 − sj sj K Mj j=1 By the property of Kronecker products, we have rank(M( Ds , B)) = Kj=1 rank(M j ). Therefore, M( Ds , B) is of full rank unless there exists at least one j0 , such that g j0 = 1 − s j0 , or equivalently β j0 , j0 = 0. Therefore, the dimension of the exceptional set is less than the dimension of Ds , hence of Lebesgue measure zero. Remark 5. In the above two special cases, after re-parameterizing B, we simply compare the dimension of the solution set and that of the parameter space. This technique, however, cannot be applied to general settings where the equation det[M( D, B)] = 0 is not easy to solve. 4. MCMC for Model Estimation 4.1. Bayesian Formulation Consider a SLCM with N subjects, J items and K skills. Let α i denote the attribute profile of subject i, and Yi j denote the response of subject i to item j. Symbols like β j and δ j are defined in Sect. 2. Throughout, we use subscript i = 1, . . . , N to index subjects, j = 1, . . . , J to index items, and p = 1, . . . , 2 K to index elements of the coefficients with p = 1 corresponding to the intercept. We formulate the Bayesian model as follows. PSYCHOMETRIKA • The probit model on Yi j : Yi j | α i ∼ Bernoulli β Tj aαi , where (·) is the probit link function. • The spike-and-slab prior on B: ! β jp | δ jp ∼ N (0, σβ2 ) δ jp = 1 , I(β j p = 0) δ j p = 0 δ j p |ω ∼ Bernoulli (ω) , ω ∼ Beta(w0 , w1 ), where I is an indicator function. The intercept is always set active with δ j1 = 1. In K addition, the prior distribution on B is restricted on B(B), a subset of R J ×2 where the monotonicity constraints (4) and strict/generic identifiability conditions are satisfied. • The prior on latent attributes α i : α i | π ∼ Multinomial(π ), π ∼ Dirichlet(d 0 ). Here, (σβ2 , w0 , w1 , d 0 ) are user-specified hyper-parameters. A similar sparsity inducing prior specification was used by Culpepper (2019) for general CDMs. 4.2. The Gibbs Sampling Algorithm Following the data augmentation approach in Albert and Chib (1993), we introduce an augmented variable Z i j ∼ N (β Tj aαi , 1) and Yi j = I(Z i j > 0). In particular, let Z j denote the N augmented variables for item j, and Z j can be written as the following Gaussian regression model, Z j α 1:N , β j ∼ N N Xβ j , I N , T where X = aα 1 , . . . , aα N is an N × 2 K design matrix shared for all j = 1, . . . , J. Although vanilla MCMC algorithms are available for Bayesian variable selection with Gaussian regression models, they are not applicable here. We derive a tailored MCMC scheme for our model, which ensures that each posterior draw of B is from the valid parameter space B(B). A key step in our algorithm is to sample the indicator variable δ j p . First, we need to decide whether δ j p is allowed to be changed, i.e., whether the identifiability conditions and the monotonicity constraints still hold when δ j p is replaced by 1 − δ j p . For strict identifiability or generic identifiability, the explicit conditions of are given in Sect. 3. Checking whether a matrix satisfies these conditions is straightforward. If these conditions are satisfied either with δ j p = 0 or 1, then δ j p is allowed to be updated, otherwise leave δ j p unchanged in this iteration. Next, we derive the sampling distribution for δ j p if it is allowed to be updated. Since δ j p is deterministic if β j p is given, we need to derive the conditional distribution of δ j p given other variables except β j p , namely, δ j p Z j , α 1:N , β j ( p) , ω, σβ2 ∼ Bernoulli ω̃ j p , (8) YINYIN CHEN ET AL. where β j ( p) is the coefficient vector β j without the pth element and ω̃ j p is the Bernoulli parameter we need to determine. To compute ω̃ j p , we need to evaluate the integrated likelihood of Z j with respect to β j p , as the evidence for δ j p = 1. Express the density function of Z j as T N 1 " Z j − X pβ jp Z j − X pβ jp p Z j α 1:N , β j = (2π )− 2 exp − " 2 (9) where " Z j = Z − X ( p) β j ( p) , and X ( p) is an N × 2 K − 1 matrix that excludes the pth column. If there is no constraint on β j p , it is straightforward to compute the integrated likelihood of (9) with respect to β j p ∼ N (0, σβ2 ). However, with the monotonicity constraints on B, it is not clear what kind of values β j p is allowed to take. Recall the monotonicity constraints on B: for j = 1, . . . , J , min β Tj aα = β j1 , α max β Tj aα < min β Tj aα . α:aα δ j α:aα δ j Next, we show that if β j p is lower bounded, the constraints above are satisfied. Proposition 1. Suppose the coefficient matrix in iteration t satisfies identifiability conditions and the monotonicity constraints, B (t) ∈ B(B), and only β j p ( p > 1) is sampled in iteration t + 1. If (t+1) β j p > L, then B (t+1) ∈ B(B). The lower bound L is given by # L = max max (−γ j,α ), max γ j,α − γ α∈L1 (t)T (t) α∈L0 $ (t) j,q j , (t) = β where γ j,α aα − β j1 − β j p , L1 = {α|aα, p = 1}, L0 = {α|aα, p = 0, α 0} and aα, p j denotes the pth element of aα . The proof of Proposition 1 is given in “Appendix E.” Using Proposition 1, we can show that the Bernoulli parameter in (8) is given by ω̃ j p = −L σβ −L σβ −1 ω −1 ω σ̃ p σβ σ̃ p σβ μ̃ j p −L σ̃ p μ̃ j p −L σ̃ p exp # 2 1 μ̃ j p 2 σ̃ p2 exp # 2 $ 1 μ̃ j p 2 σ̃ p2 $ , +1−ω −1 where X p denotes the pth column of matrix X, σ̃ p2 = X p X p + σβ−2 and μ̃ j p = −1 . See “Appendix D” for the detailed derivation. X p " Z j X p X p + σβ−2 After updating δ j p , we update β j p based on the full conditional distribution below: % &δ % &1−δ j p β j p Z j , α 1:N , β j ( p) , σβ2 , δ j p ∼ N μ̃ j p , σ̃ p2 I β j p > L j p I β j p = 0 . (10) We summarize the sampling algorithm of B and in Algorithm 1, and the full sampling algorithm in Algorithm 2. PSYCHOMETRIKA Algorithm 1: Sample B and for j ← 1 to J do for p ← 1 to 2 K do K for α ∈ {0, 1}2 do γα ← β Tj aα − β j0 − β j p end −1 −1 Z j X p X p + σβ−2 σ̃ p2 ← X p X p + σβ−2 ," Z j = Z − X ( p) β j ( p) , μ̃ j p ← X p " . L ← max{maxα∈L1 −γα , maxα∈L0 γα − γq }, j if ( L ≤ 0) and (δ j p = 1 − δ j p satisfies the identifiability conditions) then ⎞1 ⎞ ⎛ ⎛ # $ μ̃2j p μ̃ j p −L σ̃ p2 2 exp⎝ 21 2 ⎠ ω⎝ 2 ⎠ σ̃ p σβ σ̃ p . ⎞ ⎛ ⎛ ⎞1 $−1 # # $ 2 μ̃ μ̃ j p −L σ̃ 2 2 −L ⎝ p⎠ ⎝ 1 j p ⎠+1−ω ω exp σβ 2 σ̃ 2 σ̃ p σβ2 p # ω̃ j p ← −L σβ $−1 Sample δ j p from Bernoulli (ω̃ j p ). end if δ j p = 1 then Sample β j p from truncated normal distribution, N (μ̃ j p , σ̃ p2 )I (β j p > L). end if δ j p = 0 then β j p = 0. end end end return B , 4.3. Bayesian Penalization In this subsection, we show that the spike-and-slab prior, a Bayesian variable selection technique, is equivalent to a mixture of L 0 and L 2 penalty terms on B. (1 − ω)2 , and the prior distribution of π 2π ω2 is Dirichlet(1), then the Bayesian MAP estimate of (π , B) is equivalent to the optimum of the objective function with a mixture of L 0 and L 2 penalty terms on B, i.e., Proposition 2. If σβ2 , ω are fixed such that σβ2 > ' ( arg max p(π , B|Y N ) = arg max − (π , B|Y N ) + λ1 B0 + λ2 B2 π ,B π,B where Y N is the observed responses matrix of N test subjects, (π, B|Y N ) is the log-likelihood function, and λ1 , λ2 > 0 are constants. Proof. The marginal posterior distribution of (π, B) is written as p(π , B|Y N ) ∝ f (Y N |π, B) p(π) p(B|σβ2 , ) p(|ω) YINYIN CHEN ET AL. Algorithm 2: Full Gibbs sampling algorithm Input: Y N ×J , initial values of π , α 1:N , B , total chain length T , burn-in period b, and hyper-parameters σβ2 , w0 , w1 , d 0 . for t ← 1 to T do for j ← 1 to J do for α ∈ {0, 1} K do θ j,α ← (aT α β j ). end end for i ← 1 to N do Sample α i from the multinomial distribution, yi j 1−yi j P(α i = α|π , yi ) ∝ πα Jj=1 θ j,α (1 − θ j,α ) end for α ∈ {0, 1} K do )N Ñα ← i=1 1(α i = α) Sample π from Dirichlet2 K ( Ñ + d 0 ) end for j ← 1 to J do Sample Z j from the truncated normal distribution, N y 1−yi j N N (Xβ j , I N ) i=1 [I (Z i j > 0)] i j [I (Z i j < 0)] . end Sample B and using )Algorithm 1, ) Sample ω from Beta j, p (1 − δ j p ) + ω0 , j, p δ j p + ω1 . if t > b then B (t−b) ← B , (t−b) ← end end return B (1) · · · , B (T −b) , (1) · · · , (T −b) . Taking natural logarithm on both sides and ignoring the constant, we have log p(B, π |Y N ) =(Y N |π, B) + log p(π) + log p(B|σβ2 , ) + log p(|ω) * 1 ω B2 − B0 log =(π , B|Y N ) − B0 log 2π σβ2 − 2 1 − ω 2σβ * ω 2π σβ2 1 − =(π , B|Y N ) − B0 log B2 1−ω 2σβ2 =(π , B|Y N ) − λ1 B0 − λ2 B2 , where λ1 = log * ω 2π σβ2 1−ω and λ2 = 1 . 2σβ2 Remark 6. Although L 0 penalty is the natural choice to account for the model complexity, it is computationally inefficient. For tractable computation, Xu and Shang (2017) proposed to use truncated lasso penalty (TLP) as an approximation, while we use the spike-and-slab prior. 5. Monte Carlo Simulation 5.1. Overview To test the performance of the proposed algorithm, we employ Monte Carlo simulation, using different attribute sizes (K = 3, 4), different sample sizes (N = 500, 1000, 2000), and different PSYCHOMETRIKA correlations among attributes (ρ = 0, 0.15, 0.25). The unknown true Q-matrices, with J = 20 items, satisfying the strict identifiability conditions, are as follows: ⎞ ⎛ ⎞ ⎛ 1000 100 ⎜0 1 0 0⎟ ⎜0 1 0⎟ ⎟ ⎜ ⎟ ⎜ ⎜0 0 1 0⎟ ⎜0 0 1⎟ ⎟ ⎜ ⎟ ⎜ ⎜0 0 0 1⎟ ⎜1 0 0⎟ ⎜ ⎟ ⎜ ⎟ ⎜1 0 0 0⎟ ⎜0 1 0⎟ ⎜ ⎟ ⎜ ⎟ ⎜0 1 0 0⎟ ⎜0 0 1⎟ ⎜ ⎟ ⎜ ⎟ ⎜0 0 1 0⎟ ⎜1 0 0⎟ ⎜ ⎟ ⎜ ⎟ ⎜0 0 0 1⎟ ⎜0 1 0⎟ ⎜ ⎟ ⎜ ⎟ ⎜1 1 0 0⎟ ⎜0 0 1⎟ ⎜ ⎟ ⎜ ⎟ ⎜0 1 1 0⎟ ⎜1 1 0⎟ ⎜ ⎟ ⎟ Q = Q=⎜ ⎜0 0 1 1⎟. ⎜1 0 1⎟ ⎜ ⎟ ⎜ ⎟ ⎜1 0 0 1⎟ ⎜0 1 1⎟ ⎜ ⎟ ⎜ ⎟ ⎜1 0 1 0⎟ ⎜1 1 0⎟ ⎜ ⎟ ⎜ ⎟ ⎜0 1 0 1⎟ ⎜1 0 1⎟ ⎜ ⎟ ⎜ ⎟ ⎜1 1 0 0⎟ ⎜0 1 1⎟ ⎜ ⎟ ⎜ ⎟ ⎜0 0 1 1⎟ ⎜1 1 0⎟ ⎜ ⎟ ⎜ ⎟ ⎜0 1 1 1⎟ ⎜1 0 1⎟ ⎜ ⎟ ⎜ ⎟ ⎜1 0 1 1⎟ ⎜1 1 1⎟ ⎜ ⎟ ⎜ ⎟ ⎝1 1 0 1⎠ ⎝1 1 1⎠ 1110 111 For the ρ = 0 cases, the attribute profile α is generated uniformly from the 2 K classes. In other words, different attributes are independent. For the ρ > 0 cases, dependence among attributes is introduced using the method of Chiu et al. (2009). Specifically, ξ = (ξ1 , . . . , ξ K ) is generated from the multivariate normal distribution N (0, ) with variance 1 and correlation ρ such that = (1 − ρ)I K + ρ1 K ×K . The attribute profile α = (α1 , . . . , α K ) is given by αk = I(ξk ≥ 0), k = 1, . . . , K . The positive response probability of each item is set between 0.2 and 0.8 using the method of Xu and Shang (2017). In particular, we set the probability of attribute profiles with K j out of the K j relevant attributes to be 0.2 + (0.8 − 0.2) × K j /K j . Our model does not explicitly include a Q-matrix, but in order to compare with other methods, we recover the Q-matrix by aggregating B̂, the posterior mean of B, in the following way: 1. sum up the values of the relevant coefficients (t) (t) q̃ jk = β̂ j,k + (t) β̂ j,k1 k + k1 (t) β̂ j,k1 k2 k + . . . , t = 1, . . . , T k1<k2 2. standardize )T q̄ jk = (t) t=1 q̃ jk )T (t) maxk∈{1,...,K } t=1 q̃ jk 3. identify q̂ jk as 1 if q̄ jk exceeds a fixed threshold. Here, we set the cutoff to 0.5 in all configurations q̂ jk = 1{q̄ jk > cutoff} . YINYIN CHEN ET AL. Table 3. Recovery of Q for K = 3 ρ 0 0.15 0.25 N 500 1000 2000 500 1000 2000 500 1000 2000 Strictly identifiable space Generically identifiable space Matrix Item TPR FPR Matrix Item TPR FPR 0.582 0.936 1.000 0.530 0.934 0.998 0.540 0.960 0.994 0.974 0.997 1.000 0.970 0.997 1.000 0.971 0.998 1.000 0.988 0.999 1.000 0.988 0.999 1.000 0.989 0.999 1.000 0.005 0.001 0.000 0.008 0.001 0.000 0.009 0.001 0.000 0.586 0.946 0.998 0.564 0.928 0.996 0.508 0.898 0.990 0.975 0.997 0.999 0.971 0.996 1.000 0.966 0.995 1.000 0.989 0.999 1.000 0.990 0.999 1.000 0.990 0.999 1.000 0.006 0.001 0.000 0.010 0.001 0.000 0.014 0.003 0.000 Table 4. Recovery of B for K = 3 and K = 4 ρ N Strictly identifiable space K =3 0 0.15 0.25 1000 2000 1000 2000 1000 2000 Generically identifiable space K =4 K =3 K =4 RMSE aBias RMSE aBias RMSE aBias RMSE aBias 0.112 0.074 0.108 0.075 0.106 0.077 0.065 0.044 0.064 0.045 0.064 0.047 0.114 0.091 0.108 0.091 0.107 0.095 0.057 0.047 0.057 0.049 0.058 0.052 0.115 0.075 0.109 0.077 0.110 0.080 0.071 0.046 0.069 0.049 0.071 0.052 0.128 0.088 0.136 0.112 0.145 0.129 0.086 0.059 0.096 0.074 0.102 0.080 5.2. Results We generated 500 independent replications for each configuration. Table 3 summarizes the results of the recovery of the Q-matrix for K = 3. The metrics we use here are the same as in Xu and Shang (2017). The column “Matrix” records the matrix-level Q-matrix recovery rates of the 500 replications. The column “Item” gives the item-level recovery rates. “TPR” and “FPR” are two entry-level rates. The column “TPR” is the true positive rate, i.e., the proportion of 1’s in the true Q-matrix correctly estimated; “FPR” is the false positive rate, i.e., the proportion of 0’s in the true Q-matrix incorrectly estimated as 1’s. Table 4 summarizes the results of the recovery of B for K = 3 and K = 4. RMSE is the averaged root-mean-square error, and aBias is the averaged absolute values of the estimated biases. For each configuration, we restricted the parameter search to the strictly identifiable space (under strict identifiability conditions) and the generically identifiable space (under generic identifiability conditions), respectively; we report the results in the columns “Strictly Identifiable Space” and “Generically Identifiable Space.” The recovery rates of the former are generally higher. Note that the true parameters live in both spaces, and the strictly identifiable space is much smaller than the generic one. The posterior draws might be likely to be closer to the true parameters if we were to restrict the parameter search to a smaller space. However, we observe that such differences of accuracy become smaller when the sample size becomes larger. PSYCHOMETRIKA Table 5. Computation time (minutes) per replication K =3 K =4 N = 500 N = 1000 N = 2000 2.94 3.82 4.91 6.23 8.48 11.38 It is worth mentioning that our MCMC algorithm is quite efficient. It can provide estimates in several minutes. Table 5 gives average computation times for a Markov chain with 30,000 burn-in samples and 10,000 post-burn-in samples in a MacBook Pro (Retina, Early 2015) with 2.9 GHz Intel Core i5 processor. 6. Real Data Analysis In this section, we apply our algorithm to two real applications. The first dataset is Tatsuoka’s Fraction Subtraction dataset (Tatsuoka 1984, 2002). The second dataset is from an experimental IQ test offered on the Open Psychometrics website.1 6.1. Fraction Subtraction Data This dataset contains responses to a set of fraction subtraction items collected from N = 536 middle school students. It has been widely analyzed in the literature, e.g., Xu and Shang (2017), Chen et al. (2018), Chen et al. (2015), de la Torre and Douglas (2004). Table 6 presents the estimated coefficient matrix by our Gibbs sampling algorithm with K set at 3. The estimated B is one of the posterior draws in the Markov chain such that its corresponding matrix is closest to the average of among all the posterior draws in the chain. It is not hard to see that the estimated B follows the generic identifiability conditions and the monotonicity constraints. The three attributes can be roughly interpreted as (i) applying subtraction to the integer and the fraction separately, (ii) determining common denominator, (iii) converting to improper fraction. The detected skills are consistent with the findings of Chen et al. (2015, 2018). The estimated coefficients illustrate one benefit of focusing on versus Q. That is, it provides insight into how underlying attributes function together. For example, Item 4, 3 21 − 2 23 , requires improper fraction conversion, followed by common denominator determination and subtraction. In our results, Item 4 has three positive coefficients, β3 , β13 and β23 . It suggests that mastering improper fraction conversion, skill (iii), helps to solve the item. Conditioning on the mastery of skill (iii), knowing skill (i) and (ii) increases the success rate. But for those who do not master skill (iii), mastery of the other two skills does not compensate. In most previous analyses (e.g., Xu and Shang 2017; Chen et al. 2015, 2018), the estimated Q-matrices suggest only the crucial requirement of skill (iii); they do not indicate that skills (i) and (ii) are also relevant. Another observation is that the signs of interaction terms imply the logic gates of the involved skills. For example, the estimated β13 ’s of Items 5, 6, and 7 are positive, while those of Items 9 and 10 are negative. In fact, the former three items all have two distinct solution paths: (a) convert the first number to an improper fraction (skill (iii)), and then apply subtraction; or (b) apply subtraction to the integer and the fraction separately (skill (i)). The negative β13 ’s of those items indicate that 1 https://openpsychometrics.org/_rawdata/. YINYIN CHEN ET AL. Table 6. Estimated B for fraction subtraction data Item 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 Content 5 − 3 3 4 3 3 4 − 8 5 − 1 6 9 3 21 − 2 23 1 18 − 18 3 45 − 3 25 4 57 − 1 47 4 4 35 − 3 10 3 − 2 15 2 − 13 4 −2 7 4 12 12 4 13 − 2 43 7 35 − 45 1 −2 8 4 10 10 4 − 1 43 4 13 − 2 53 3 38 − 2 56 β0 β1 β2 β3 β12 − 1.74 2.80 − 1.54 2.18 1.46 − 2.65 3.72 0.03 − 0.81 β13 0.29 1.12 0.42 1.56 0.55 2.24 − 0.97 − 1.21 2.06 0.72 2.27 − 1.55 − 1.44 1.99 0.65 1.89 − 1.01 − 0.68 0.09 0.42 − 2.48 0.70 1.78 0.83 1.81 − 2.24 1.02 1.66 1.30 1.49 0.85 1.94 0.64 − 1.55 2.19 − 2.12 3.06 − 1.82 0.65 − 3.11 0.65 1.48 − 2.39 − 2.34 0.66 0.89 0.15 0.71 0.66 0.80 2.11 1.97 − 0.59 1.24 2.09 2.90 0.18 0.40 β123 1.23 − 0.99 − 2.14 β23 0.43 0.82 1.15 2.08 − 0.09 students who have mastered either skill (i) or (iii) will have high success probabilities; mastery of both is not expected to increase success rates, implying an “or” gate. In contrast, Items 9 and 10 can be solved only by a two-step solution path, i.e., improper fraction conversion (skill(iii)) followed by separate subtractions (skill(i)). The positive β13 ’s of these items indicate that mastering the two skills together would largely increase the success probability, which implies that these two skills function through an “and” gate. 6.2. Experimental Matrix Reasoning Data This dataset contains responses of N = 400 subjects to a set of IQ test questions. In each question, a matrix with one tile missing is given, as well as eight options for that missing tile. Participants are required to choose the most appropriate tile among the eight options. An example question is given in Fig. 1. We study the J = 20 questions for which the corresponding correct rates are larger than 25%. We list the question matrices in Fig. 2. For more details (the options and the correct answers), please refer to the documentation on Open Psychometrics. We observe that the patterns of Q1, Q2, Q6, Q8, Q10, Q11, Q14-Q20, and Q23-Q25 are row-wise; as a result, for each question, for the participants to figure out the correct answer to each such question, they must learn the pattern from the first two rows and apply it to the last row. For example, in Q2, the three tiles in each complete row are the same, so the last tile in the incomplete row must be the same as the ones to its left. In Q6, the tiles from left to right on each row are sequentially rotated 90 degrees in the clockwise direction, so we recover the missing tile by 90 degrees clockwise rotation of the tile to its left. On the other hand, for Q1, Q3, Q4, Q5, Q12, PSYCHOMETRIKA Figure 1. An example question (Q1): the left is the matrix missing one tile, and the right is the eight possible tiles for participants to choose, and the red marked one is the correct answer and Q13, the participants are required to infer the whole picture of the matrix from the provided tiles and find the one tile that can complete the best overall pattern. For example, in Q3, if the participants can recognize that the whole picture contains one small diamond hollow square and one large spade hollow square, then they are able to choose the proper tile, a tile with a diamond on top left corner and spades on the bottom and the right, to perfectly complete a symmetrical pattern. Similarly, the matrix of Q4 is shaped like a number “2”, so we can infer that the last tile should include a horizontal line in the middle. At the same time, for Q6, Q8, Q10, Q11, Q14, Q15, Q19, Q20, Q23, and Q24, the missing tiles are similar to some tiles on the same matrices but with little change, including suites changes (Q19, Q20, Q23), rotation (Q6, Q14), and stretching (Q8). However, for Q1, Q2, Q4, Q5, Q9, Q16, and Q25, the missing tiles are exact the same as one of their neighboring tiles. From the above analysis, we summarize the four skills to solve these questions as (i) learning the row-wise pattern, (ii) learning the pattern from the whole picture, (iii) changing from neighbors, and (iv) copying from neighbors. Table 7 presents the estimated coefficients provided by our algorithm with K set at 4; the four estimated attributes are roughly consistent with the four skills analyzed above. 7. Discussion This work focuses primarily on the study of model identifiability and parameter estimation of SLCM, in which most of the challenges are caused by the restrictions introduced from the context of CDMs. We prove the generic identifiability conditions for SLCM, which relaxes the constraints in the strict identifiability conditions in Xu (2017) and Xu and Shang (2017). We develop a Gibbs sampling algorithm for SLCM, which enforces identifiability conditions and the monotonicity constraints for valid posterior inference. The simulation results demonstrate that our algorithm YINYIN CHEN ET AL. Figure 2. Twenty questions of IQ test PSYCHOMETRIKA Table 7. Estimated B for IQ test Data: the boldfaced coefficients are the ones consistent with the analysis β0 Q1 Q2 Q3 Q4 Q5 Q6 Q8 Q9 Q10 Q11 Q12 Q13 Q14 Q15 Q16 Q19 Q20 Q23 Q24 Q25 0.0 0.3 0.0 − 0.6 0.1 − 0.2 − 0.6 0.0 − 0.7 − 0.2 − 1.0 − 0.9 − 0.8 − 0.8 − 0.2 − 0.8 − 1.0 − 1.1 − 0.9 − 0.3 β1 β2 β3 β4 1.3 1.0 0.7 0.7 1.6 1.1 1.0 0.8 0.5 0.9 0.3 0.5 1.3 0.4 0.6 1.6 0.9 0.7 1.6 0.9 1.5 0.9 0.4 0.8 2.0 1.6 0.7 1.2 1.0 0.2 1.0 β12 β13 β14 β23 β24 β34 β123 β124 β134 β234 β1234 0.3 0.4 0.5 0.9 0.6 0.5 1.1 0.2 1.2 0.9 0.2 0.7 0.4 0.2 0.5 0.4 0.1 efficiently estimates the model in different configurations, with accuracy comparable to that of alternative models. Our proof for generic identifiability is based upon the technique of generic identifiability in Allman et al. (2009) and the sufficient conditions of Kruskal (1976, 1977) for the uniqueness of three-way tensor decomposition. We note that the conditions we provide are sufficient but may not be necessary. We note further that Xu (2017) has developed a different technique, working directly on the class-response matrix. An interesting direction for future research is to draw connections between the two sets of techniques, perhaps shedding light on the identification of sufficient and necessary conditions for model identifiability for SLCM. In this study, the number of attributes, K , is assumed to be known and fixed, a limitation in practice, because it is usually difficult to specify K beforehand, especially when no prior information is available. It is always of interest to study the model with unknown K , and to develop algorithms to select K in model fitting. One promising method is nonparametric Bayesian, which is able to infer from the data an adequate model size. Nevertheless, the model identifiability may continue to present challenges. Publisher’s Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations. A Connections of SLCM to Popular CDMs In this section, we discuss the connections between SLCM and popular CDMs. To simplify the expression, we assume the relevant skills of item j are k1 , . . . , k R , i.e., q jk1 = · · · = q jk R = 1, q jk = 0, otherwise. YINYIN CHEN ET AL. Example 3. (DINA model). The deterministic input noisy output “and” gate model (Haertel 1989; Junker and Sijtsma 2001) is a conjunctive model. It assumes that a student is most capable of answering question j positively only if he/she masters all of its relevant skills. The item response function takes the following form, P(Y j = 1|α, q j ) = (1 − s j )1(α q j) g j 1(αq j ) , where s j = P(Y j = 1|α q j ) is the slipping parameter, which is the probability that a student capable for item j but response negatively and g j = P(Y j = 1|α q j ) is the guessing parameter, which is the probability that a non-master answers positively. It is assumed that g j < 1 − s j in most applications. The DINA model can be written as P(Y j = 1|α, β j ) = β j,0 + β j,k1 ...k R αk1 . . . αk R where only one coefficient, besides the intercept, in β j is active, δ j,0 = δ j,k1 ...k R = 1, δ j, p = 0 otherwise. The guessing parameter g j and slipping parameter s j is given by, g j = (β j,0 ), s j = 1 − (β j,0 + β j,k1 ...k R ). Example 4. (DINO model). The deterministic input noisy output “or” gate model (Templin and Henson 2006) is a disjunctive model, which assumes that a student is capable to answer question j positively if at least one of the relevant skills is mastered. The item response function is P(Y j = 1|α, q j ) = (1 − s j )1(α Tq j >0) g j 1(α Tq j =0) where s j and g j are defined the same as in DINA, and g j < 1 − s j is assumed. The DINO model can be reparameterized as ⎛ P(Y j = 1|α, β j ) = ⎝β j,0 + R r =1 β j,kr αkr + kr >kr β j,kr kr αkr αkr + · · · + β j,k1 ...k R R ⎞ αkr ⎠ r =1 where the coefficients containing only the relevant skills are active, δ j,0 = δ j,k1 = · · · = δ j,k R = δ j,k1 k2 = · · · = δ j,k R−1 k R = · · · = δ j,k1 ...k R = 1, δ j, p = 0 otherwise The coefficients with odd orders are all equal and positive. The coefficients with even orders are the additive inverse of those with odd orders. β j,k1 = β j,k2 = · · · = β j,k R = β j,k1 k2 k3 = · · · = β j,k R−2 k R−1 k R = · · · = − β j,k1 k2 = · · · = −β j,k R−1 k R = −β j,k1 k2 k3 k4 = · · · = −β j,k R−3 k R−2 k R−1 k R = · · · . PSYCHOMETRIKA The guessing parameter g j is in the same form of the one in DINA model and slipping parameter s j is given by 1 − (aTα β j ), with α satisfying α T q j > 0, which is equivalent to 1 − (β j,0 + β j,kr ), r = 1, . . . , R, g j = (β j,0 ), s j = 1 − (β j,0 + β j,kr ), r = 1, . . . , R. Example 5. (G-DINA model). The DINA model is generalized to the G-DINA model by de la Torre (2011), which takes the form of P(Y j = 1|α, q j ) = β j,0 + K β j,k q jk αk + k>k k=1 K β j,kk q jk αk q jk αk + · · · + β j,12...K q jk αk . k=1 By using the identity link in Eq. (1), it can be written as P(Y j = 1|α, β j ) = β j,0 + R β j,kr αkr + R β j,kr kr αkr αkr + · · · + β j,k1 ...k R αkr kr >kr r =1 r =1 where the coefficients containing only the relevant skills are active, δ j,0 = δ j,k1 = · · · = δ j,k R = δ j,k1 k2 = · · · = δ j,k R−1 k R = · · · = δ j,k1 ...k R = 1, δ j, p = 0 otherwise. Example 6. (NC-RUM model). Under the reduced noncompensatory reparameterized unified model (DiBello et al. 1995; Rupp et al., 2010), attributes have a noncompensatory relationship with observed response. It assumes missing any relevant skill would inflict a penalty on the positive response probability. P(Y j = 1|α, q j ) = b j K q (1−αk ) r j,kjk k=1 b j is the positive response probability for students who possess all relevant skills and r j,k , 0 < r j,k < 1, is the penalty for not mastering kth attribute. As pointed by Xu (2017), by using the exponential link function, NC-RUM can be equivalently written as P(Y j = 1|α, β j ) = exp β j,0 + R β j,kr αkr , r =1 where the main effects of relevant attributes are active, δ j,0 = δ j,k1 = · · · δ j,k R = 1, δ j, p = 0 otherwise. The parameters are given by b j = exp β j,0 + R r =1 ! β j,kr , r j,k = exp(−β j,kr ), if k ∈ {k1 , . . . , k R } 1, otherwise YINYIN CHEN ET AL. Example 7. (C-RUM model). Compensatory-RUM (Hagenaars 1993; Maris 1999) is given by, )K exp β j,0 + k=1 β j,k q jk αk . P(Y j = 1|α, q j ) = )K exp β j0 + k=1 β j,k q jk αk + 1 Equivalently, P(Y j = 1|α, q j ) = logit −1 β j,0 + R β j,kr αkr r =1 where (·) is the inverse of the logit function and the main effects of relevant attributes are active, δ j,0 = δ j,k1 = · · · δ j,k R = 1, δ j, p = 0 otherwise. B Proof of Theorems In this section, we provide the proof of Theorems 4 and 2. B.1 Proof of Theorem 4 We first introduce Lemma 5 (Mityagin 2015; Dang 2015) which shows that the solution set of a real analytic function is of Lebesgue measure zero if the function is not constantly 0. Then in Proposition 3, we show that G D (B) := det[M( D, B)] is a real analytic function, and in Proposition 4, we show that G D (B) is not constantly zero for any B ∈ D (B) if D ∈ Dg , so that Lemma 5 applies and Theorem 4 is proved. Lemma 5. (Mityagin 2015; Dang 2015) If f : Rn → R is a real analytic function which is not identically zero, then the set {x : f (x) = 0} has Lebesgue measure zero. Proposition 3. G D (B) = det[M( D, B)] : D → R is a real analytic function of B. Proof. G D (B) is a composition function: G D (B) = det[M] = h(θ α 0 , . . . , θ α 2 K −1 ) = h (B D aα 0 ), . . . , (B D aα 2 K −1 ) where h(θ ) : [0, 1] K ×2 → R is a polynomial function and (·) is a CDF. (·) is a real analytic function because a CDF is an integral of a real analytic function, and h(θ) is also a real analytic function since it is a polynomial. Therefore, the composition function G D (B) is a real analytic function due to the fact that the composition of real analytic functions is a real analytic function. K Proposition 4. If D ∈ Dg , there exists some B ∈ D (B), s.t., G D (B) = 0. Proof. Let B 1 = (1 K , I K , 0) ∈ D (B), ∀ D ∈ Dg . As shown in Example 2, M( D, B 1 ) is of full rank, so that G D (B 1 ) = 0. Remark 7. G D (B) ≡ 0 is not a trivial conclusion holds for all kinds of D. D ∈ D is a sufficient condition for G D (B) ≡ 0. If D ∈ D, it is possible G D (B) ≡ 0. See the following example. PSYCHOMETRIKA Example 8. Assume K = 3 and the main effect of the first skill is inactive for all items, i.e., δ j,1 = 0, ∀1 ≤ j ≤ K , then D3×8 takes the form ⎡ ∗ ⎣∗ ∗ ∗ 1 ∗ 0 0 0 ∗ ∗ 1 ∗ ∗ ∗ ∗ ∗ ∗ ∗ ∗ ∗ ⎤ ∗ ∗⎦ . ∗ For any B ∈ D (B) and any response y ∈ {0, 1}3 , M α=(0,0,0), y ( D, B) = M α=(1,0,0), y ( D, B). So the two rows of M( D, B) are identical, and M( D, B) is not full row rank, i.e., det[M( D, B)] ≡ 0. By Lemma 5 and Propositions 3 and 4, Theorem 4 is proved. B.2 Proof of Theorem 2 Proof. As shown in Example 2, for any B ∈ Ds (B), the corresponding class-response matrix is of full rank, rank(M( Ds , B)) = 2 K , holds if and only if, for each item, the success probabilities for students with the relevant skill and those without the relevant skill are different. In fact, if the two probabilities are the same, the monotonicity constraints would be violated. Then, using notation from Sect. 3.5, we conclude that under condition (S1) and the monotonicity constraints, rank(M 1 ) = rank(M 2 ) = 2 K . For M 3 , as each element is nonnegative and each row sums to 1. Under condition (S2), there must exist one item j, such that θ j,α s = θ j,α t , so rank(M 3 ) ≥ 2. C Initialization from the Identifiable Space (0) Initialization of the sparsity matrix J ×2 K : 1. activate the intercepts. (0) Fix the entries in the first column of (0) (i.e., ·1 ) as 1. Denote the remaining J × ˜ (0) . (2 K − 1) sub-matrix as (0) (0) 2. construct D1 and D2 . ˜ (0) to be Fix the first 2K rows of # $ IK 0 . IK 0 ˜ (0) . 3. construct (a) Randomly select K indexes, j1 , . . . , j K , from the set {2K + 1, . . . , J } with ˜ (0) replacement and set jk ,k = 1. YINYIN CHEN ET AL. ˜ (0) by (b) Sample the remaining entries in (0) δ j p |w (0) ∼ Bernoulli(w (0) ), K j > 2K , ( j, p) ∈ / {( jk , k)}k=1 where w (0) ∼ Beta(w0 , w1 ) and w0 , w1 are the parameters of the prior distribution and are treated as fixed. (c) Check the row sum. If any row of (0) sums to 0, then we randomly pick up an entry on this row and set it at 1. 4. shuffle the rows. Draw a J × J permutation matrix P = (e j1 , . . . , e j J ) where ( j1 , . . . , j J ) is a permu˜ (0) ← P ˜ (0) . tation of (1, . . . , J ), and let The above initialization is designed for strict identifiability conditions. To generate a under the generic identifiability conditions, we just need to enlarge the range of entries sampling from the prior distribution in step 3b. Specifically, we change step 3b to ˜ (0) by Sample the remaining entries in (0) δ j p |w (0) ∼ Bernoulli(w (0) ), K ( j, p) ∈ / {( jk , k), (k, k), (K + k, k)}k=1 . (0) Initialization of the coefficients matrix β J ×2 K = (β 1 , . . . , β J )T : (0) (0) β j p = 0, if δ j p = 0, (0) (0) (0) β j p |δ j p = 1 ∝ N (0, σβ2 )I (β j p > 0). D Derivation of ω̃ j p δ j p Z j , α, β j ( p) , ω, σβ2 ∼ Bernoulli ω̃ j p ω̃ j p = ω -∞ L -∞ ω L p Z j α, β j p β j p σβ2 dβ j p p Z j α, β j p β j p σβ2 dβ j p + (1 − ω) p Z j α, β j ( p) , β j p = 0 PSYCHOMETRIKA The numerator is . ω p Z j α, β j p β j p σβ2 dβ j p L . ∞ 1 " − N2 " =ω exp − Z j − A p β j p Z j − A p β j p (2π ) 2 L $−1 # 2 β 1 1 −L j p exp − 2 dβ j p · (2π )− 2 σβ σβ 2σβ # $−1 N −L ω = (2π )− 2 σβ σβ ! 01 / . ∞ 1 1 " − 21 2 " " × dβ j p Ap Ap + 2 β jp − 2 Ap Z j β jp + Z j Z j (2π ) exp − 2 σβ L $ # 2 N −L −1 ωσ̃ p 1 " 1 = (2π )− 2 exp − " Zj Z j Z j + σ̃ p2 Ap " σβ σβ 2 2 / 0 . ∞ 2 1 − 21 1 2 " exp − 2 β j p − σ̃ p A p Z j dβ j p × (2π ) σ̃ p 2σ̃ p L . $ # 2 ∞ 1 −L −1 ωσ̃ p 1 " " 1 μ̃ j p − N2 = (2π ) exp − Z j Z j + (2π )− 2 μ̃ j p −L 2 σβ σβ 2 2 σ̃ p − σ̃ p / $2 0 # $ # − μ̃ − μ̃ β β 1 jp jp jp jp × exp − d 2 σ̃ p σ̃ p # $ # $ $ # 2 μ̃ j p − L 1 μ̃ j p −L −1 ωσ̃ p 1 " " − N2 = (2π ) exp − Z j Z j exp 2 σβ σβ 2 σ p2 σ̃ p ∞ where σ̃ p2 = Ap A p + σβ−2 ω̃ j p = = −1 and μ̃ j p = Ap " Z j Ap A p + σβ−2 N Zj" Zj (2π)− 2 exp − 21 " −L σβ −1 ωσ̃ # 2 $ 1 μ̃ j p exp σβ 2 σ p2 # 2 $ −1 σ̃ μ̃ j p −L p −L 1 μ̃ j p ω exp σβ σβ 2 σ̃ p2 σ̃ p # 2 $ −1 σ̃ μ̃ j p −L p −L 1 μ̃ j p ω exp +1−ω σβ σβ 2 σ̃ 2 σ̃ p N Zj" Zj (2π)− 2 exp − 21 " −L σβ −1 ωσ̃ p # p σβ exp μ̃ j p −L σ̃ p −1 2 1 μ̃ j p 2 σ p2 $ . Accordingly, ω̃ j p is, μ̃ j p −L σ̃ p N + (1 − ω) (2π)− 2 exp − 21 " Zj" Zj p E Proof of Lower Bound L In this section, we derive the lower bound L of β j p (Proposition 1). Suppose at time t, we have a B (t) ∈ B(B) satisfying the monotonocity constraints (4), and we only sample β j p at time t + 1 and leave any other coefficient the same as the one at time t, i.e., (t+1) (t) β js = β js , ∀s = p. YINYIN CHEN ET AL. In what follows, we introduce some notations. We denote β j p and δ j p as β p and δ p , respectively. That is, we omit the subscript of item, j, as the lower bound of coefficient β j p does not depend on any other coefficient of other items. Let γα = β T aα − β0 = −1 (θα ) −!β0 be the sum of the linear component excluding the intercept p γα − β p α ∈ L1 for class α. Further, let γα,− p = p denote the sum of the linear component γα α ∈ L0 , p excluding the intercept and the pth coefficients for class α, where L1 = {α|aα, p = 1} and p L0 = {α|aα, p = 0, α α 0 }. We rewrite the monotonicity constraints (4) as follows, min θα ≥ θα 0 αα 0 max θα < min θα = max θα α∈S0 α∈S1 () α∈S1 where S0 = {α|α q, α α 0 = 0} is the set the classes that not mastering all the relevant skills, and S1 = {α|α q} is the set of classes mastering all the relevant skills. Note that (·) is a strictly increasing function, we have the following equivalent form of the monotonicity constraints (): min γα ≥ γα 0 = 0, αα 0 γq = max γα = min γα > max γα . α∈S1 α∈S1 α∈S0 (11) (12) In SLCM, q is uniquely determined by the structure vector δ. Mathematically, q = arg minα:aα δ |α|, where |·| is the cardinality. By such definition, γq = maxα∈S1 γα = minα∈S1 γα always holds, therefore, to verify (12), we only need to check, γq > max γα . (13) α∈S0 In the following two remarks, we list some observations that are useful in the proof. 2 p2 p Remark 8. 1. L0 L1 = S0 S1 = {α|α α 0 } p p 2. a q, p = 1 ⇒ S1 ⊆ L1 , S0 ⊇ L0 . Remark 9. (t) 1. ∀α, (t+1) γα,− p = γα,− p := γα,− p γα(t+1) = γα(t) (t+1) (t) (t+1) γα = γα − β (t) p + βp p 2. ∀α ∈ L0 , p 3. ∀α ∈ L1 , p 4. ∀α 1 , α 2 ∈ L1 , p 5. ∀α 1 , α 2 ∈ L0 , (t) (t) (t+1) (t+1) γ α1 > γ α2 ⇒ γ α1 > γ α2 (t) (t) (t+1) (t+1) γ α1 > γ α2 ⇒ γ α1 > γ α2 In the following lemma, we give the sufficient and necessary condition for (11). Lemma 6. (Lower bound 1) min γ (t+1) αα 0 α ≥ γα(t+1) =0 0 PSYCHOMETRIKA holds if and only if β (t+1) ≥ maxp (−γα,− p ). p (14) α∈L1 Proof. Since B (t) ∈ B(B), we have minα∈L p γα(t+1) = minα∈L p γα(t) ≥ 0. So we only need to 0 p consider α ∈ L1 , such that 0 (t) minp γα(t+1) = minp (γα,− p + β (t+1) ) ≥ 0, p α∈L1 α∈L1 which holds if and only if (14) holds. (t) We show the relationship between γq(t+1) (t+1) and γq (t) in the following lemma. Lemma 7. (t+1) γq (t+1) = γq (t) ,− p + β (t+1) . p Proof. (t+1) = γq (t) ,− p + β (t+1) . • If q (t+1) = q (t) , γq(t+1) p (t+1) = γq (t) (t) (t+1) • If q (t+1) q (t) , it implies δ p = 0 and δ p (t) (t) = 1. Therefore, q (t) , q (t+1) ∈ S1 , so that, (t) γq (t+1) = γq (t) = γq (t) ,− p . Then, (t+1) γq (t+1) = γq (t+1) ,− p + β (t+1) = γq (t) ,− p + β (t+1) . p p (t+1) = δ (t+1) = 0, and a q (t+1) + e p = a q (t) . Then, • If q (t+1) ≺ q (t) , it means δ (t) p = 1, β p p (t+1) γq(t+1) . (t+1) = γq (t+1) ,− p = γq (t) ,− p = γq (t) ,− p + β p Next, we give the sufficient and necessary condition for (13) in the following lemma. = 1, Lemma 8. (Lower bound 2) Suppose δ (t+1) p (t+1) γq (t+1) > max γα(t+1) , (t+1) α∈S0 if and only if, > maxp γα,− p − γq (t) ,− p . β (t+1) p α∈L0 (15) YINYIN CHEN ET AL. (t+1) (t+1) p Proof. Since δ p = 1, by Remark 2, we have S0 ⊇ L0 . It is easy to see that if (12) holds at time t + 1, then (15) holds, because (t+1) > max γα(t+1) ≥ maxp γα(t+1) = maxp γα,− p . γq (t+1) = γq (t) ,− p + β (t+1) p (t+1) α∈L0 α∈S0 α∈L0 Next we show that if (15) holds, then (13) holds at time t + 1. Because (12) holds at time t, we have, (t) γq(t) (t) > max γα ≥ max 3 (t) α∈S0 p α∈L1 (t) S0 γα(t) . (16) Next, we check (13) in two different scenarios. (t) (t+1) • If q (t) = q (t+1) , S0 = S0 , then by (16) and Remark 9.4, we obtain (t+1) (t+1) γq (t+1) = γq (t) > max 3 (t+1) S0 p α∈L1 (t) γα(t+1) . (t) (t) • If q (t) ≺ q (t+1) , then since q (t+1) , ∈ S1 , we have γq (t+1) > maxα∈L p 3 S(t) γα . By 1 0 Remark 9.4, we have (t+1) γq (t+1) > max 3 p α∈L1 (t) S0 γα(t+1) . On the other hand, since δ (t+1) = δ (t) + e p , {α|α ∈ S(t+1) ,α ∈ / S(t) 0 0 } = {α|α δ (t) , α δ (t+1) } ⊆ L0 = (L1 )c , p p leading to, p L1 4 (t+1) S0 p = L1 4 (t) S0 . (t+1) Proof of Proposition 1. Suppose δ p hold at time t + 1, if = 1, by Lemmas 6 and 8, the monotonicity constraints ! β (t+1) p 1 > max maxp (−γα,− p ), maxp γα,− p − γq (t) ,− p α∈L1 α∈L0 := max(L 1 , L 2 ) = L . In the following two lemmas, we discuss the flipping rule of δ p . (t) (t+1) Lemma 9. (Flipping rule 1) If δ p = 0, δ p t + 1 and L = 0. = 0, the monotonicity constraints hold at time PSYCHOMETRIKA Proof. The monotonicity constraints hold at time t + 1, because B (t) = B (t+1) and B (t) satisfy the constraints. (t) • L 1 = − minα∈L p γα ≤ 0 because (11) holds at t. 1 (t) (t) • L 2 = maxα∈L p γα − γq (t) = 0 because 0 (t) (t) γq (t) = min γα(t) ≤ maxp γα(t) ≤ max γα(t) = γq (t) , (t) p since L0 3 α α∈L0 α∈S1 (t) S1 is not empty. Therefore, L = max(L 1 , L 2 ) = 0. (t) (t+1) Lemma 10. (Flipping rule 2) Suppose δ p = 1, δ p at time t + 1 if L ≤ 0. = 0. The monotonicity constraints hold Proof. If q (t) = q (t+1) , the statement can be proved easily by Lemma 6 and Lemma 8. We check (11) and (13) in for the case that q (t) q (t+1) . (t+1) (t+1) ≤ 0 and minα∈L p γα • Since L 1 = − minα∈L p γα 1 0 at t + 1. p 3 (t+1) • By Remark 9.4, for any α ∈ L1 S0 , (t+1) (t+1) γq (t+1) = γq (t) (t+1) (t) = minα∈L p γα ≥ 0, (11) holds > γα(t+1) 0 (17) (t+1) Together with L 2 = maxα∈L p γα − γq (t+1) ≤ 0, (17) holds for any α ∈ 0 p 2 p 3 (t+1) L0 (L1 S0 ). p ,α ∈ / S(t) Further, as shown in the proof of Lemma 8, we have {α|α ∈ S(t+1) 0 0 } ⊆ L0 , such 2 3 p p (t+1) (t+1) that S0 ⊆ L0 (L1 S0 ). References Albert, J. H., & Chib, S. (1993). Bayesian analysis of binary and polychotomous response data. Journal of the American Statistical Association, 88(422), 669–679. Allman, E. S., Matias, C., & Rhodes, J. A. (2009). Identifiability of parameters in latent structure models with many observed variables. The Annals of Statistics, 37, 3099–3132. Carreira-Perpiñán, M., & Renals, S. (2000). Practical identifiability of finite mixtures of multivariate Bernoulli distributions. Neural Computation, 12, 141–152. Chen, Y., Culpepper, S. A., Chen, Y., & Douglas, J. (2018). Bayesian estimation of the DINA Q-matrix. Psychometrika, 83, 89–108. Chen, Y., Liu, J., Xu, G., & Ying, Z. (2015). Statistical analysis of q-matrix based diagnostic classification models. Journal of the American Statistical Association, 110(510), 850–866. Chiu, C.-Y., Douglas, J. A., & Li, X. (2009). Cluster analysis for cognitive diagnosis: Theory and applications. Psychometrika, 74(4), 633–665. Cox, D., Little, J., O’Shea, D., & Sweedler, M. (1994). Ideals, varieties, and algorithms. American Mathematical Monthly, 101(6), 582–586. Culpepper, S. A. (2019). Estimating the cognitive diagnosis Q matrix with expert knowledge: Application to the fractionsubtraction dataset. Psychometrika, 84, 333–357. Dang, N. V. (2015). Complex powers of analytic functions and meromorphic renormalization in QFT. arXiv preprint arXiv:1503.00995. Davier, M. (2005). A general diagnostic model applied to language testing data. ETS Research Report Series, 2005(2) de la Torre, J. (2011). The generalized DINA model framework. Psychometrika, 76(2), 179–199. YINYIN CHEN ET AL. de la Torre, J., & Douglas, J. A. (2004). Higher-order latent trait models for cognitive diagnosis. Psychometrika, 69(3), 333–353. DiBello, L. V., Stout, W. F., & Roussos, L. A. (1995). Unified cognitive/psychometric diagnostic assessment likelihoodbased classification techniques. In P. D. Nichols, S. F. Chipman, & R. L. Brennan (Eds.), Cognitively Diagnostic Assessment, Chapter 15 (pp. 361–389). New York: Routledge. Fang, G., Liu, J., & Ying, Z. (2019). On the identifiability of diagnostic classification models. Psychometrika, 84(1), 19–40. Goodman, L. A. (1974). Exploratory latent structure analysis using both identifiable and unidentifiable models. Biometrika, 61(2), 215–231. Gyllenberg, M., Koski, T., Reilink, E., & Verlaan, M. (1994a). Non-uniqueness in probabilistic numerical identification of bacteria. Journal of Applied Probability, 31(2), 542–548. Gyllenberg, M., Koski, T., Reilink, E., & Verlaan, M. (1994b). Non-uniqueness in probabilistic numerical identification of bacteria. Journal of Applied Probability, 31(2), 542–548. Haertel, E. H. (1989). Using restricted latent class models to map the skill structure of achievement items. Journal of Educational Measurement, 26(4), 301–321. Hagenaars, J. A. (1993). Loglinear models with latent variables (Vol. 94). Newbury Park, CA: Sage Publications Inc. Hartz, S.M. (2002). A Bayesian framework for the unified model for assessing cognitive abilities: Blending theory with practicality. Ph.D. thesis, University of Illinois at Urbana-Champaign. Junker, B. W., & Sijtsma, K. (2001). Cognitive assessment models with few assumptions, and connections with nonparametric item response theory. Applied Psychological Measurement, 25(3), 258–272. Kruskal, J. B. (1976). More factors than subjects, tests and treatments: An indeterminacy theorem for canonical decomposition and individual differences scaling. Psychometrika, 41(3), 281–293. Kruskal, J. B. (1977). Three-way arrays: Rank and uniqueness of trilinear decompositions, with application to arithmetic complexity and statistics. Linear Algebra and Its Applications, 18(2), 95–138. Liu, J., Xu, G., & Ying, Z. (2013). Theory of the self-learning q-matrix. Bernoulli, 19(5A), 1790–1817. Maris, E. (1999). Estimating multiple classification latent class models. Psychometrika, 64(2), 187–212. Mityagin, B. (2015). The zero set of a real analytic function. arXiv preprint arXiv:1512.07276. Rupp, A. A., Templin, J., & Henson, R. A. (2010). Diagnostic measurement: Theory, methods, and applications. New York: Guilford Press. Tatsuoka, C. (2002). Data analytic methods for latent partially ordered classification models. Journal of the Royal Statistical Society, Series C (Applied Statistics), 51(3), 337–350. Tatsuoka, K. K. (1984). Analysis of errors in fraction addition and subtraction problems. Final report, Technical report. Teicher, H., et al. (1961). Identifiability of mixtures. The Annals of Mathematical Statistics, 32(1), 244–248. Templin, J. L., & Henson, R. A. (2006). Measurement of psychological disorders using cognitive diagnosis models. Psychological Methods, 11(3), 287. Xu, G. (2017). Identifiability of restricted latent class models with binary responses. The Annals of Statistics, 45(2), 675–707. Xu, G., & Shang, Z. (2017). Identifying latent structures in restricted latent class models. Journal of the American Statistical Association, 113, 1284–1295. Yakowitz, S. J., & Spragins, J. D. (1968). On the identifiability of finite mixtures. The Annals of Mathematical Statistics, 39, 209–214. Manuscript Received: 10 AUG 2018 Final Version Received: 13 DEC 2019 J. Behav. Ther. & Exp. Psychiat. 43 (2012) 1088e1094 Contents lists available at SciVerse ScienceDirect Journal of Behavior Therapy and Experimental Psychiatry journal homepage: www.elsevier.com/locate/jbtep Effects of depression on reward-based decision making and variability of action in probabilistic learning Yoshihiko Kunisato a, Yasumasa Okamoto b, c, Kazutaka Ueda d, Keiichi Onoda e, Go Okada b, f, Shinpei Yoshimura b, g, Shin-ichi Suzuki a, Kazuyuki Samejima h, Shigeto Yamawaki b, c, * a Faculty of Human Sciences, Waseda University, 2-579-15, Mikajima, Tokorozawa, Saitama 359-1192, Japan Department of Psychiatry and Neurosciences, Graduate School of Biomedical & Health Sciences, Hiroshima University, 1-2-3, Kasumi, Minami-ku, Hiroshima-city, Hiroshima 734-8551, Japan c Core Research for Evolutional Science and Technology, 5, Sanbancho, Chiyoda-ku, Tokyo 102-0075, Japan d Graduate School of Engineering, The University of Tokyo, 7-3-1, Hongo, Bunkyo-ku, Tokyo 113-8656, Japan e Department of Neurology, Shimane University, 89-1, Enya-cho, Izumo-city, Shimane 693-8501, Japan f Department of Psychiatry, University of Michigan, 4250 Plymouth Road, Ann Arbor, MI 48109, USA g Research Fellow of the Japan Society for the Promotion of Science, 6, Ichibancho, Chiyoda-ku, Tokyo 102-8471, Japan h Tamagawa University Brain Science Institute, Graduate School of Brain Sciences, 6-1-1, Tamagawagakuen, Machida-city, Tokyo 104-8610, Japan b a r t i c l e i n f o a b s t r a c t Article history: Received 28 October 2011 Received in revised form 22 May 2012 Accepted 22 May 2012 Background and objectives: Depression is characterized by low reward sensitivity in behavioral studies applying signal detection theory. We examined deficits in reward-based decision making in depressed participants during a probabilistic learning task, and used a reinforcement learning model to examine learning parameters during the task. Methods: Thirty-six nonclinical undergraduates completed a probabilistic selection task. Participants were divided into depressed and non-depressed groups based on Center for Epidemiologic Studies eDepression (CES-D) cut scores. We then applied a reinforcement learning model to every participant’s behavioral data. Results: Depressed participants showed a reward-based decision making deficit and higher levels of the learning parameter s, which modulates variability of action selection, as compared to non-depressed participants. Highly variable action selection is more random and characterized by difficulties with selecting a specific course of action. Conclusion: These results suggest that depression is characterized by deficits in reward-based decision making as well as high variability in terms of action selection. Ó 2012 Elsevier Ltd. All rights reserved. Keywords: Depression Reward sensitivity Reinforcement learning Decision making Probabilistic learning Variability of action selection 1. Introduction Evidence suggests that depression is associated with hyposensitivity to reward (Eshel & Roiser, 2010). Depression is characterized by low positive affect derived from pleasant events and high negative affect in response to unpleasant events (Brown, Chorpita, & Barlow, 1998; Kasch, Rottenberg, Arnow, & Gotlib, 2002; McFarland, Shankman, Tenke, Bruder, & Klein, 2006; Watson et al., 1995). In laboratory behavioral studies, a signal detection theory approach has been used to examine the decreased reward * Corresponding author. Department of Psychiatry and Neurosciences, Graduate School of Biomedical & Health Sciences, Hiroshima University, 1-2-3, Kasumi, Minami-ku, Hiroshima 734-8551, Japan. Tel.: þ81 82 257 5208; fax: þ81 82 257 5209. E-mail address: [email protected] (S. Yamawaki). 0005-7916/$ e see front matter Ó 2012 Elsevier Ltd. All rights reserved. doi:10.1016/j.jbtep.2012.05.007 sensitivity said to be associated with depression. Henriques, Glowacki, and Davidson (1994) carried out a verbal recognition task that featured three conditions (no monetary feedback, monetary reward and monetary punishment), using signal detection theory to examine the effects of depression on recognition sensitivity and response bias. In their task, participant responses were more liberal in the reward and punishment conditions as compared to the neutral condition. Liberal responding in this context means optimizing responding so as to increase earnings rather than to make correct responses per se, such that a liberal response bias leads to increased earnings. Compared to nondepressed participants, both subclinically (Henriques et al., 1994) and clinically depressed participants (Henriques & Davidson, 2000) evidence less liberal response bias during a reward condition but not punishment or neutral conditions. Pizzagalli, Jahn, and O’Shea (2005) developed another signal detection task that was designed Y. Kunisato et al. / J. Behav. Ther. & Exp. Psychiat. 43 (2012) 1088e1094 to examine responsiveness to rewards. In their probabilistic reward task, participants were asked to identify whether the mouth length of a cartoon face was short or long, with participants receiving asymmetrical rewards depending on whether correct responses were provided. Non-depressed participants learned a response bias in which the highly reinforced choice was preferred, while nonclinical depressed participants and MDD patients showed low levels of response bias (Pizzagalli, Iosifescu, Hallett, Ratner, & Fava, 2008; Pizzagalli et al., 2005). These researchers have postulated that depression may impair the integration of reinforcement history (Pizzagalli et al., 2008). These earlier studies have linked depression with hyposensitivity to reward during recognition and discrimination tasks. However, some questions remain. The first is whether depression affects hyposensitivity to rewards in probabilistic learning that mixes reward and punishment contingencies. Although earlier studies have examined sensitivity to reward and punishment independently, in our daily lives we encounter many situations in which both reward and punishment are potential outcomes. For example, depressed people often avoid social gatherings in order to avoid having negative experiences in these situations (increased avoidance of punishment), at the potential cost of obtaining social rewards (decreased approach of reward). Most behaviors involve possible positive and negative outcomes, and it is not clear whether depressed individuals consistently exhibit hyposensitivity to reward in such learning scenarios; this possibility needs to be examined. The second question concerns whether depression does indeed affect the integration of reinforcement history. It has been postulated (based on signal detection studies) that depression is associated with impairments in the integration of reinforcement history over time (Pizzagalli et al., 2008), but such integration is not directly observable. Integration is not a direct measure of learning per se but is instead a learning parameter that modulates outcomes. Investigating an observable learning parameter that corresponds to the integration of reinforcement history is necessary for a complete understanding of learning impairments in depression. In an attempt to answer these questions, we used a probabilistic selection task (Frank, Seeberger, & O’Reilly, 2004) along with a reinforcement learning model to analyze the results (Behrens, Woolrich, Walton, & Rushworth, 2007; Daw, O’Doherty, Dayan, Seymour, & Dolan, 2006; Frank, Moustafa, Haughey, Curran, & Hutchison, 2007; O’Doherty et al., 2004; Samejima, Ueda, Doya, & Kimura, 2005; Tanaka et al., 2004). The probabilistic selection task is used to determine the degree to which participants’ decisions are based on reward and/or punishment during probabilistic learning. In this task (Frank et al., 2004; Frank, Woroch, & Curran, 2005), three stimulus pairs (AB pair, CD pair, EF pair) are presented in a random order. Participants have to choose one of two stimuli and are probabilistically reinforced via positive and negative feedback (correct or incorrect). A choice of stimulus A for the AB pair is associated with an 80% likelihood of positive feedback (and a corresponding 20% likelihood of negative feedback), whereas a choice of stimulus B leads to a 20% likelihood of positive feedback (and an 80% likelihood of negative feedback). CD and EF pair are less reliable than are AB pair. A choice of stimulus C for the CD pair leads to 70% positive feedback (with stimulus D leading to 30% positive feedback), while a choice of stimulus E for the EF pair leads to 60% positive feedback (stimulus F leads to 40% positive feedback). Participants are trained to choose stimuli A, C, and E more often than B, D, and F. Stimulus A leads to the highest probability of a positive outcome, whereas stimulus B leads to the highest probability of a negative outcome. After the training session, participants are evaluated the extent to which participants learn from positive and negative feedback of their decisions. 1089 Participants are tested with the same training pairs (i.e., AB, CD, EF) and all novel combinations of stimuli (e.g. AC, BE, etc.) in a random order. If participants learned from positive feedback, they should reliably choose stimulus A when presented with novel test pairs. In contrast, if participants learned from negative feedback, they should reliably avoid stimulus B. Modeling participants’ sequential behavioral data from a reinforcement learning standpoint provides a useful approach to examining an observable learning parameter that corresponds to the integration of reinforcement history (Sutton & Barto, 1998). Computational reinforcement learning models can be used to simulate the learning process and have been developed in a machine learning context. Such models allow us to examine internal learning parameters which are not observable via simple descriptions of behavioral data. We adopted the modified Q learning model (Frank et al., 2007) which contains three learning parameters: aG, controlling the updating speed of selected choice value following positive feedback, aL, controlling the updating speed of selected choice value following negative feedback, and s, controlling the degree of exploiting learned choice value. These learning parameters are associated with the integration of reinforcement history over time, in particular s, which represents direct regulation of this process. Chase et al. (2010) used a probabilistic selection task to examine hypersensitivity to punishment in MDD patients, finding no significant differences between MDD patients and healthy controls on task performance. We speculate that the reasons for this null finding include long decision duration (a maximum of 4 s), type of feedback provided, and antidepressant medication effects. MDD patients responded more slowly than did healthy controls (Chase et al., 2010) and patients also show a blunted response to feedback in terms of reaction times during reinforcement learning (Steele, Kumar, & Ebmeier, 2007). These results suggest that long decision durations will be unlikely to permit detection of group differences in probabilistic learning. Additionally, Chase et al. (2010) used verbal feedback (e.g. “correct”, “incorrect”), but it has been reported that monetary feedback enhances behavioral performance relative to non-monetary feedback (Small et al., 2005). A probabilistic selection task study of depression should feature short decision-making durations and the provision of monetary feedback. The objective of the present study was to use a probabilistic selection task to examine depression-related bias on reward or punishment based decision-making, as well as the effects of depression on the learning parameters derived from a computational reinforcement learning model. We hypothesized that depressed participants would show lower sensitivity to reward during probabilistic learning, as compared to non-depressed participants. We also explored possible depression-related impairments in learning parameters related to the integration of reinforcement history. 2. Materials and methods 2.1. Participants Sixty undergraduate student participants (17 men, 43 women; mean age ¼ 19.45 1.43 years) were recruited from Hiroshima University in Japan. Informed written consent was obtained from all participants. All participants completed the Center for Epidemiologic Studies - Depression scale (CES-D) (Radloff, 1977) when they were first recruited to participate in the experiment. Potential participants were ultimately excluded from further data analysis if they did not reach the attainment criterion during the probabilistic selection task, if they reported neurological illness, or if their CES-D score changed to a value over the cut-off point (CES-D score of 16) 1090 Y. Kunisato et al. / J. Behav. Ther. & Exp. Psychiat. 43 (2012) 1088e1094 from time of recruitment to time of the experiment. One excluded participant was being treated with antidepressant medication. Twelve participants were excluded due to changes in CES-D scores (6 potentially depressed participants and 6 potentially nondepressed participants). Eleven participants were excluded for not meeting the attainment criterion (8 depressed and 3 nondepressed). The final sample consisted of 18 depressed participants (CES-D score >¼ 16, mean age ¼ 19.44 0.98 years) and 18 non-depressed participants (CES-D score < 16, mean age 19.33 1.14 years). The study protocol was approved by the Hiroshima University ethics committee. 2.2. Procedure Before the behavioral task was conducted, participants completed several self-report measures of mood and depressive symptoms. The CES-D and Beck Depression Inventory-II (BDI-II) (Beck, Steer, & Brown, 1996) were used to assess levels of depressive symptoms. The trait form of the Positive and Negative Affect Scales (PANAS) (Watson, Clark, & Tellegen, 1988) was used to assess fundamental affect. We conducted the probabilistic selection task developed by Frank et al. (2004). Participants were verbally instructed in how to perform the task, and were told that the aim of the task was to maximize total gains. The task was presented using Presentation version 7.3 (Neurobehavioral Systems, California, USA) and lasted approximately 45 min. Participants sat in front of a computer screen and chose one of two paired visual stimuli by pressing either the left or right button on a mouse. We used meaningless pictures (Endo, Saiki, Nakao, & Saito, 2003) as visual stimuli. These pictures have been used in previous probabilistic learning studies (Ohira et al., 2009; Ohira et al., 2010). The task consisted of separate training and test phases. Fig. 1 shows the timing of visual stimulus presentation and participant responses. During the training phase, the stimulus events consisted of a fixation period of 500 ms, followed by a stimulus pair presentation for 750 ms, followed by a blank screen for 350 ms, and finally visual feedback for 600 ms (Fig. 1). Three different stimulus pairs (AB, CD, EF) were presented in a random order, such that each stimulus was randomly presented on the right or left side. Feedback was provided after each choice, in the form of a reward (þ10 yen) or punishment (10 yen). This feedback was probabilistic. Choosing A led to a reward on 80% of the AB pairs, and led to punishments (10 yen) on 20% of the AB pairs. Choosing B led to a reward on 20% of the AB pairs, and led to punishment on 80% of the AB pairs. The CD and EF pairs were less predictable. Choosing C led to reward on 70% of the CD pairs, while choosing E led to reward on 60% of the EF pairs. Participants continued the learning phase until they had met the criterion (choosing A over B on 65% of the trials, choosing C over D on 60% of the trials, and choosing E over F on 50% of the trials). Learning achievements were evaluated every block. A block consisted of 60 trials, and participants performed up to a maximum of 10 blocks (600 trials). Reward 600ms feedback Punishment or -10 yen +10 yen 350ms blank 750ms choice 500ms blank 750ms choice 500ms blank trial onset trial onset Learning session Combinations: AB,CD,EF Probability of reward: A(80%) B(20%) C(70%) D(30%) E(60%) F(40%) Test session Combinations: AB, CD, EF (original pair) AC, AD, AE, AF (Choose A) BC, BD, BE, BF (Avoid B) CE, CF, DE, DF (others) Fig. 1. Schematic outlining the probabilistic selection task. During the training phase, participants were presented with the three pair stimulus and learned to make selections such that rewards could be maximized. During the test phase, participants selected the stimulus from the training pair and all novel combinations of stimulus in a random order. Y. Kunisato et al. / J. Behav. Ther. & Exp. Psychiat. 43 (2012) 1088e1094 Once participants reached the criterion, they were tested with the same training pairs (i.e., AB, CD, EF) and all novel combinations of stimuli (e.g. AC, BE, etc.) in a random order. Each test pair was presented six times, for a total 90 trials. Participants were again instructed to choose stimuli in order to maximize total gains. If they were unsure of which stimulus to choose, they were instructed to use ‘gut instinct’. No feedback was provided during the test phase. The trials in which A (i.e., AC, AD, AE, AF) or B (i.e. BC, BD, BE, BF) was presented in novel pairings were of primary interest in the present task. Because stimulus A was most often associated with positive feedback during training, the more the participant chooses stimulus A, the more the participant learned or benefited from the positive feedback. Similarly, stimulus B was most often associated with negative feedback during training, such that the more the participant avoids stimulus B, the more the participant learned from the negative feedback. In addition, the correct response rate for the original stimulus pairs (i.e. AB, CD, EF) serves as an index of learning efficacy. After the behavioral task, participants completed the post experiment questionnaire (Frank et al., 2004), which assessed awareness judgments. Finally, participants were paid the money they acquired during the task (500yen w 1380yen). 1091 probability using the temperature parameter s, which is defined as randomness of action selection. The temperature parameter s is related to the degree to which learned values are exploited during task performance (i.e., when temperature parameter s becomes large, the selections become more random). For the probability of choosing stimulus A over stimulus B, we applied the following equation: Q A ðtÞ PA ðtÞ ¼ e Q A ðtÞ e s s (2) Q B ðtÞ þe s The same procedure applies for other trial types, replacing A and B with C, D, E, or F. The s parameter regulates the explorationeexploitation parameter of the reinforcement history. The Q-learning model was then fitted to each participant’s behavioral data by searching for the parameter values (aG, aL, s) that resulted in the best fit. Fitness of data was measured using the log likelihood estimation (LLE). The parameters were searched using the MATLAB Optimization Toolbox. LLE is computed as follows: LLE ¼ log Y Pi* ;t (3) t 2.3. Data analysis Between-groups t-tests were conducted on psychological assessment scores (CES-D, BDI-II, PANAS), the number of trials that participants need to reach the criterion, and the expected value of money gained during the test session. The expected value was computed for every selection during the test session via the following equation: (value of positive feedback (10 yen) * probability of positive feedback for selected choice) þ (value of negative feedback (10 yen) * probability of negative feedback for selected choice). Total expected value was then computed by summing all expected values for each selection. Test session measurements consisted of three scores (number of stimulus A choices, number of stimulus B avoidances, and number of correct responses in original training pair during test session). To examine whether the memory maintenance capability between training and test sessions did not differ across the groups, the number of correct responses for the original training pairs during the test session was included in test session measurements. Multivariate ANOVA was used to examine the group effects on these test session measurements. We used a reinforcement learning framework called the Qlearning model to clarify the parameters related to the learning process, including integration of reinforcement history, which is not directly observable on the basis of behavioral data (Sutton & Barto, 1998). In the Q learning models (Sutton & Barto, 1998), the Q value (action value) is computed using the learning rate (a), which controls speed of selected choice value updating. We applied the following modified Q learning model (Frank et al., 2007) to our behavioral data, computing the Q value for selecting each stimulus i during trial t. The updating rule for the Qi value involves Q i ðt þ 1Þ ¼ Q i ðtÞ þ aG ½rðtÞ Q i ðtÞþ þ aL ½rðtÞ Q i ðtÞ 3. Results Participants’ demographic and self-report data are shown in Table 1. Depressed participants reported significantly higher levels of depressive symptoms on the CES-D and the BDI-II than nondepressed participants, as well as higher levels of negative affect on the PANAS-NA. In addition, depressed participants reported significantly lower positive affect on the PANAS-PA than did nondepressed participants. Based on the post experiment questionnaire results (Frank et al., 2004), all participants were judged to be unfamiliar with the experimental stimuli and to have no irrational belief about feedback that would affect the results during the learning or test sessions. There were no significant group differences in the number of trials needed to reach the criterion (depressed: 360.00 146.97, non-depressed: 320 120.00; t(34) ¼ 0.894, p ¼ 0.38, Hedge’s g ¼ 0.29, 95%CI [0.37 0.95]) and the expected monetary value Table 1 Demographics and self-report data for depressed and non depressed participants (1) A for A > 0 A for A < 0 and ½A 0 otherwise 0 otherwise In this model, the Q value is updated using either learning rate for Gain: aG or learning rate for Loss: aL, depending on whether the outcome is better or worse than expected. aG controls the updating speed of selected choice value following positive feedback, whereas aL controls the updating speed of selected choice value following negative feedback. In addition, choice behavior was modeled using the softmax function (Sutton & Barto, 1998). We computed behavior selection where ½Aþ Pi*,t denotes the estimated probability of participant choice on trial t, which is computed via the softmax function (2). The maximum LLE value is the most predictive of participants’ behavioral data. We examined fit of the Q-learning model using the G2 statistic (Busemeyer & Stout, 2002). The G2 statistic is used to compare the log likelihood for Q-learning and baseline models. The baseline model in this case refers to one in which stimuli are chosen randomly (p ¼ 0.5 for each trial). Multivariate ANOVA was conducted to examine group effects on learning parameters. n Female/male Age CES-D (screening) CES-D (experiment) BDI-II PANAS PA PANAS NA Depressed Non depressed t p 18 13/5 19.44 32.11 28.06 21.94 20.44 23.56 18 13/5 19.33 1.14 9.61 3.40 8.72 3.51 4.83 2.92 28.39 6.46 13.11 3.82 0.756 0.10 <.001 3.99 <.001 3.22 <.001 3.55 <.001 1.26 <.001 1.75 0.98 7.21 7.73 6.17 6.12 7.52 0.31 11.98 9.66 10.64 3.79 5.25 Hedge’s g Note. n ¼ number of subject, CES-D ¼ the Center for Epidemiologic Studies Depression scale, BDI-II ¼ Beck Depression Inventory-II, PANAS PA ¼ Positive affect of Positive and Negative Affect Scales, PANAS NA ¼ Negative affect of Positive and Negative Affect Scales. Y. Kunisato et al. / J. Behav. Ther. & Exp. Psychiat. 43 (2012) 1088e1094 gained during the test session (depressed: 84.67 46.76, nondepressed:109.33 32.07; t(34) ¼ 1.846, p ¼ 0.07, Hedge’s g ¼ 0.62, 95%CI [1.29 0.05]). During the test session, participants correctly choose all relevant stimuli at an above chance level (greater than 50%). For all three test measurements, participants’ performance significantly departed from chance level (Choose A: mean ¼ 63%, 95%CI [57 69], t(35) ¼ 4.65, p < 0.001; Avoid B: mean ¼ 62%, 95%CI [56 69], t(35) ¼ 3.89, p < 0.001; Original pairs: mean ¼ 75%, 95%CI [71 80], t(35) ¼ 12.23, p < 0.001). A multivariate ANOVA (one factor: depression) with the three test measurements as dependent variables (scores of Choose A, Avoid B and original pairs) revealed a significant group difference for choosing A, F(1, 34) ¼ 10.45, p ¼ 0.003, Hedge’s g ¼ 1.08, 95%CI [1.78 0.38]. There were no significant group differences in terms of avoiding B, F(1,34) ¼ 0.10, p ¼ 0.745, Hedge’s g ¼ 0.11, 95%CI [0.76 0.55], and performance on the original pairs, F(1,34) ¼ 0.13, p ¼ 0.717, Hedge’s g ¼ - 0.12, 95% CI [0.78 0.53]. Depressed participants showed a lower propensity to choose A as compared to non-depressed participants (Fig. 2). G2 statistics for both groups were positive (12.59 7.04) and significantly greater than 0 (t (35) ¼ 10.73, p < 0.01). There were no significant log likelihood differences between the groups (depressed: 101.31 47.07, non-depressed: 86.65 36.67; t(34) ¼ 1.04, p ¼ 0.31, Hedge’s g ¼ 0.34, 95%CI [0.32 1.00]). A multivariate ANOVA (one factor: depression) with the three parameters entered as dependent variables showed a main effect of depression on the s (F(1,34) ¼ 4.83, p ¼ 0.035, Hedge’s g ¼ 0.73, 95% CI [0.06 1.41]) but not the aG (F(1,34) ¼ 3.12, p ¼ 0.09, Hedge’s g ¼ 0.59, 95%CI [0.08 1.26]) or aL (F(1,34) ¼ 1.25, p ¼ 0.271, Hedge’s g ¼ 0.37, 95%CI [0.29 1.03]). Depressed participants showed higher s values than non-depressed participants (Fig. 3). Correlational analyses were conducted to examine the relationship between learning parameters and learning and test phase performance (number of trials to reach criterion during the learning session, and performance on the test session as based on choosing stimuli to maximize gain). The learning parameter s correlated positively with the number of trials required to reach criterion (r ¼ 0.40, p < 0.05, 95%CI [0.08 0.64]). There were no significant correlations between learning parameters and test performance. 4. Discussion This was the first depression study to clarify the nature of reward-based decision making deficits during probabilistic Fig. 2. Proportion of choices made during the test session. The proportion of choose A stimulus (a), the proportion of avoid B stimulus (b), the proportion of correct selection in the original pairs (c). Error bars represent 95% confidence intervals. a Coefficient of metaparameters 1092 b 1.4 1.4 depressed non depressed c 1.4 1.2 1.2 1 1 1 0.8 0.8 0.8 0.6 0.6 0.6 0.4 0.4 0.4 0.2 0.2 0.2 0 0 Gain * 1.2 0 Loss Fig. 3. Effect of depressed mood on the learning rate for gain (a), learning rate for loss (b), and the variability of action choice (c) estimated from training session behavioral data. Error bars represent 95% confidence intervals. learning, as indexed by the learning parameter s. Our results demonstrate that depressed participants benefit less from rewardbased probabilistic learning and show greater variability in action selection than do non-depressed participants. Notably, our depressed and non-depressed groups showed no differences in punishment-based decision making, ability to memorize the original training pairs, and time needed to reach the criterion: Depression selectively affects reward-based decision making. This deficit is likely associated with hyposensitivity to reward, as demonstrated in previous signal detection studies (Henriques & Davidson, 2000; Henriques et al., 1994; Pizzagalli et al., 2008; Pizzagalli et al., 2005). Depression is characterized by anhedonia: An inability to experience pleasant affect (American Psychiatric Association, 2000). We examined both learning and decision-making processes, with our results suggesting that depressed participants appear to have no learning deficits per se, but that they do have a diminished tendency to base their decisions on likelihood of obtaining rewards. Our results support the hypothesis that diminished functioning of an approach-related system plays a key role in depression (Costello, 1983; Depue & Iacono, 1989; Eshel & Roiser, 2010; Loas, 1996). We also found that depressed participants showed higher learning parameter s values than non-depressed participants. This is the first study that has used an observable learning parameter to provide evidence of impaired integration of reinforcement history in depressed participants. The learning parameter s is defined as randomness of action selection, which is related to the degree to which learned values are exploited during task performance. Participants with high s values have trouble deciding upon and selecting a specific action as compared to participants who have low s values, such that action selection is more variable in such cases. Impairments in the integration of reinforcement history represent deficits in the exploitation of learned choice values, such that s is positively correlated with the length of time to achievement of the learning criterion. This suggests that participants with high s tended to spend more time completing the learning, incurring greater costs during this process than did those with low s values. High s values seem to index decision-making deficits that occur after learning has taken place, but such values were not related to reward or punishment based decision-making during the test session. Further studies are needed to clarify the relationship between s values and decision- making at test, along with possible mechanisms. Huys and Dayan (2009) formulated a behavioral control model of depression, based on a Bayesian reinforcement learning model. They argued that depression results from a mismatch between the Y. Kunisato et al. / J. Behav. Ther. & Exp. Psychiat. 43 (2012) 1088e1094 environment’s characteristics and a participants’ assumptions about them (Huys & Dayan, 2009). The high s values that we observed might reflect a depression-related tendency to assume that one’s environment is more uncertain than is actually the case, such that depressed participants have greater difficulty making choices that should lead to positive outcomes. However, our study does not permit us to make firm statements about the relationship between learning parameter s and specific assumptions about environmental characteristics. Future studies should examine the relationship between learning parameter s and such assumptions using a Bayesian reinforcement learning model and adequate tasks. Our findings are not consistent with those of Chase et al. (2010). In agreement with our prediction, a probabilistic selection task with short durations and monetary feedback is able to detect reward-based decision making deficits in depression. In our study, depressed undergraduates who were not taking antidepressant medication completed a probabilistic selection task. Recent functional neuroimaging studies have shown that antidepressant medication affects areas of the brain reward system such as the striatum (Abler, Gron, Hartmann, Metzger, & Walter, 2012; McCabe & Mishor, 2011; Ossewaarde et al., 2011). In one study, antidepressant medication led to increased activation of the brain reward system of previously unmedicated MDD patients, such that there were no significant differences between medicated MDD patients and healthy controls (Stoy et al., 2012). Antidepressant medication might have influenced the results of the Chase et al. (2010) study, which included individuals taking medication. Further studies of unmedicated MDD patients using a probabilistic selection task may help to resolve the inconsistency in findings. Two limitations of the present study need to be considered. First, our study was conducted on nonclinical undergraduates. This raises the possibility that our results do not directly apply to MDD patients. However, our use of the recommended CES-D cut-off point allowed us to examine reinforcement learning impairments in depressed participants. As described above, our results differ from those of a previous study conducted with MDD patients (Chase et al., 2010). Future studies should examine possible learning parameter impairments in MDD patients, using our or a similar probabilistic selection task. Second, we used short decision-making durations (750 ms). This short duration might have permitted us to identify a significant difference between depressed and non-depressed participants, but psychomotor symptoms may make it difficult for MDD patients to make decisions within a 750 ms timeframe. Further research is needed to elucidate an optimal decision-making timeframe that will permit useful comparisons between MDD patients and healthy controls to be made. 5. Conclusion The present study showed that depressive participants have a deficit in reward-based decision making in probabilistic learning contexts. Learning parameter s values were higher in depressed than in non-depressed participants. These results help to clarify the mechanisms of faulty decision making observed in depression. Understanding learning parameters that characterize depression will shed new light on depression-related impairments in decision making, thereby leading to more effective treatments and contributing to the development of a computational approach to the study of mood disorders. Acknowledgments This work was supported by the Strategic Research Program for Brain Sciences and Grant-in-Aid for Research Activity Start-up 1093 (23830088) from the Ministry of Education, Culture, Sports, Science and Technology of Japan. References Abler, B., Gron, G., Hartmann, A., Metzger, C., & Walter, M. (2012). Modulation of frontostriatal interaction aligns with reduced primary reward processing under serotonergic drugs. The Journal of Neuroscience, 32, 1329e1335. American Psychiatric Association. (2000). Diagnostic and statistical manual of mental disorders (4th ed., text revision). Washington, DC: American Psychiatric Association. Beck, A. T., Steer, R. A., & Brown, G. K. (1996). Beck depression inventory manual (2nd ed.). San Antonio, Texas: The Psychological Corporation. Behrens, T. E., Woolrich, M. W., Walton, M. E., & Rushworth, M. F. (2007). Learning the value of information in an uncertain world. Nature Neuroscience, 10, 1214e1221. Brown, T. A., Chorpita, B. F., & Barlow, D. H. (1998). Structural relationships among dimensions of the DSM-IV anxiety and mood disorders and dimensions of negative affect, positive affect, and autonomic arousal. Journal of Abnormal Psychology, 107, 179e192. Busemeyer, J. R., & Stout, J. C. (2002). A contribution of cognitive decision models to clinical assessment: decomposing performance on the Bechara gambling task. Psychological Assessment, 14, 253e262. Chase, H. W., Frank, M. J., Michael, A., Bullmore, E. T., Sahakian, B. J., & Robbins, T. W. (2010). Approach and avoidance learning in patients with major depression and healthy controls: relation to anhedonia. Psychological Medicine, 40, 433e440. Costello, E. J. (1983). Information processing for decision making in depressed women. A study of subjective expected utilities. Journal of Affective Disorders, 5, 239e251. Daw, N. D., O’Doherty, J. P., Dayan, P., Seymour, B., & Dolan, R. J. (2006). Cortical substrates for exploratory decisions in humans. Nature, 441, 876e879. Depue, R. A., & Iacono, W. G. (1989). Neurobehavioral aspects of affective disorders. Annual Review of Psychology, 40, 457e492. Endo, N., Saiki, J., Nakao, Y., & Saito, H. (2003). Perceptual judgments of novel contour shapes based on their hierarchical descriptions of geometrical properties. Japanese Journal of Psychology, 74, 346e353. Eshel, N., & Roiser, J. P. (2010). Reward and punishment processing in depression. Biological Psychiatry, 68, 118e124. Frank, M. J., Moustafa, A. A., Haughey, H. M., Curran, T., & Hutchison, K. E. (2007). Genetic triple dissociation reveals multiple roles for dopamine in reinforcement learning. Proceedings of the National Academy of Sciences of the United States of America, 104, 16311e16316. Frank, M. J., Seeberger, L. C., & O’Reilly, R. C. (2004). By carrot or by stick: cognitive reinforcement learning in parkinsonism. Science, 306, 1940e1943. Frank, M. J., Woroch, B. S., & Curran, T. (2005). Error-related negativity predicts reinforcement learning and conflict biases. Neuron, 47, 495e501. Henriques, J. B., & Davidson, R. J. (2000). Decreased responsiveness to reward in depression. Cognition & Emotion, 14, 711e724. Henriques, J. B., Glowacki, J. M., & Davidson, R. J. (1994). Reward fails to alter response bias in depression. Journal of Abnormal Psychology, 103, 460e466. Huys, Q. J., & Dayan, P. (2009). A Bayesian formulation of behavioral control. Cognition, 113, 314e328. Kasch, K. L., Rottenberg, J., Arnow, B. A., & Gotlib, I. H. (2002). Behavioral activation and inhibition systems and the severity and course of depression. Journal of Abnormal Psychology, 111, 589e597. Loas, G. (1996). Vulnerability to depression: a model centered on anhedonia. Journal of Affective Disorders, 41, 39e53. McCabe, C., & Mishor, Z. (2011). Antidepressant medications reduce subcorticalcortical resting-state functional connectivity in healthy volunteers. NeuroImage, 57, 1317e1323. McFarland, B. R., Shankman, S. A., Tenke, C. E., Bruder, G. E., & Klein, D. N. (2006). Behavioral activation system deficits predict the six-month course of depression. Journal of Affective Disorders, 91, 229e234. O’Doherty, J., Dayan, P., Schultz, J., Deichmann, R., Friston, K., & Dolan, R. J. (2004). Dissociable roles of ventral and dorsal striatum in instrumental conditioning. Science, 304, 452e454. Ohira, H., Fukuyama, S., Kimura, K., Nomura, M., Isowa, T., Ichikawa, N., et al. (2009). Regulation of natural killer cell redistribution by prefrontal cortex during stochastic learning. NeuroImage, 47, 897e907. Ohira, H., Ichikawa, N., Nomura, M., Isowa, T., Kimura, K., Kanayama, N., et al. (2010). Brain and autonomic association accompanying stochastic decision-making. NeuroImage, 49, 1024e1037. Ossewaarde, L., Verkes, R. J., Hermans, E. J., Kooijman, S. C., Urner, M., Tendolkar, I., et al. (2011). Two-week administration of the combined serotoninnoradrenaline reuptake inhibitor duloxetine augments functioning of mesolimbic incentive processing circuits. Biological Psychiatry, 70, 568e574. Pizzagalli, D. A., Iosifescu, D., Hallett, L. A., Ratner, K. G., & Fava, M. (2008). Reduced hedonic capacity in major depressive disorder: evidence from a probabilistic reward task. Journal of Psychiatric Research, 43, 76e87. Pizzagalli, D. A., Jahn, A. L., & O’Shea, J. P. (2005). Toward an objective characterization of an anhedonic phenotype: a signal-detection approach. Biological Psychiatry, 57, 319e327. Radloff, L. S. (1977). The CES-D scale: a self-report depression scale for research in the General Population. Applied Psychological Measurement, 1, 385e401. 1094 Y. Kunisato et al. / J. Behav. Ther. & Exp. Psychiat. 43 (2012) 1088e1094 Samejima, K., Ueda, Y., Doya, K., & Kimura, M. (2005). Representation of actionspecific reward values in the striatum. Science, 310, 1337e1340. Small, D. M., Gitelman, D., Simmons, K., Bloise, S. M., Parrish, T., & Mesulam, M. M. (2005). Monetary incentives enhance processing in brain regions mediating top-down control of attention. Cerebral Cortex, 15, 1855e1865. Steele, J. D., Kumar, P., & Ebmeier, K. P. (2007). Blunted response to feedback information in depressive illness. Brain, 130, 2367e2374. Stoy, M., Schlagenhauf, F., Sterzer, P., Bermpohl, F., Hagele, C., Suchotzki, K., et al. (2012). Hyporeactivity of ventral striatum towards incentive stimuli in unmedicated depressed patients normalizes after treatment with escitalopram. Journal of Psychopharmacology, 26, 677e688. Sutton, R. S., & Barto, A. G. (1998). Reinforcement learning: An introduction. Cambridge, MA: MIT Press. Tanaka, S. C., Doya, K., Okada, G., Ueda, K., Okamoto, Y., & Yamawaki, S. (2004). Prediction of immediate and future rewards differentially recruits cortico-basal ganglia loops. Nature Neuroscience, 7, 887e893. Watson, D., Clark, L. A., & Tellegen, A. (1988). Development and validation of brief measures of positive and negative affect: the PANAS scales. Journal of Personality and Social Psychology, 54, 1063e1070. Watson, D., Weber, K., Assenheimer, J. S., Clark, L. A., Strauss, M. E., & McCormick, R. A. (1995). Testing a tripartite model: I. Evaluating the convergent and discriminant validity of anxiety and depression symptom scales. Journal of Abnormal Psychology, 104, 3e14. Journal of Mathematical Psychology 66 (2015) 59–69 Contents lists available at ScienceDirect Journal of Mathematical Psychology journal homepage: www.elsevier.com/locate/jmp The relation between reinforcement learning parameters and the influence of reinforcement history on choice behavior Kentaro Katahira Department of Psychology, Graduate School of Environmental Studies, Nagoya University, Furo-cho, Chikusa-ku, Nagoya, Aichi, 464-8601, Japan highlights • • • • Reinforcement learning (RL) models and regression models have been used for choice data analysis. We investigated the relation between these two approaches. We found a special case in which an RL model is equivalent to a regression model. Based on the relation, we discuss how the RL parameters are related to history dependence. article info Article history: Received 7 November 2014 Received in revised form 19 February 2015 Available online 15 May 2015 Keywords: Reinforcement learning History dependence Regression model Model-based analysis abstract Reinforcement learning (RL) models have been widely used to analyze the choice behavior of humans and other animals in a broad range of fields, including psychology and neuroscience. Linear regressionbased models that explicitly represent how reward and choice history influences future choices have also been used to model choice behavior. While both approaches have been used independently, the relation between the two models has not been explicitly described. The aim of the present study is to describe this relation and investigate how the parameters in the RL model mediate the effects of reward and choice history on future choices. To achieve these aims, we performed analytical calculations and numerical simulations. First, we describe a special case in which the RL and regression models can provide equivalent predictions of future choices. The general properties of the RL model are discussed as a departure from this special case. We clarify the role of the RL-model parameters, specifically, the learning rate, inverse temperature, and outcome value (also referred to as the reward value, reward sensitivity, or motivational value), in the formation of history dependence. © 2015 The Author. Published by Elsevier Inc. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). 1. Introduction Reinforcement learning (RL) models have been widely used to analyze the choice behavior of living systems in a wide range of behavioral studies, including psychology and neuroscience (Corrado & Doya, 2007; Daw, 2011; O’Doherty et al., 2004; O’Doherty, Hampton, & Kim, 2007; Yechiam, Busemeyer, Stout, & Bechara, 2005). Evidence of the neural correlates for the subcomponents assumed in RL theory (e.g., reward prediction error, action value) provides the validity to perform an RL model-based analysis for choice behavior (Niv, 2009; Samejima, Ueda, Doya, & Kimura, 2005; Schultz, 1997). An essential feature of the RL model is the formulation of what action to take based on previous experiences of reward or punishment regarding the action. In addition, the linear regression-based E-mail address: [email protected]. approach, using reward history and choice history as explanatory variables and future choice as an objective variable, has also been used to model choice behavior (Corrado, Sugrue, Seung, & Newsome, 2005; Katahira, Fujimura, Okanoya, & Okada, 2011; Kovach et al., 2012; Lau & Glimcher, 2005; Seo, Barraclough, & Lee, 2009; Seo & Lee, 2009; Seymour, Daw, Roiser, Dayan, & Dolan, 2012; Sugrue, Corrado, & Newsome, 2004). The linear regression approach is useful for estimating how reward and choice histories influence future action (e.g., how much influence the reward from n trials ago has on future actions). However, the relation between the RL model and regression models has not been explicitly addressed. Specifically, to what extent and how the predictions differ between the two models has not been explored. Hence, the dependence on reward history in RL models has not been clearly described. In the present study, we aimed to clarify the relation between the parameters of the RL model and the influence of reward history on future choice (specifically, the regression coefficients of the logistic regression models). Because the regression model can http://dx.doi.org/10.1016/j.jmp.2015.03.006 0022-2496/© 2015 The Author. Published by Elsevier Inc. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4. 0/). 60 K. Katahira / Journal of Mathematical Psychology 66 (2015) 59–69 directly represent the dependency on the reward and choice histories, investigating the relationship between RL-model parameters and regression models would provide valuable information about which behavioral factors may underlie the differences in the model parameters. Conversely, using the relation, one can predict which types of the behavioral sequences can be expected given a specific set of model parameters. To achieve these aims, we performed analytical calculations and numerical simulations. We focused on fundamental RL-model parameters: the learning rate, the outcome value (also referred to as the reward value, reward sensitivity, or motivational value), and the inverse temperature (also referred to as the exploration parameter). These parameters have been used to characterize how psychological factors or personality traits of individuals affect choice behavior (Katahira, Fujimura, Matsuda, Okanoya, & Okada, 2014; Katahira et al., 2011; Kunisato et al., 2012; Lindström, Selbing, Molapour, & Olsson, 2014). However, how these parameters are related to particular behavioral aspects has not been explored sufficiently. The present study will aid in the interpretation of the different impacts of the RL-model parameters. In the present study, we focus on probabilistic learning tasks (also called bandit problems), in which a decision-maker must choose between a set of options, each with different unknown reward rates, to maximize the total reward. The reward rates can dynamically change during the task, but they do not depend on past choices. Such probabilistic learning tasks have been widely used in psychology and neuroscience research. Simplified Q-learning models have often been used in RL model-based analysis of data obtained using this task. A general Q-learning model computes the action value, which is an expected future reward, for each ‘‘state’’ (Watkins & Dayan, 1992). However, for the probabilistic learning tasks that we consider here, there is only one state, and thus, a state variable is not required. Thus, in this study, we focus on a simplified Q-learning model without a state variable, and we will refer to this model as simply the ‘‘Q-learning model’’. In this paper, we first introduce several variants of Q-learning models for probabilistic learning tasks. Next, we describe a logistic regression model, which is a typical regression model used to analyze choice data. Among the variants of the Q-learning models, we find that the forgetting Q-learning model (F-Q model), in which the value of an unchosen option decays by the same amount as the value of chosen, non-rewarded option, is able to make predictions equivalent to those of the logistic regression model. We can view the general Q-learning model as a model that deviates from this special case. The deviation clarifies the special properties of standard RL models. We then present numerical simulation results that demonstrate the relation between the parameters in Q-learning models and the history dependence of choice. Finally, we discuss several implications of our results. Based on the set of action values, the model computes the probability of choosing option 1 using the soft max function: P (a(t ) = 1) = = exp (β Q1 (t )) (1) exp (β Q1 (t )) + exp (β Q2 (t )) 1 1 + exp (−β [Q1 (t ) − Q2 (t )]) , (2) where β is the inverse temperature parameter that determines the sensitivity of the choice probabilities to difference in values. The model subsequently evaluates the outcome of the action. The outcome value in trial t is denoted by R(t ). We typically simply set the binary value for R(t ) such that R(t ) = 1 if a reward is given and R(t ) = 0 if no reward is given. The impact of different outcomes may be quantified by choosing parameters R(t ) = κ1 if outcome 1 is given, R(t ) = κ2 if outcome 2 is given, and R(t ) = 0 if a control outcome is given (Katahira et al., 2014, 2011, 2015). Based on the outcome, the action values for the chosen option i are updated as follows: Qi (t + 1) = Qi (t ) + αL (R(t ) − Qi (t )) , (3) where αL is the learning rate that determines how much the model updates the action value depending on the reward prediction error, R(t ) − Qi (t ). For the unchosen option j (i ̸= j), the action value is updated as follows: Qj (t + 1) = Qj (t ) − αF Qj (t ) (4) = (1 − αF )Qj (t ), (5) where αF is the forgetting rate (Ito & Doya, 2009). In a common RL model-based analysis, the action value of the unchosen option is not typically updated. This convention can be represented by setting αF = 0. We call this the standard Q-learning model. In this study, the forgetting rate parameter plays an important role in the identification of the connection between the regression and RL models, as discussed later. 2.2. Linear regression models Next, we will introduce a regression model that predicts a choice from the reward and choice history of previous trials (Corrado et al., 2005; Lau & Glimcher, 2005; Sugrue et al., 2004). Here, we consider a binary outcome case such that R(t ) = 1 when the reward is given and R(t ) = 0 when no reward is given. Following the convention of Corrado et al. (2005) and Lau and Glimcher (2005), we represent the reward history r (t ) as follows: 1 r (t ) = −1 0 if option 1 is chosen and a reward is given at trial t , if option 2 is chosen and a reward is given at trial t , if no reward is given at trial t . We represent the choice history c (t ) as follows: 1 if option 1 is chosen at trial t , if option 2 is chosen at trial t . 2. Models c (t ) = 2.1. Reinforcement learning models With these history variables, the regression model is defined with a predictor: Here, we introduce an RL model (Sutton & Barto, 1998). Specifically, we consider the Q-learning model (Watkins & Dayan, 1992), which is the most commonly used model for model-based analysis of choice behavior. Throughout the paper, we consider a case with only two options; however, our results can be generalized to multiple-option cases. The model assigns each action, i, an action value, Qi (t ), where t is the index of the trial. In the default setting, the initial action values, Qi (1), are set to zero, i.e., Q1 (1) = Q2 (1) = 0. Let a(t ) ∈ {1, 2} denote the option that was chosen at trial t. h(t ) = −1 Mr m=1 br (m)r (t − m) + Mc bc (m)c (t − m), (6) m=1 where br (m) and bc (m) are the regression coefficients for the trial m trials ago. The constants Mr and Mc are the history length for the reward history and the choice history (from the past trials to the current trial), respectively. Sugrue et al. (2004) and Corrado et al. (2005) used a linear regression approach with an identitylink function and optimized the regression coefficients so that they K. Katahira / Journal of Mathematical Psychology 66 (2015) 59–69 minimized the sum of squared errors between h(t ) and c (t ). Here, we consider logistic regression with the logit link function (Kovach et al., 2012; Lau & Glimcher, 2005; Seymour et al., 2012)1 : 1 P (a(t ) = 1) = 1 + exp(−h(t )) . (7) We estimate the regression coefficients using the maximum likelihood method. The linear and logistic regressions do not yield qualitatively different results. 3. Analytical calculation 3.1. General formulation To analyze the relation between the Q-learning and logistic regression models, we transform the update rules (Eqs. (3) and (5)) that were originally in the form of a recurrence formula. First, we define the following quantity δt ,i = 1 0 if a(t ) = i if a(t ) ̸= i. (8) With this expression, the update rules (Eqs. (3) and (5)) can be summarized as Qi (t + 1) = Qi (t ) + αt∗,i δt ,i R(t ) − Qi (t ) , 61 β as a free parameter. When different values are assigned to different outcomes, such as R(t ) = κ1 if outcome 1 is given, R(t ) = κ2 if outcome 2 is given, and R(t ) = 0 if the control outcome is given, the relative values among the different outcomes are meaningful; however, the absolute value is not meaningful. This is not the case if the initial value is non-zero, i.e., Qi (1) ̸= 0. However, if αL is sufficiently larger than 0 and if there are sufficient trials, the impact of Qi (1) becomes weak compared with the other terms, and it can be neglected. Taken together, as long as we consider the binary reward history (0: absence of reward, 1: existence of reward) and the case in which the initial value is negligible, varying β and the outcome value κ has the same effect on the reward history dependence of choice probability. 3.2. Special case of the Q-learning model (the forgetting Q-learning model) Here, we consider the special case where the forgetting rate is identical to the learning rate, i.e., αL = αF (Barraclough, Conroy, & Lee, 2004; Ito & Doya, 2009). This model is referred to as the forgetting Q-learning model (F-Q model; Ito & Doya, 2009). For this case, αt∗,i = αL , and Eq. (11) becomes Qi (t + 1) = (1 − αL )t Qi (1) (9) + αL where t −1 (1 − αL )k−1 δt −k,i R(t − k). (12) k=0 αt∗,i = αL δt ,i + αF (1 − δt ,i ). (10) Expanding this update rule back into the past, Qi (t + 1) = Qi (t ) + αt∗,i δt ,i R(t ) − Qi (t ) ∆Q (t + 1) = (1 − αL )t [Q1 (1) − Q2 (1)] = (1 − αt∗,i )Qi (t ) + αt∗,i δt ,i R(t ) = (1 − αt∗,i ) (1 − αt∗−1,i )Qi (t − 1) + αt∗−1,i δt −1,i R(t − 1) + αt∗,i δt ,i R(t ) + αL k=0 ∆Q (t + 1) = αL k=0 y(t ) = Mr j =1 t −1 + k=0 αt∗−k,i δt −k,i br (m) δt −m,1 R(t − m) − δt −k,2 R(t − m) m=1 (1 − αj∗,i ) Qi (1) k−1 (15) Furthermore, the logistic regression model (Eq. (6)) can be written as we obtain Qi (t + 1) = t −1 (1 − αL )k−1 × δt −k,1 R(t − k) − δt −k,2 R(t − k) . + (1 − αt∗,i )αt∗−1,i δt −1,i R(t − 1) + αt∗,i δt ,i R(t ) = ···, (1 − αL )k−1 δt −k,1 R(t − k) − δt −k,2 R(t − k) . (14) If the difference in the initial values of the Q s is zero, i.e., Q1 (1) − Q2 (1) = 0, or it is negligible, Eq. (14) is reduced to + (1 − αt∗,i )αt∗−1,i δt −1,i R(t − 1) + αt∗,i δt ,i R(t ) = (1 − αt∗,i )(1 − αt∗−1,i ) (1 − αt∗−2,i )Qi (t − 2) + αt∗−2,i δt −2,i R(t − 2) t (13) t −1 = (1 − αt∗,i )(1 − αt∗−1,i )Qi (t − 1) From the softmax function (Eq. (2)), we notice that the difference in Q-values between the two options (∆Q (t ) ≡ Q1 (t ) − Q2 (t )) affects the choice probability. From Eq. (12), for ∆Q , we obtain + (1 − αt∗−l,i ) R(t − k). (11) l=0 Note that all terms, with the exception of the first term, are of the first degree in R. Thus, if all initial values of Qi are zero, i.e., Qi (1) = 0 for all i, the transformation R → a × R with a constant a has an equivalent effect on choice probability as the transformation Q → a × Q or β → a × β . Thus, scaling the reward magnitude R by a factor a would not affect the choice probability (Eq. (2)) if β is scaled by 1/a at the same time. This suggests that it is redundant to parameterize the outcome values such that R(t ) = 0 when no reward is given and R(t ) = κ when a reward is given while leaving bc (m) δt −m,1 − δt −m,2 . (16) i=1 A comparison of Eqs. (15) and (16) indicates that these equations are equivalent if we neglect the cutoff effect (the effect of neglecting the terms for more than Mr trials ago), with the following relation br (1) = βαL , br (2) = βαL (1 − αL ), br (3) = βαL (1 − αL )2 , . . . , br (Mr ) = βαL (1 − αL )Mr −1 , bc (1) = bc (2) = · · · = bc (Mc ) = 0. (17) The sum of the coefficients for the reward history, m br (m), converges to β regardless of the learning rate αL if the cutoff effect can be neglected because ∞ 1 This model is also called a conditional logit model. Mc m=1 br (m) = β ∞ m=0 αL (1 − αL )m = β. (18) 62 K. Katahira / Journal of Mathematical Psychology 66 (2015) 59–69 3.3. Standard Q-learning model (with the forgetting rate αF = 0) The standard learning model in which the action value for the unchosen option is not updated is represented by setting αF = 0 in Eq. (9). With this setting, αt∗,i = αL δt ,i Qi (t + 1) = (1 − αL δj,i ) Qi (1) t −1 + αL δt −k,i k=0 t2 (1 − τ )m−1 δt −m,i . (24) m=1 (1 − αL δt −l,i ) R(t − k). br (2) = βαL (1 − αL ), . . . , br (Mr ) = βαL (1 − αL )Mr −1 , (19) l =0 Denoting the number of trials in which option i is chosen in the trials from trial t1 to trial t2 as nt 1 , t 2 , i ≡ t −1 br (1) = βαL , j=1 k−1 Ci (t ) = τ For the F-Q model, i.e., αL = αF , substituting this choice trace into Eq. (22) and comparing the resulting equation to Eq. (16), we find that these equations are equivalent if we neglect the cutoff effect, with the following relations: and Eq. (11) becomes t to the weighted sum of the choice history, with exponentially decaying coefficients: bc (1) = βϕτ , bc (2) = βϕτ (1 − τ ), . . . , bc (Mc ) = βϕτ (1 − τ )Mc −1 . (25) 4. Results δ t ,i , t = t1 Eq. (19) becomes Qi (t + 1) = (1 − αL )n1,t ,i Qi (1) t −1 + αL δt −k,i (1 − αL )nt −k+1,t ,i R(t − k). (20) k=0 Regarding ∆Q , we have ∆Q (t + 1) = (1 − αL )n1,t ,1 Q1 (1) − (1 − αL )n1,t ,2 Q2 (1) t −1 + αL δt −k,1 (1 − αL )nt −k+1,t ,1 R(t − k) k=0 t −1 − δt −k,2 (1 − αL ) nt −k+1,t ,2 R(t − k) . (21) k=0 It should be noted that as the factor (1 − αL )nt −k+1,t ,i indicates, the influence of a reward (R = 1) depends on the frequency that the subject chooses the same option after the choice and is no longer a constant; thus, it cannot be precisely mapped onto the regression model. 3.4. The choice-autocorrelation factor It has been shown that humans tend to repeat the recent choices, a tendency called ‘‘choice perseverance’’, ‘‘choice stickiness’’, or ‘‘decision inertia’’. Such a tendency is often explicitly modeled by a choice-autocorrelation factor that is added to the action value as follows (Akaishi, Umeda, Nagase, & Sakai, 2014; Gershman, Pesaran, & Daw, 2009; Huys et al., 2011): P (a(t ) = i) = exp (β[Qi (t ) + ϕ Ci (t )]) 2 , (22) exp (β[Qk (t ) + ϕ Ck (t )]) k=1 where Ci (t ) is a trace that quantifies how frequently the option i was chosen recently. The choice trace weight ϕ is a parameter that controls the tendency to repeat (when positive) or avoid (when negative) recently chosen options. An example of the choice trace is a trace computed using the following update rule (Akaishi et al., 2014): Ci (t + 1) = (1 − τ )Ci (t ) + τ δt ,i (23) where we have defined δt ,i as in Eq. (8) and the initial values are set to zero, i.e., C1 (1) = C2 (1) = 0. This choice trace is equivalent We performed numerical simulations to confirm the validity of the analytical calculations and to investigate the relation between the RL and regression models. Throughout the simulations, we adopted the following procedures. First, we generated the choice data from the Q-learning models that performed hypothetical decision-making tasks. We subsequently fitted the parameters of the logistic regression model to the simulated data using the maximum likelihood method. We set the history length as Mr = 10 and Mc = 10. For the majority of the simulations, the Q-learning models performed a simple, probabilistic learning task, unless otherwise stated. In the probabilistic learning task, one of the options was associated with a higher reward probability, pr , compared with the other option that had a reward probability of 1 − pr . With the probability for the chosen option, the reward was given (R(t ) = κ); otherwise, no reward was given (R(t ) = 0). We used pr = 0.7 and κ = 1.0, unless otherwise stated. After each 50trial block, the contingencies of the two stimuli were reversed, and the model performed 500 trials in total (thus, there were 9 reversals in one session). We generated data for 5,000 sessions per condition, which resulted in 2,500,000 trials per condition. The use of a large data set reduces the estimation error of the regression coefficient. We confirmed that the confidence intervals of all regression coefficients were less than 0.05. Therefore, the statistical estimation error can be neglected in the interpretation of the results. 4.1. Special case of the Q-learning model (F-Q model) We began the simulations by confirming the analytical relation between the Q-learning model with the special case αF = αL (F-Q model) and the logistic regression model. We set the inverse temperature parameter β = 3.0. Fig. 1(A) shows the regression coefficients of the logistic regression that were analytically predicted from Eq. (17) (squares) and those obtained by statistical model fitting to the simulated data (solid lines). We can confirm that these two results almost perfectly agree, which supports the validity of the analytical calculation. Eq. (18) predicts that the total sum of the regression coefficient for reward history is equal to the inverse temperature (here, β = 3.0) independent of the learning rate if the cut-off effect can be neglected. Fig. 1(B) shows that this is the case with the exception of the small learning/forgetting rate case where a negative deviation from the predicted value (=3.0) was observed, particularly for a short history length (e.g., Mr = 10). This exception is because of the cutoff effect of reward history: for the small learning/forgetting rate case, greater than 10 previous trials have an influence on the current choice (Fig. 1(A)). Because these trials were not included in K. Katahira / Journal of Mathematical Psychology 66 (2015) 59–69 A 63 C B Fig. 1. Simulation and analytical results for the special case in which the learning rate (αL ) and the forgetting rate (αF ) are identical (F-Q model). These two parameters were varied while fixed to the same value (αL = αF ). (A) The regression coefficients for the reward history (top) and the choice history (bottom). The solid line represents the coefficients estimated for the logistic regression model fitted to simulated data generated by the Q-learning models. The squares represent the analytical predictions obtained using Eq. (17). (B) The total sum of the regression coefficients for the reward history (top) and the choice history (bottom), while varying the length of the history included in the regression model (Mr = Mc ). (C) The scatter plot of the predictions regarding the current choice (P (a(t ) = 1)) derived from the Q-learning model and the regression model for varying learning rates (with identical forgetting and learning rates). For (A) and (C), the history lengths Mr and Mc are both set to 10. The other parameters were κ = 1 and β = 3.0. the regression model, the total sum of the regression coefficient became smaller than the inverse temperature. Consistent with this interpretation, increasing the history length Mr reduced the discrepancy (i.e., as Mr increased: 20, 30, and 40). We also examined how well the prediction of choice, i.e., P (a(t ) = 1), of the two models matched. Fig. 1(C) compares the predictions of the Q-learning and regression models. Each dot represents the prediction for one trial. The samples from the first 5 sessions are shown for visibility. If the two models’ predictions perfectly agree, the samples should lie on the diagonal line. We see that for the case with a small learning/forgetting rate (αL = αF = 0.2) and a short history length of Mr = 10, there is a slight difference between the two models because of the cut-off effect of reward history in the regression models. With the exception of this case, the two models exhibited almost perfect agreement, as predicted by the analytical calculation. 4.2. The effects of the departure of the forgetting parameter from the learning rate Next, we consider how the deviation of the forgetting rate αF from the learning rate αL affects the behavior of the Q-learning model. Fig. 2 shows the simulation results with a varying αF and a fixed learning rate, αL = 0.4. It is noteworthy that a dependence on choice history arose (bc ̸= 0). When αF was smaller than 64 K. Katahira / Journal of Mathematical Psychology 66 (2015) 59–69 B A Fig. 2. The effects of the deviation of the forgetting rate αF from the learning rate αL = 0.4. (A) The regression coefficients for reward history (top) and choice history (bottom). The convention is the same as in Fig. 1(A) with the exception that the squares that represent the analytical predictions are only shown for the case with αF = 0.4. (B) The scatter plot of the prediction regarding the current choice (P (a(t ) = 1)) derived from the Q-learning model and the regression model with a varied forgetting rate and a learning rate fixed at 0.4. The other parameters were κ = 1, and β = 3.0. Table 1 Model comparison between the full regression model (Mr = Mc = 10) and the regression model without choice history (Mr = 10, Mc = 0). Data were derived from the Q-learning model using the same sample for which results are shown in Fig. 2. AIC, Akaike Information Criterion (unit is 1,000); Correlation, Pearson product-moment correlation coefficient between the predictions of the Q-learning and regression models. Model Full model No choice history Criteria Forgetting rate AIC (×10 ) Correlation (r ) AIC (×103 ) Correlation (r ) 3 0.0 0.2 0.4 0.6 0.8 1.0 2506.02 0.979 2559.84 0.955 2290.50 0.998 2297.92 0.995 2296.85 1.0 2296.84 1.0 2331.24 0.999 2335.26 0.997 2362.75 0.995 2375.27 0.990 2390.28 0.990 2413.85 0.977 αL , e.g., the standard Q-learning model (αF = 0), a negative dependence on choice history was identified (bc < 0). In contrast, when αF was larger than αL , a positive dependence on choice history was identified (bc > 0). To examine whether the inclusion of choice history in regression modeling improves the prediction, we computed the Akaike Information Criterion (AIC) for the regression models and correlation coefficients of the Q-learning and regression model predictions (Table 1). The AIC is a measure of the predictive ability of a model, and a smaller AIC score indicates a better prediction. With the exception of the case of αL = αF = 0.4 (the F-Q model), the AICs were smaller for the full model with choice history (Mc = 10) compared with the model without choice history (Mc = 0); these findings indicate that the inclusion of choice history provides a better fit. The full regression model exhibited a larger correlation coefficient between the predictions of the Q-learning and regression models, which indicates that the full model’s predictions agreed with the Q-learning model better than the model without choice history. The reason the dependence on choice history arose is explained as follows. Consider an extreme case where no reward was given in the last Mr trials (R(t − 1) = · · · = R(t − Mr ) = 0). In this case, the regression model in which only reward history is included predicts that the subject chose option 1 with a probability of 0.5, i.e., P (a(t ) = 1) = 0.5. However, this consequence differs from the actual behavior of the Q-learning model. In the Q-learning model with αF < αL , the value of the unchosen option remains unchanged (when αF = 0) or decays slowly compared with the chosen option (when αF > 0). In contrast, the value of the chosen option decays, and the tendency for switching the option increases. Thus, the regression coefficients for the choice history, bc , become negative. When the forgetting rate is larger than the learning rate (αF > αL ), the value of the unchosen option decays faster than the chosen option. Thus, the model tends to favor the same choice that was made in the immediate past and accordingly, bc is a positive value. Although these properties of the Q-learning model can be captured, in part, via the incorporation of choice history in the regression model, it does not make the prediction perfect. This point is clarified by the scatter plot of the prediction P (a(t ) = 1) for the two models (Fig. 2(B)). The greater the difference between αL and αF , the larger the deviation of the prediction of the two models (discrepancy from the diagonal lines). This tendency can also be confirmed by the correlation coefficients in Table 1. As the multiplicative factor (1 − αL )nt −k+1,t ,i in Eq. (21) indicates, the influence of an outcome depends on the frequency that the subject chose the same option after the choice. This influence cannot be exactly expressed by the regression model, which does not explicitly include the number of choices after the outcome. This factor represents a property that the general Q-learning model possesses but regression models cannot represent. For a specific trial, the number of times the same option is chosen before the trial is roughly independent of the predictions of the Q-learning model for the trial unless the prediction of the Q-learning model does not take extreme values (i.e., when P (a(t ) = 1) is close to 0 or 1). Thus, the discrepancies are uniformly distributed around the diagonal line, to a certain extent. When the forgetting rate is larger than the learning rate (i.e., αF = 0.8, 1.0), the predictions of the Q-learning model tend to be distributed within a set of discrete values. This occurs because the value of the unchosen option quickly decays to zero; both options have a chance to be the unchosen option, and it is rare that one option continues to be chosen many times so that K. Katahira / Journal of Mathematical Psychology 66 (2015) 59–69 A 65 B Fig. 3. The effects of parameters in a standard Q-learning model. (A) The effects of the outcome value parameter κ on the dependence of reward history. The other parameters were set to αL = 0.4, αF = 0, and β = 3.0. (B) The effects of the learning rate αL , where αF = 0, β = 3.0, and κ = 1.0. For each panel, the upper two graphs show the regression coefficients for reward and choice history. The lower two graphs show the total sum of the regression coefficients as a function of the varying parameter. the corresponding Q-value continuously change. Thus, P (a(t ) = 1) is distributed discretely. 4.3. The effects of outcome value and learning rate in the standard Q-learning model We examined the influence of the outcome value parameter κ and the learning rate αL for the standard Q-learning model in which the forgetting rate is zero (αF = 0). As previously discussed, if the initial action values are all zero (i.e., Q1 (1) = Q2 (1) = 0), varying the inverse temperature β has the same impact on the choice as scaling R(t ) by the same factor. Thus, varying κ is equivalent to varying β by the same amount if we set the value of the neutral outcome to zero. Therefore, we examined the effects of κ , instead of β , while setting the value of the neutral outcome to zero. Fig. 3(A) shows the estimated regression coefficients with varying κ . As expected, the outcome value had a monotonic effect on the regression coefficients over the entire reward history. In addition, for the regression coefficients for choice history, the outcome value also had a monotonic effect; the larger the κ , the greater the negative dependence on the choice history was found to be. Fig. 3(B) shows the effects of the learning rate αL on the influence of reward and choice history. The basic effect of the learning rate observed for the F-Q model (αL = αF ; Fig. 1(A)) is also retained for the standard Q-learning model. For the influence of reward in the immediate past, the learning rate has a positive monotonic effect. In contrast, for the reinforcement effect in the far past, the relation is inverted. The larger the learning rate, the greater the decay of the regression coefficient. In contrast to the F-Q model, where the sum of the regression coefficients is independent of the learning rate (Eq. (18)) if the cut-off effect can be neglected, the standard Q-learning model results in a region of decreasing total regression coefficients as a function of the learning rate αL (Fig. 3(B), third panel from top), in addition to an increasing region because of the cut-off effect. This is a result of the unchanging value of the unchosen option in the standard Q-learning model. Because the effect of retaining the value of the unchosen option is enhanced for the case with a lower learning rate, the total impact of reward history becomes smaller as the learning rate increases, which shapes the decreasing part. It is notable that for the case with αL = 1.0 in the standard Q-learning model, even the reward two trials ago has an influence (Fig. 3(B)) because of the effect of retaining the value of the unchosen option. In contrast, in the F-Q model 66 K. Katahira / Journal of Mathematical Psychology 66 (2015) 59–69 A B Fig. 4. The effects of reinforcement schedules. The Q-learning model parameters were set to αL = 0.4, αF = 0.0, κ = 1.0, and β = 3.0. (A) The effects of the reward probability in the probabilistic learning task. (B) The effects of the withdrawal rate in the competitive foraging task. See the main text for details. with αL = αF = 1.0, only a reward given one trial ago has an influence (Fig. 1(A)). In addition, the learning rate has a non-monotonic effect on the regression coefficients of the choice history (Fig. 3(B), middle). The impact of the choice history has a maximum at approximately αL = 0.5. 4.4. The effects of the reinforcement schedule Our analytical calculation demonstrated that for the special case where the forgetting rate αF equals the learning rate αL , the regression coefficient is determined independent of the task structure, i.e., the reinforcement schedule. For the general case where αL ̸= αF , however, the reinforcement schedule may affect the influence of the previous reward history because the impact of the reward history depends on the number of the same choices after the reward is given, as shown in Eq. (21). To examine this effect, first, we conducted a simulation with varying reward probabilities for the optimal option (pr ) from 0.5 to 0.9 (with reward probability for the non-optimal option being 1 − pr ). Fig. 4(A) shows the regression coefficients obtained by the simulation. The closer to 0.5 that pr was (the more difficult it is to discriminate the optimal choice), the smaller the decay of the regression coefficients, although the effect was weak. This result is explained as follows. When the difference in reward probabilities of two options is small, the difference between two action values tends to be small; thus, the model is likely to switch the choice. Therefore, the number times the same option is repeated becomes smaller, which leads to a smaller decay of the influence of reward history. In the probabilistic learning tasks that we have considered thus far, the reward probability is independent of the subjects’ previous choices. Next, we investigated what happens if the reward is given depending on the previous choice. A typical reinforcement schedule for this situation is a variable interval (VI) schedule. In a VI schedule, a reward is assigned to options stochastically and independently of the subject’s choice, and the reward remains until it is harvested by choosing the option. The probabilistic learning task belongs to a variable ratio (VR) schedule in which a reward is assigned to each option independent of choice, but the reward is immediately withdrawn unless the subject chooses the option and harvests the reward. For the simulation, we adopted the competitive foraging task proposed by Sakai & Fukai (2008) as the reinforcement schedule (see also Katahira, Okanoya, & Okada, 2012). This schedule combines the VI and VR schedules via the introduction of a withdrawal rate that is denoted by µi for option i. With the probability of µi , a reward at option i is removed if the subject does not obtain it. If µa′ = 0, ∀a′ , the task corresponds to the VI schedule. If µa′ = 1, ∀a′ , the task corresponds to the VR schedule. An intermediate region of µa provides mixtures of the VI and VR schedules. Fig. 4(B) shows the regression coefficients for the simulation with varying withdrawal rates (µ, common to both options) and a fixed reward assignment rate of 0.3 for one option and 0.7 for another option. These probabilities remained constant throughout the task (500 trials), i.e., no reversal occurred. The smaller the withdrawal rate (i.e., closer to VI schedule), the smaller the regression coefficient decay of the reward history and the more negative the regression coefficients for the entire choice history. The reason for these effects is as follows. With lower withdrawal rates, the maintenance of the same option following a reward is less likely rewarded in the subsequent trials compared with the other option. Thus, the model tends to switch the option because of the decrease in the action value compared with the case with a smaller or zero withdrawal rate (VR schedule). Therefore, the number of times the same option repeats decreases, and the decay of the influence of reward history becomes slower. 4.5. The effects of the choice-autocorrelation factor We have seen that the standard Q-learning model exhibits a (negative) dependency on choice history, as well as on reward history. However, it is difficult to control the dependence on choice history explicitly by tuning a model parameter. As described above, human choice behavior has a property called choice perseverance, which is a tendency to repeat the same choice as the recent choice (Akaishi et al., 2014; Gershman et al., 2009; Huys et al., 2011). A straightforward way to represent choice perseverance in the Q-learning is simply to add a residual choice-autocorrelation factor to the action values when computing the choice probability, as in Eq. (22). Here, we investigated the effect of the choiceautocorrelation factor on history dependence using a probabilistic learning task (with pr = 0.7). K. Katahira / Journal of Mathematical Psychology 66 (2015) 59–69 A 67 B Fig. 5. The effects of the choice-autocorrelation factor. (A) The FQ-learning model with varying ϕ (choice autocorrelation weight). The parameters were set to αL = 0.4, αF = 0.4, κ = 1.0, β = 3.0, and τ = 0.4. (B) The standard Q-learning model with varying ϕ (choice autocorrelation weight). The parameters were set to αL = 0.4, αF = 0.0, κ = 1.0, and β = 3.0. The task used for simulation was the same probabilistic learning task for which results are shown in Figs. 1–3. For the F-Q model (αF = αL ), the impact of the choice autocorrelation factor on the regression coefficients can be evaluated analytically (Eq. (25)). Fig. 5(A) confirms the analytical relation: The results of the simulation (solid lines) and theoretical prediction agreed. The choice-autocorrelation factor does not influence the dependence on the reward history (all of the solid lines overlapped each other); instead, the effect of the choice-autocorrelation factor appeared in the regression coefficients for the choice history. On the other hand, for the general case in which αL ̸= αF , the choice-autocorrelation factor can also influence the reward history because it directly affects the number of identical choices that are made after a reward is given, and this effect influences the regression coefficients for the reward history, as discussed above (see Eq. (21)). Fig. 5(B) shows the simulation results for the standard Q-learning model with αF = 0 and illustrates one example of this effect. As ϕ increases in a positive domain, the tendency to repeat the same choice increases and enhances the decay of the influence of the reward history compared to the case in which there was no choice-autocorrelation factor (ϕ = 0). As ϕ decreases in a negative domain, the opposite effect is observed. The residual choiceautocorrelation factor has an additive effect on the regression coefficients for the choice history, bc . Taken together, the effects of the choice autocorrelation factor on the dependence on the history are largely additive and straightforward. For the general case, however, this factor may modulate the dependence on reward history through the property that we have observed in previous results, i.e., that it depends on the number of times that the option is chosen. 5. Discussion RL and linear regression models are valuable tools for the analysis of the psychological/neural processes that underlie choicesequence data on a trial-by-trial basis. However, the relation between the two models has not been described, and these models have been used independently. In the present study, we aimed to clarify the relation between the two approaches using analytical calculations and computer simulations. Here, we summarize our findings and describe their implications. We demonstrate the special condition where the Q-learning and regression models provide an identical prediction. This is the case when the following three conditions hold: (1) the forgetting and learning rates are identical (αL = αF ; F-Q model), (2) the influence of the initial action value can be neglected (e.g., the initial values are zero), and (3) the cutoff effect of the history length (Mr ) included in the regression model can be neglected. When the first condition is not met (i.e., when the forgetting rate differs from the learning rate), as is the case for the standard Q-learning model (αF = 0), a dependence on choice history arises. This dependence on choice history is relatively complex and cannot be completely captured by a conventional regression model. This dependence on choice history is a property of the RL model, which distinguishes it from simple regression models. It should be noted that this apparent dependence on choice history was found only by examining the relationship between regression models and RL models, as we have done in the present study, rather than by simply examining the RL model itself. An apparent dependence on choice history in the general Qlearning model arises from differences in the ‘‘clocks’’ for different options. In the standard Q-learning model (αF = 0), the action value for the unchosen option does not change, i.e., the clock stops at this trial. In contrast, in the F-Q model (αF = αL ), the clock proceeds at the same speed for all options; for the unchosen option, the value decays at the same rate that the option is chosen, but no reward is given. Thus, for the standard Q-learning model, a reward in a more distant past can influence a future choice, depending on how many times the option was chosen. The more general case (αF ̸= αL ) can be understood by this analogy. When αF < αL , the clock for unchosen option moves more slowly than the clock for the chosen option. In contrast, for the case αF > αL , the clock for the unchosen option moves more quickly than the clock for the chosen option; thus, a reward in the more distant past from a frequently chosen option influences a future choice less. If the actual computational process in a decision-maker is similar to that employed in the standard Q-learning model, i.e., the value of the unchosen option remains unchanged, better predictions could be achieved by constructing a regressor of the regression with different clocks for each option. Specifically, such a model should include the variables that represents reward or 68 K. Katahira / Journal of Mathematical Psychology 66 (2015) 59–69 choice n trials back in trials in which that option was chosen, rather than in actual trials (as in the method discussed in this paper). However, for more general cases (αF ̸= αL , αF ̸= 0), mapping the RL model to the regression model is not straightforward. Hence, which RL model is most appropriate does matter for determining which kind of regression model is most suitable for analyzing a given data set. Ito and Doya (2009) fitted various models to rat choice behavior. Their results demonstrated that the F-Q model (αL = αF ) better fit the rats’ choice behaviors compared with the standard Q-learning model with αF = 0, and the F-Q model had approximately the same prediction performance as the Q-learning model with differential forgetting (DF-Q model; αL and αF were independent free parameters). The F-Q model was ultimately favored because the F-Q model has fewer parameters than the DF-Q model. Together with our discussion in the present study, Ito and Doya’s results may suggest that the simple regression model (with an actual trial count) would suffice to describe choice behavior in rats. We have discussed the similarities and differences between RL models and regression models. Which approach researchers should use depends on the goals and actual computational processes of the decision-maker. The functional forms of the dependence on reward history in the Q-learning model are restricted to exponential decay by the model structure, whereas in regression models, the dependence on reward history can take any form. If the pattern of decay is not exponential, for example, if it is a heavy-tailed, double-exponential form (Corrado et al., 2005), and if the goal is to predict a future choice, a linear regression model may be more appropriate and may provide a better prediction. In contrast, if the actual learning process is similar to that in the general Q-learning model rather than that in the F-Q model, the Q-learning model may provide better predictions. In addition, the Q-learning model may provide a better prediction when the model has a smaller number of parameters than the regression model does. Moreover, as the RL model can represent the value update rule explicitly, one can easily grasp the computational process with this representation. For example, it is not straightforward how the computational process can lead to a double-exponential decay pattern, and several computational models have been generated to attempt to explain this pattern (eg., Saito, Katahira, Okanoya, & Okada, 2014). Most importantly, the variables in the RL model, such as action values and reward prediction errors, can provide a regressor of neural activities that can be used to find the corresponding neural substrates (Daw, 2011; O’Doherty et al., 2007). Furthermore, including a factor that can affect behavior and testing the effects of such a factor, based on data, are often straightforward with RL model-based approaches compared to regression approaches. The RL-model parameters, such as the learning rate, inverse temperature, and outcome values, have been used to characterize several factors of decision making, such as the social attitudes of individuals (Lindström et al., 2014), effects of mental disorders (Kunisato et al., 2012; Yechiam et al., 2005), brain dysfunctions (Yechiam et al., 2005), and effects of emotional outcome (Katahira et al., 2014, 2011, 2015). Model-based analysis is a promising approach to investigating the specific parameters that best characterize the factors that influence learning and behavior. However, it is important to consider which types of differences in properties of behavior, e.g., dependence on past experience, choice bias, or choice randomness, yield differences in the model parameters. The present study provides guidelines for relating the effects identified in model-based analysis to the properties of actual behavior. Below, we summarize how the RL-model parameters are related to the history dependence of choice; a portion of these results was obtained in the present study. First, the learning rate αL largely controls how the weights for past outcomes are balanced, i.e., how much the model weighs more recent outcomes compared to outcomes in the more distant past. In the F-Q model (αL = αF ), the learning rate does not influence the total weight (the sum of the regression coefficients for reward history). However, we demonstrated that if the learning rate and the forgetting rate differ, then the total weight can be a decreasing function of the learning rate. This finding implies that increasing αL does not necessarily lead to an increase in the cumulative effect of the recent reward history. Therefore, the value of the learning rate should be interpreted with caution. The inverse temperature β and the outcome value κ had essentially the same effect on the history dependence unless the outcome values varied for different outcomes. These parameters uniformly and multiplicatively control the weights for past events. Thus, the summed influence of the past reward history is a monotonically increasing function of these parameters. The residual choice-autocorrelation factor Ci (t ) has an additive effect on the dependence on choice history. In the general case, this factor may modulate the dependence on reward history. One limitation of the present study is that it only concerned learning tasks without state transition dynamics that can be formulated as Markov decision processes. Although much research in psychology and neuroscience has been conducted using simplified Q-learning models without state variables, as discussed in the Introduction, several studies using RL model-based analyses have examined RL models that incorporate state variables (Daw, Gershman, Seymour, Dayan, & Dolan, 2011; Gläscher, Daw, Dayan, & O’Doherty, 2010). In addition, we have focused on a specific algorithm of RL, i.e., Q-learning. Other algorithms for RL have been used for model-based analyses of choice behavior, e.g., actor-critic algorithm (O’Doherty et al., 2004). Although the scope of the present study is limited, the basic results found here can be applied to the more general case. Other algorithms that incorporate learning state transitions (Daw et al., 2011; Gläscher et al., 2010) essentially employ a sequential update rule, such as a delta rule (as in Q-learning); thus, these algorithms likely produce exponential decay in history dependence. The main results of the present study are the findings that if all of the variables decay at the same speed, then the history dependence can be represented using an appropriate regression model, but if the speeds differ or if updating is stopped when the variable is irrelevant to the current experience, then the history dependency would be more complicated. Future studies are needed to investigate the history dependence of choices in more general RL models, including models that include state variables and models with different learning algorithms. Acknowledgment This work was supported in part by the Grants-in-Aid for Scientific Research (KAKENHI) grants 24700238 and 26118506. References Akaishi, R., Umeda, K., Nagase, A., & Sakai, K. (2014). Autonomous mechanism of internal choice estimate underlies decision inertia. Neuron, 81(1), 195–206. Barraclough, D. J., Conroy, M. L., & Lee, D. (2004). Prefrontal cortex and decision making in a mixed-strategy game. Nature Neuroscience, 7(4), 404–410. Corrado, G., & Doya, K. (2007). Understanding neural coding through the modelbased analysis of decision making. Journal of Neuroscience, 27(31), 8178. Corrado, G., Sugrue, L. P., Seung, H. S., & Newsome, W. T. (2005). Linear-nonlinearPoisson models of primate choice dynamics. Journal of the Experimental Analysis of Behavior, 84(3), 581–617. Daw, N. (2011). Trial-by-trial data analysis using computational models. In Decision making, affect, and learning: attention and performance XXIII 23, 1. Daw, N., Gershman, S. J., Seymour, B., Dayan, P., & Dolan, R. J. (2011). Model-based influences on humans’ choices and striatal prediction errors. Neuron, 69(6), 1204–1215. http://dx.doi.org/10.1016/j.neuron.2011.02.027. Gershman, S. J., Pesaran, B., & Daw, N. D. (2009). Human reinforcement learning subdivides structured action spaces by learning effector-specific values. Journal of Neuroscience, 29(43), 13524–13531. http://dx.doi.org/10.1523/JNEUROSCI. 2469-09.2009. Gläscher, J., Daw, N., Dayan, P., & O’Doherty, J. P. (2010). States versus rewards: dissociable neural prediction error signals underlying modelbased and model-free reinforcement learning. Neuron, 66(4), 585–595. http://dx.doi.org/10.1016/j.neuron.2010.04.016. Huys, Q. J., Cools, R., Gölzer, M., Friedel, E., Heinz, A., Dolan, R. J., & Dayan, P. (2011). Disentangling the roles of approach, activation and valence in instrumental and pavlovian responding. PLoS Computational Biology, 7(4), e1002028. K. Katahira / Journal of Mathematical Psychology 66 (2015) 59–69 Ito, M., & Doya, K. (2009). Validation of decision-making models and analysis of decision variables in the rat basal ganglia. Journal of Neuroscience, 29(31), 9861. Katahira, K., Fujimura, T., Matsuda, Y.-T., Okanoya, K., & Okada, M. (2014). Individual differences in heart rate variability are associated with the avoidance of negative emotional events. Biological Psychology, 103, 322–331. http://dx.doi.org/10.1016/j.biopsycho.2014.10.007. Katahira, K., Fujimura, T., Okanoya, K., & Okada, M. (2011). Decision-making based on emotional images. Frontiers in Psychology, 2, 311. Katahira, K., Matsuda, Y.-T., Fujimura, T., Ueno, K., Asamizuya, T., Suzuki, C., et al. (2015). Neural basis of decision-making guided by emotional outcomes. Journal of Neurophysiology, in press. Katahira, K., Okanoya, K., & Okada, M. (2012). Statistical mechanics of rewardmodulated learning in decision-making networks. Neural Computation, 24(5), 1230–1270. Kovach, C. K., Daw, N. D., Rudrauf, D., Tranel, D., O’Doherty, J. P., & Adolphs, R. (2012). Anterior prefrontal cortex contributes to action selection through tracking of recent reward trends. Journal of Neuroscience, 32(25), 8434–8442. http://dx.doi.org/10.1523/JNEUROSCI.5468-11.2012. Kunisato, Y., Okamoto, Y., Ueda, K., Onoda, K., Okada, G., Yoshimura, S., et al. (2012). Effects of depression on reward-based decision making and variability of action in probabilistic learning. Journal of Behavior Therapy and Experimental Psychiatry, 43(4), 1088–1094. http://dx.doi.org/10.1016/j.jbtep.2012.05.007. Lau, B., & Glimcher, P. W. (2005). Dynamic response-by-response models of matching behavior in rhesus monkeys. Journal of the Experimental Analysis of Behavior, 84(3), 555–579. Lindström, B., Selbing, I., Molapour, T., & Olsson, A. (2014). Racial bias shapes social reinforcement learning. Psychological Science, 25(3), 711–719. Niv, Y. (2009). Reinforcement learning in the brain. Journal of Mathematical Psychology, 53(3), 139–154. O’Doherty, J., Dayan, P., Schultz, J., Deichmann, R., Friston, K., & Dolan, R. (2004). Dissociable roles of ventral and dorsal striatum in instrumental conditioning. Science, 304(5669), 452–454. 69 O’Doherty, J., Hampton, A., & Kim, H. (2007). Model-based fmri and its application to reward learning and decision making. Annals of the New York Academy of sciences, 1104(1), 35–53. Saito, H., Katahira, K., Okanoya, K., & Okada, M. (2014). Bayesian deterministic decision making: a normative account of the operant matching law and heavy-tailed reward history dependency of choices. Frontiers in Computational Neuroscience, 8. Sakai, Y., & Fukai, T. (2008). The actor-critic learning is behind the matching law: matching versus optimal behaviors. Neural Computation, 20(1), 227–251. Samejima, K., Ueda, Y., Doya, K., & Kimura, M. (2005). Representation of actionspecific reward values in the striatum. Science, 310(5752), 1337. Schultz, W. (1997). A neural substrate of prediction and reward. Science, 275(5306), 1593–1599. http://dx.doi.org/10.1126/science.275.5306.1593. Seo, H., Barraclough, D. J., & Lee, D. (2009). Lateral intraparietal cortex and reinforcement learning during a mixed-strategy game. Journal of Neuroscience, 29(22), 7278–7289. http://dx.doi.org/10.1523/JNEUROSCI.1479-09.2009. Seo, H., & Lee, D. (2009). Behavioral and neural changes after gains and losses of conditioned reinforcers. Journal of Neuroscience, 29(11), 3627–3641. http://dx.doi.org/10.1523/JNEUROSCI.4726-08.2009. Seymour, B., Daw, N. D., Roiser, J. P., Dayan, P., & Dolan, R. (2012). Serotonin selectively modulates reward value in human decision-making. Journal of Neuroscience, 32(17), 5833–5842. http://dx.doi.org/10.1523/JNEUROSCI.005312.2012. Sugrue, L. P., Corrado, G., & Newsome, W. T. (2004). Matching behavior and the representation of value in the parietal cortex. Science, 304(5678), 1782–1787. Sutton, R. S., & Barto, A. G. (1998). Reinforcement learning: an introduction. Cambridge, MA, USA: MIT Press. Watkins, C., & Dayan, P. (1992). Q-learning. Machine Learning, 8(3), 279–292. Yechiam, E., Busemeyer, J., Stout, J., & Bechara, A. (2005). Using cognitive models to map relations between neuropsychological disorders and human decisionmaking deficits. Psychological Science, 16(12), 973–978. JPII 8 (1) (2019) 75-80 Jurnal Pendidikan IPA Indonesia http://journal.unnes.ac.id/index.php/jpii ASSESSMENT OF SEVENTH GRADE STUDENTS’ CAPACITY OF LOGICAL THINKING M. Fadiana*1, S. M. Amin2, A. Lukito3, A. Wardhono4, S. Aishah5 Department of Mathematic Education, Faculty of Teacher Training and Education, Universitas PGRI Ronggolawe Tuban 2,3 Department of Mathematic, Faculty of Math and Science, Universitas Negeri Surabaya 5 Department of Defence Science, Faculty of Defence Science and Technology, National Defence University of Malaysia 1,4 DOI: 10.15294/jpii.v0i0.11644 Accepted: October 2nd, 2018. Approved: March 25th, 2018. Published: March 28th, 2019 ABSTRACT The objective of this research was to quantify the logical thinking capacity of seventh grade students in Tuban, East Java, Indonesia. This research was conducting using the quantitative descriptive method and 119 students in the seventh grade of secondary school in Tuban during the Academic Year 2016/2017. The data was collected by using Logical Thinking Test (LTT) which comprises of six different kind of reasonings; namely conservational reasoning, proportional reasoning, controlling variables, combinatorial reasoning, probabilistic reasoning, and correlational reasoning (source). Based on LTT, scores are categorized into three levels; concrete operational levels, transitional levels, and formal operational levels (source). The results of the research display that (88.21%) of the seventh grade are classified at the concrete operational level, (10.08%) at the transitional level, and (1.68%) at the formal operational level. After conducting this research, teachers are now able to design teaching tactics and have a better understanding of secondary school student’s cognitive development and behavior. © 2019 Science Education Study Program FMIPA UNNES Semarang Keywords: logical thinking, concrete operational, formal operational INTRODUCTION Logistical thinking is a though process using logic, rationality and reason (source). Logic is defined as the discipline that examines the assembly of information and differentiates it between right and wrong reasoning (source). Logic can also be identified as the tool of correct thinking, the key to the processes of mental reservation and complex problem-solving (source). This means it have the capability to solve problems by using mental operations or one’s ability to range *Correspondence Address E-mail: [email protected] principles or rules by making confidentgeneralizations or ideas (Yaman, 2005; Fero et al., 2009; Şengül & Üner, 2010). Logical thinking is one of the ways used in acquiring advanced mental activities (source). This ability is an application level activity that depends on the knowledge and understanding levels of the objective’s cognitive stage (source). Also, Logical thinking is used in evaluating ideas, experiences, and information. Our logic produces results relate to the topic we are interested in, and then it converts these to memory. Piaget, a Swiss psychologist who is known for this work on child’s development, defines lo- 76 M. Fadiana, S. M. Amin, A. Lukito, S. Aishah, A. Wardhono / JPII 8 (1) (2019) 75-80 gical thinking as an ability that is observed in the concrete and abstract operations stage (source). When students are in the concrete operations stage, they can use logical thinking abilities in solving concrete problems (source). In the abstract operations stage, students obtain the level of adults in terms of rational thinking. According to Demirel & Coşkun (2010) and Lee & Bednarz (2012)’s study, logical thinking includes effective use of numbers, finding scientific solutions to problems, differentiating among concepts, classification, generalizing, calculations, and providing hypotheses. Roadrangka states three developmental stages by utilizing the logical thinking levels including concrete, transitional, and formal (Roadrangka, 1995). It can also be referred to as providing us with information about an individual’s cognitive development level (source). The important question to ask is “Why is logical thinking important in science learning?” After several research studies, data shows that there is a significant relationship between abstract thinking and scientific process skills and success in chemistry (Oloyede, 2012). Students using abstract thinking tend to be more successful compared to those who do not because lowlevel reasoning will bring low-level performance (Oloyede, 2012; Piaget & Inhelder 2013). Science requires the skills of collecting and analyzing data to solve problems, to formulate hypotheses, to control variables and to define them operationally (source). Such processes require a high level of logical thinking ability. Proportional reasoning is important in the quantitative aspect of chemistry, specially to understand the origin and use of many useful relationships in chemistry, such as the development and interpretation of tabulation and graph data (Ruiz & Lupiáñez, 2009). In addition, correlational reasoning plays an important role in the formulation of hypotheses and data interpretations that needed to consider relationships between variables. The controlling variable has an important role in planning, implementation, and interpretation (source). Before designing the learning process in the classroom, the teacher should know the level of the students’ logical thinking abilities (Gómez, 2007). It is necessary for teachers to be able to design a learning strategy in accordance with the level of logical thinking ability of their students. As Othman et al. (2010) stated, many people fail to realize that logical thinking is among the most important factors in determining the qualifications of students in learning programs (Othman et al., 2010). Program development specialist have an important role to play by making special efforts to have a better understanding of the world for children and offer academic experience based on the children’s curiosity and demand (Simatwa, 2010). Assessment of the logical thinking capability can also be used as a basis for measuring the mastery of science materials (Fah, 2009). The mastery of science materials can be predicted based on logical thinking ability. This opinion is in line with the results of Oliva’s research (2003), students with high logical thinking ability can change their alternative conceptions more easily. Moreover, logical reasoning of learners makes a thinking style impact on the ability to solve Physics problems (Bancong, 2013; Etzler & Madden, 2014; Rakhmawan & Vitasari, 2016). However, the researcher often encountersthat teachers rarely measure students’ logical thinking skills before designing learning strategies to be undertaken. Therefore, the ability of logical thinking level of seventh grade students is examined. This study is very important in relations to defining logical thinking levels of students. This is useful for education experts and program development specialists because it affects various characteristics such as; the acquisition of scientific concepts, three-dimensional thinking, and scientific process skills (source). The researcher motivation for the study was to inspire and assist teachers in design learning strategies in the classroom. The researcher aims to answer the following research question: What is the fundamental difference found between the formal operational students, transitional students, and concrete operational students? METHODS This study uses the quantitative approach with non-experimental research designs in the form of surveys (Creswell, 2009). This survey method is used to obtain data in certain places where the datataken consists of natural data without treatment as well as experiments (Sugiyono, 2008).The sample that was used in this study was of 119 students (61 male and 58 female students) from a private junior high school in Tuban, East Java, Indonesia during the academic year 2016/2017. M. Fadiana, S. M. Amin, A. Lukito, S. Aishah, A. Wardhono / JPII 8 (1) (2019) 75-80 The instrument that was used in this research is Logical Thinking Test (LTT) which was revised from the Group Assessment of Logical Thinking (GALT). GALT was created to measure six different kind of reasonings; namely conservational reasoning, proportional reasoning, controlling variables, combinatorial reasoning, probabilistic reasoning, and correlational reasoning (Roadrangka et al., 1983). The test was formed to uses multiple answers while an image is inserted for each item to help visualize the problem (source). In order to acquire a valid and reliable research instrument, the researcher tested cogency and consistency of LTT. LTT has been validated by experts who are English Linguists, psychologists, science teachers, ad mathematics teachers. Efforts were made to confirm the content face validity of the adapted and translated version of the instrument. When the results were translated into Indonesian (the national language called Bahasa Indonesia) so that the respondents could then understand the items and choose the best results. LTT also administered a reliability test. The coefficient of reliability (KR-20) of the LTT is 0.83. Thus, the selection of question originating from a valid and reliable instrument resulted in the measurement of students’ logical thinking ability to be valid and reliable. The research sample was given LTT to be completed in 50 minutes. Students answer LTT on the answer sheet provided. Each type of operations is represented by each of the two questions, in which five types of reasonings in the form of multiple choices are accompanied by reasons. As for the type of combinatorial reasoning, participants must explain each possible answer. The answer for the item of LTT number 1-10 is correct if both the answer and reason are correct. For items number 11 and 12 (combinatorial reasoning), the students are required to write down the answers. The answer in the minimum score is 0. Next, the score is classified as follows; scores 0-4 are ground in concrete operational levels, 5-7 scores are grouped in transitional levels, and 8-12 are formal operational levels. 77 vels if they score 8-12. Based on the LTT that has been completed by the students, the writers obtain data, as shown in Table 1 Table 1. Logical Thinking Ability Levels of Seventh Grade Student at a Private Junior High School In Tuban, East Java, Indonesia Level of Logical Thinking Number of Student Percentage Formal Operational 2 1.68 % Transitional 12 10.08 % Concrete Operational 105 88.24 % Based on Table 1, (1.68%) of the seventh students were at theformal operational level, (10.08%) were at the transitional level and the remaining (88.21%) were at theconcrete operational level. The results of this study are consistent with the results of research from previous researchers. Bitner (1991) found thatseventh Grade students (n = 156), (5%) were at formal operational levels, 33% were at transitional levels, and 62% were at concrete operational levels. On the other hand, Promo and Fahey (1982) reported the results of their research, indicatingthat (3.5%) of seventhgrade students are at the formal operational levels. Similar results were also distint in the literature (Biber et al., 2013; Bulut et al., 2009; Şenlik, Balkan,& Aycan, 2011). Based on the measurement of students’ logical thinking ability classified according to the gender division of male and female, the results are presented as shown in Figure 1. RESULT AND DISCUSSION After the students have completed the LTT, the answer sheets are scored according to the scoring guidelines. Furthermore, the scores are derived from the three levels. If students attain a score of 0-4, then they are grouped into the category of concrete operational levels. If students score 5-7, they are grouped into transitional levels, and students are grouped into formal operational le- Figure 1. Logical Thinking Ability When Viewed Based on the Gender of Male and Female of Seventh Grade Student at a Private Junior High School in Tuban, East Java, Indonesia Based on Figure 1, the number of students who are at the formal operational levelsconsist 2 78 M. Fadiana, S. M. Amin, A. Lukito, S. Aishah, A. Wardhono / JPII 8 (1) (2019) 75-80 male students only. While at the transitional level there are 9 male and 3 female students. At the oncrete operational levels there are 50 male students and 55 female students. Based on the results of this study, the proportion of logical thinking ability between male and female students is similar. Researchers Yaman, Fah, Kincal and Tuna indicated that based on the gender variable, that was no major change in the mean for this variable (Yaman, 2005; Fah, 2009; Kıncal et al., 2010; Tuna et al., 2013). As an example, Fah (2009) in his research to investigate the logical thinking ability of Sabah Malaysian students whose population is 16 years old, (97.2%) of male respondents and (98.7%) of female respondents are classified as the transitional operational stage. This is consistent with the results of researcher Kincal, who declared that logical thinking when it came to gender variable that there was no major difference (Kincal et al., 2010). However, when it came to the variables of the different type of school such as; academic success, socio-economic background and socio-cultural background that there was a major different between the scores (Kincal et al., 2010). After all this research and data, it is safe to conclude that the gender variable provides a feeble and unreliable measure for any decision of logical thinking. However, when viewed per item matter, male students are superior to female students when it comes tocorrelational reasoning and probabilistic reasoning. By contrast, female students are superior to male students in this type of combinatorial reasoning.The results of the LTT analysis by the research subjects for each type of reasoning are presented in Table 2. Table 2. Logical Thinking Ability of Seventh Grade Students at a Private Junior High School In Tuban, East Java, Indonesia for Each Type of Reasoning Operation No. 1 2 3 4 Type of Operation Conservation Item No. Theme Number of Students with Correct Answers Percentage 1 Piece of Clay 38 31,9 % 2 Metal Weights 21 17, 6 % Correlational Reasoning 3 Glass Size 15 12, 6 % 4 Scale 22 18, 5 % Proportional Reasoning 5 Pendulum Length 20 16, 8 % 6 Ball 25 21 % Control Variable 7 Squares and Diamonds #1 13 10, 9 % 8 Squares and Diamonds #2 20 16, 8 % 5 Probabilistic Reasoning 9 The Mice 8 6,7 % 10 The Fish 6 5% 6 Combinatorial Reasoning 11 The Dance 86 7, 2 % 12 The Shopping Center 45 37,8 % Based on Table 2, the result indicates that combinatorial reasoning attains the highest average score, while reasoning reason has the lowest average score. Fah showed similar results, namely that the lowest are probability reasoning scores, but the highest scores can be found in the category of combinatorial reasoning (Fah, 2009). Yenilmez et al., (2005) and Sezen & Bülbül, (2011) scored highest in the reasoning of variable control, while the lowest was found in correlational reasoning (Yenilmez, 2005).This difference shows that the development of logical thinking ability of each student isdifferent, and obviously influenced by the environment that shaped it. There is a fundamental difference found in the results of the study between theformal and operational level of the students and the level of non-formal operations. In the case of proportional reasoning, concrete and transitional level operational students tend to use intuitive and additive reasoning rather than using rational reasoning in solving problems. In the problem of controlling variables, students in the categories oftheconcrete operational levels and transitional levels do not show an understanding of the relationship between manipulation and control. In solving probabilisticproblems, students at the concrete operational level and transitional level focus only on M. Fadiana, S. M. Amin, A. Lukito, S. Aishah, A. Wardhono / JPII 8 (1) (2019) 75-80 one or two dimensions of the problem (i.e., geometric rhombic shape and number of diamonds). They are unable to observe the characteristics of the object and understand the relationship between these characteristics on the problem of correlational reasoning. Students who are at the non-formal operational level (concrete and transitional operational level) have not been able to show the pattern and cannot solve all combinations in the problem of combinatorial logic. Roadrangka stated there is a correlation between formal operational reasoning capabilities and the student’s achievement in biology, physics, and chemistry (Roadrangka, 1995). At the formal operational stage, students scored higher in science, material science and science tests which was different for those who were a definite operational stage and understudies at the transitional operational stage (Roadrangka, 1995). This was declaring that students were unable to expand this understanding of theoretical ideas. Therefore, students who are successful in science would be certain by using different modes of formal operational reasoning (Tsaparlis, 2005; Tai et al., 2005; Lewis & Lewis, 2007; Fabelo et al., 2011). Lewis and Lewis highlighted the important needs to include a focus on the development of formal beliefs as well as content review in the way to help at-risk students in general chemistry classes (Lewis & Lewis, 2007). Therefore, in science and mathematics learning, bridges are needed to reduce the gap between formal operational and non-formal operational stages. Science and mathematics learning can make use of concrete learning media in order to make the abstract concept easier for students to understand. Interactive multimedia can encouragesuccess anda more advance thinking skill for science students today (Melida, 2014; Alimah, 2012; Hartini et al., 2017). In addition to using the media, teachers could apply cooperative learning methods. Cooperative learning methods will be improving the students’ logical thinking levels thus improving their performances (Othman et.al., 2010; Eskandar et al., 2013; Glen, 2013). CONCLUSION Assessment of students’ logical thinking at a private junior high school in Tuban, East Java Indonesia taken during Academic Year of 2016/2017 which consisted of 61 male and 58 female students had the following outcome; (1. 68%)are at formal operational levels, (10.08%) are at the transitional levels and (88.24%) are at concrete operational levels. The proportion of lo- 79 gical thinking among male and female students is similar. However, there is a difference between formal operational students and non-operational formal students (transitional students and concrete operational students) when it comes to relative thinking, control factors, probabilistic thinking, correlational thinking, and combinatorial thinking. It is important to recognize that since majority of the students that participated in this research are in the concrete operational period, there may have been influence from cultural variables, educational framework, and reading behaviors. Also, keeping in mind the grade level impact of coherent reasoning which could be from instructions and instructive dimension are in this manner likewise vital in intelligent reasoning. REFERENCES Alimah, S. (2012). Pengembangan Multimedia Pembelajaran Embriogenesis Hewan untuk Mengoptimalkan Pemahaman Kognitif Mahasiswa. Jurnal Pendidikan IPA Indonesia, 1(2). Bancong, H. (2013). Profil Penalaran Logis Berdasarkan Gaya Berpikir Dalam Memecahkan Masalah Fisika Peserta Didik. Jurnal Pendidikan IPA Indonesia, 2(2), 195-202. Biber, A. C., Tuna, A., & Incikabi, L. (2013). An Investigation of Critical Thinking Dispositions of Mathematics Teacher Candidates. Journal of International Educational Research, 9(3), 257-266. Bulut, S., Ertem, G., & Sevil, Ü. (2009). Examination of the Critical Thinking Levels of Nursing Students. Dokuz Eylül University Electronic Journal of School of Nursing , 2 (2), 27-38. Creswell, J. W. (2009). Mapping the Field of Mixed Methods Research. Demirel, M., & Coşkun, Y. D. (2010). Case Study on Interdisciplinary Teaching Approach Supported by Project Based Learning. The International Journal of Research in Teacher Education, 2(3), 28-53. Etzler, F. M., & Madden, M. (2014). The Test of Logical Thinking as a Predictor of First-Year Pharmacy Students’ Performance in Required First-Year Courses. American Journal of Pharmaceutical Education, 78(6), 121. Eskandar, F. A., Bayrami, M., Vahedi, S., & Ansar, V. A. A. (2013). The Effect of Instructional Analogies in Interaction with Logical Thinking Ability on Achievement and Attitude Toward Chemistry. Chemistry Education Research and Practice, 14(4), 566-575. Fabelo, T., Thompson, M. D., Plotkin, M., Carmichael, D., Marchbanks, M. P., & Booth, E. A. (2011). Breaking Schools’ Rules: A Statewide Study of How School Discipline Relates to Students’ Success and Juvenile Justice Involvement. New York: Council of State Governments Justice Center. 80 M. Fadiana, S. M. Amin, A. Lukito, S. Aishah, A. Wardhono / JPII 8 (1) (2019) 75-80 Fah, L. Y. (2009). Logical Thinking Abilities among Form 4 Students in the Interior Division of Sabah, Malaysia. Journal of Science and Mathematics Education in Southeast Asia, 32(2), 161-187. Fero, L. J., Witsberger, C. M., Wesmiller, S. W., Zullo, T. G., & Hoffman, L. A. (2009). Critical Thinking Ability of New Graduate And Experienced Nurses. Journal of Advanced Nursing, 65(1), 139148. Glen, N. J. (2013). Helping Pre-service Elementary Teachers Write Scientific Arguments and Understand Their Connection to the Common Core State Standards. Double Helix, 1. Gómez, P. (2007). Learning Processes in the Initial Training of High School Mathematics Teachers. Hartini, S., Misbah, M., Dewantara, D., Oktovian, R. A., & Aisyah, N. (2017). Developing Learning Media Using Online Prezi into Materials about Optical Equipments. Jurnal Pendidikan IPA Indonesia, 6(2), 313-317. Kıncal, R. Y., & Yazgan, A. D. (2010). Investigating the Formal Operational Thinking Skills of 7th and 8th Grade Primary School Students According to Some Variables. Elementary Education Online, 9(2), 723-733. Lee, J., & Bednarz, R. (2012). Components of Spatial Thinking: Evidence from a Spatial Thinking Ability Test. Journal of Geography, 111(1), 15-26. Lewis, S. E., & Lewis, J. E. (2007). Predicting At-Risk Students in General Chemistry: Comparing Formal Thought to a General Achievement Measure. Chemistry Education Research and Practice, 8(1), 32-51. Melida, D. (2014). Pengaruh Media Prezi the Zooming Presentations terhadap Hasil Belajar Fisika Siswa Kelas Xi Sma N 12 Padang. Pillar of Physics Education, 4(2). Oliva, J. M. (2003). The Structural Coherence of Students’ Conceptions in Mechanics and Conceptual Change. International Journal of Science Education, 25(5), 539-561. Oloyede, O. I., & Adeoye, F. A. (2012). The Relationship between Acquisition of Science Process Skills, Formal Reasoning Ability and Chemistry Achievement. International Journal of African & African-American Studies, 8(1), 1-4. Othman, M., Hussain, F. M., & Nikman, K. (2010). Enhancing Logical Thinking among Computer Science Students through Cooperative Learning. Gading Journal for the Social Sciences, 14(01). Piaget, J., & Inhelder, B. (2013). The Growth of Logical Thinking from Childhood to Adolescence: An Essay on the Construction of Formal Operational Structures. Routledge. Rakhmawan, A., & Vitasari, M. (2016). Kemampuan Berpikir Logis sebagai Prediktor Keberhasilan Mahasiswa dalam Perkuliahan Kimia Dasar. Jurnal Penelitian dan Pembelajaran IPA, 2(1), 99-109. Roadrangka, V., Yeany, R. H., & Padilla, M. J. (1983, April). The Construction of a Group Assess- ment of Logical Thinking (GALT). In 56th Annual Meeting of the National Association for Research in Science Teaching. Dallas, Texas, April (pp. 5-8). Roadrangka, V. (1995). Formal Operational Reasoning Ability, Cognitive Style, and Achievement in Biology, Physics, and Chemistry Concepts of Form 4 students in Penang, Malaysia. SEAMEO Regional Centre for Education in Science and Mathematics, Penang. Ruiz, EF, & Lupiáñez, JL (2009). Detection of Psychopedagogical Obstacles in the Teaching and Learning of the Topics of Reason and Proportion in Sixth Grade Students of Primary Education. Electronic Journal of Research in Educational Psychology , 7(17). Şengül, S., & Üner, İ. (2010). What is the Impact of the Teaching “Algebraic Expressions and Equations” Topic with Concept Cartoons on the Students’ Logical Thinking Abilities?. Procedia-Social and Behavioral Sciences, 2(2), 54415445. Şenlik, NZ, Balkan, Ö., & Aycan, Ş. (2011). Critical Thinking Skills of Prospective Teachers: Muğla University Example. Celal Bayar University Journal of Science , 7(1), 67-76. Sezen, N., & Bülbül, A. (2011). A Scale on Logical Thinking Abilities. Procedia-Social and Behavioral Sciences, 15, 2476-2480. Simatwa, E. M. (2010). Piagets Theory of Intellectual Development and Its Implication for Instructional Management at Pre-Secondary School Level. Educational Research and Reviews, 5(7), 366-371. Sugiyono. (2008). Metode Penelitian Pendidikan: (Pendekatan Kuantitatif, Kualitatif dan R & D). Alfabeta. Tai, R. H., Sadler, P. M., & Loehr, J. F. (2005). Factors Influencing Success in Introductory College Chemistry. Journal of Research in Science Teaching: The Official Journal of the National Association for Research in Science Teaching, 42(9), 987-1012. Tsaparlis, G. (2005). Non‐Algorithmic Quantitative Problem Solving in University Physical Chemistry: A Correlation Study of the Role of Selective Cognitive Factors. Research in Science & Technological Education, 23(2), 125-148. Tuna, A., Biber, A. Ç., & İncikapı, L. (2013). An Analysis of Mathematics Teacher Candidates’ Logical Thinking Levels: Case of Turkey. Journal of Educational Instructional Studies, 3(1), 83-91. Yaman, S. (2005). Effectiveness on Development of Logical Thinking Skills of Problem Based Learning Skills in Science Teaching. Journal of Turkish Science Education, 2(1), 31-33. Yenilmez, A., Sungur, S., & Tekkaya, C. (2005). Investigating Students’ Logical Thinking Abilities. Hacettepe University Faculty of Education Journal , 28(28), 219-225. International Journal on Emerging Mathematics Education (IJEME) Vol. 1, No. 2, September 2017, pp. 177-184 P-ISSN: 2549-4996, E-ISSN: 2548-5806, DOI: http://dx.doi.org/10.12928/ijeme.v1i2.5783 Creative Thinking Process based on Wallas Model in Solving Mathematics Problem 1Hevy Risqi Maharani, 2Sukestiyarno, 2Budi Waluya 1Mathematics Education Department, Sultan Agung Islamic University Education Department, Semarang State University e-mail: [email protected] 2Mathematics Abstrak Suatu hal yang sulit bagi guru SMP adalah memposisikan dan mengembangkan anak didiknya yang masih berada pada masa transisi dalam berpikir kreatif. Penelitian ini bertujuan untuk mengevaluasi proses berpikir kreatif siswa berdasarkan model Wallas. Penelitian ini ada penelitian deskriptif kualitatif menggunakan data triangulasi. Subjek dikategorikan pada kemampuan tinggi, sedang, dan rendah setelah diberikan tes kemampuan berpikir kreatif. Objek pada penelitian ini adalah eksistensi bagaimana siswa SMP dalam menyelesaikan masalah matematika. Data dianalisis melalui klasifikasi, representasi, dan kesimpulan. Hasil penelitian menunjukkan 1) terdapat 23,33% siswa tidak tuntas dan hanya mencapai tahap persiapan, disebut kelompok kategori rendah; 2) terdapat 60% siswa mencapai tahap iluminasi meskipun untuk sampai pada tahap ini siswa memerlukan waktu lama, disebut kelompok kategori sedang; dan 3) 16,67% siswa telah tuntas sampai tahap verifikasi, disebut kelompok kategori tinggi. Bagi siswa dengan kategori kemampuan rendah dan sedang masih membutuhkan pendampingan saat mengalami hambatan pada proses berpikir kreatifnya, sedangkan siswa kategori kemampuan tinggi membutuhkan materi pengayaan. Kata kunci: proses berpikir kreatif, masalah matematika, Model Wallas Abstract Developing student’s creative thiking is difficult for teacher, especially when they are still in transtition. This study aims to evaluate students’ creative thinking process based on the model of Wallas. This is a descriptive and qualitative research where data triangulation is employed. Subjects are categorized into upper, middle, and low category after doing creative thinking ability test. The object of the study is the existence of how junior high school’s students solving mathematics problems. Data were analyzed through classification, data representation, and conclusions. The results showed 1) 23,33% of students only reached preparation stage, called low category, 2) 60% of students reached illumination stage though students take a long time, called middle category, and 3) 16,67% of students have completed up to verification stage, called upper category. For students in low and middle category, they still need assistance when experiencing obstacles in the creative thinking process, while the upper category students need enrichment materials. Keywords: creative thinking process, mathematics problem, Wallas model How to Cite: Maharani, H.R., Sukestiyarno, & Waluya, B. (2017). Creative thinking process based on wallas model in solving mathematics problem. International Journal on Emerging Mathematics Education, 1(2), 177-184. http://dx.doi.org/10.12928/ijeme.v1i2.5783 INTRODUCTION The development of creative thinking is now expected to be the focus of mathematics education in which students are given the freedom to try to give original or new possible solutions from themselves (Kwon, Park, & Park, 2006). It means, learning mathematics should avoid the use of traditional learning methods that leads to convergent thinking in which students only remember mathematical theorems and Received February 7, 2017; Revised August 22, 2017; Accepted August 29, 2017 178 P-ISSN: 2549-4996 | E-ISSN: 2548-5806 rules to solve problems. In reality, what often happens in mathematics learning is students are given closed problems to be solved. Students are not taught to use divergent thinking and reasoning which is important for them to understand the context so that they give a positive response and actively participate in the learning process. Students who have a creative thinking not only uses mathematical knowledge they have acquired during the learning in solving the problem, but can use new strategies and unusual in solving their problems (Wessels, 2014). Pehkonen (1997) give four reasons why it is important to teach problem solving in relation to creative thinking. First, problem solving can develop cognitive skills. Second, problem solving encourages creativity. Third, problem solving is a part of the process to apply mathematics. And the last, problem solving encourages students to learn mathematics. In Bloom taxonomy, there are two dimensional frameworks: knowledge and cognitive process (Bloom, 1956; Krathwohl, 2002; Huitt, 2011; Bonaci, Mustata & Ienciu, 2013; Munzenmaler & Rubin, 2013; IACBE, 2014). Structure of the knowledge dimension includes factual, conceptual, procedural, and metacognitive knowledge. Structure of the cognitive process dimension includes remember, understand, apply, analyze, evaluate, and create. Create is the highest dimension on cognitive process. Students must have creative thinking ability to get it. Highlighting the sub areas of mathematics, especially geometry, the material consists of concepts that students need more attention to understand them. For instance, junior high school students need transition of thinking process from the concrete to the abstract. The study of Maharani & Sukestiyarno (2015) showed that the average student’s mathematics creative thinking ability on geometry were categorized in almost not creative. There were no students in creative or very creative category. Students in solving the problems still use one particular way that has been taught by their teacher. they were not familiar using variety of ways in solving a problem. The solution given by students were not categorized in a new or unusual idea. Very rare students gave the answer in any variety of ways. Creative thinking ability on the concept of geometry in junior high school cannot be separated from problem solving ability. In defining the relationship between creative thinking and problem solving, it is important to determine what makes creative of creative problem solving. Therefore, it is important to do an investigation of the creative thinking process (Aldous, 2007). One model of the creative thinking process is presented by Wallas (2014). The model consists of four distinct stages: preparation, incubation, illumination, and verification. In the preparation stage we collect a wide range of knowledge, share this knowledge into rules of logic in the realm of the investigation and adopted to define a given problem. During the incubation stage, various ideas freely grouped and rearranged without students directly working on the problem. This stage requires a few seconds, minutes, or hours depending on the difficulty of the problems encountered. Usually when the solution is found, the illumination stage has arrived which is often called as the ‘aha’ experience. At the end, the obtained solution need to be checked, developed, and refined in the verification stage and elaborated to ensure the solutions is understandable. If the verification stage showed a solution to be not feasible then there may be a return to the beginning of creative process. Although preparation and verification stages are included in the conscious activity, incubation and illumination stages are in unconscious activity. In school, students have differences from the other students in many various domains. They have different levels of motivation, background, attitude towards teaching IJEME, Vol. 1, No. 2, September 2017, 177-184 IJEME ISSN: 2549-4996 179 and learning, as well as different responses to the classroom environment and certain instructional practices (Potur & Barkul, 2009; Runisah, Herman, & Dahlan, 2017). As a result, students have differences in creative thinking process. The purpose of this study is to evaluate student’s creative thinking process in solving mathematics problems. If the teachers know the process of creative thinking, the teacher can give the specific assistance to students and can select model or method that appropriate for students. The framework of this study is junior high school students who are in a transition of creative thinking process. In Sari & Sukestiyarno (2014), Widiastuti & Sukestiyarno (2014), and Wulandari & Sukestiyarno (2015), they suggest that in order to achieve the goal of students such as the mastery of cognitive ability in creative thinking, they must be trained to perform conditional affective aspects of creative that is focused under a specified time period. The model of Wallas is used to measure and determine student’s position in creative thinking stages. Therefore the following study is conducted under the research aims of describing the profil of student’s creative thingking in solving mathematics problems based on Wallas model. RESEARCH METHOD This study uses a descriptive qualitative approach. Variable of this study is junior high school student’s creative thinking process in solving mathematics problem. Data were collected through tests of mathematics problem solving that are evaluated based the model of Wallas (2014). Subjects were students of class VII Sultan Agung Junior High School. They were selected through purposive sampling technique that is based on several criteria. The first criterion is based on the level of student’s creative thinking ability. The number of subjects at each level contains a minimum of two students. The second criterion is to choose students who have a good communication skill. The creative thinking process of students in solving mathematics problems is observed based on Wallas model. The procedure of this study as follows: 1) provide a test of mathematics problem solving to students; 2) analyze the results of students in solving mathematics problem to identify the abilities of students in creative thinking; 3) conduct interviews for students to know their creative thinking process in solving mathematics problem; 4) analyze the results of interviews. RESULTS AND DISCUSSION Students were given a mathematics problem in geometry which is open question where some alternative answers are possible. The results of the analysis of student’s answers in solving mathematics problems showed that 5 students have mathematics creative thinking ability in upper category (16,67%), 18 students have mathematics creative thinking ability in middle category (60%), and 7 students have mathematics creative thinking ability in lower category (23,33%). Furthermore, in each category of creative thinking ability, two representatives were interviewed. For students in upper category, they went through the stages of creative thinking process of Wallas very well. When students are trying to find alternative answers, they always taking into account the available time to convince whether the answer is correct. Controlling the answer is something that is very important to them. Figure 1 is an example of the creativity of the students to arrange several plane figures that have same areas with an area of a given plane figure. The first alternative, student made arrangement of four rectangles with a length is 15 and width is 4. In the second alternative, student can make arrangement some plane figures that consist of rectangle and triangle shown in Figure 1. Creative Thinking Process based on Wallas Model in Solving Mathematics Problem Maharani, Suketiyarno, & Waluya 180 P-ISSN: 2549-4996 | E-ISSN: 2548-5806 Figure 1. Sample of student work in upper category The results of interviews with student in upper category as follow. Question : With the available time, you made two alternative answers were correct. What did you do? Answer : First, I made a plane figure arrangement which has the same area with the given plane figure. After I found the right answer, I checked out a minute later and then I looked for another arrangement. Question : Did you found it difficult to determine the area of plane figure arrangement? Answer : No, I didn’t. Because I have ever did the same problem to find the area of some plane figures. For students in middle category, they can solve the problems by producing an alternative answer. It can be seen in Figure 2. Figure 2. Sample of the Student Work in Middle Category IJEME, Vol. 1, No. 2, September 2017, 177-184 IJEME ISSN: 2549-4996 181 The results of interviews with student in middle category as follow. Question : With the available time, you made two alternative answers were correct. What did you do? Answer : I lack of attention to the available time. I was busy tried to find the plane figure arrangement. Question : Did you have time to verify your answer? Answer : Because I was in a rush in doing, I didn’t have time to check again my answer. Finally, for students in lower category, they have difficulty to determine the plane figure arrangement. They just showed streaks calculation, but they haven’t made a specific plane figure arrangement in their answer sheet. The results of interviews with student in lower category as follow. Question : With the available time, you didn’t make an alternative answers were correct. What did you do? Answer : I feel confused. I had difficulty to find the plane figure arrangement. Result of the student work in upper category, on the answer sheet show that they can understand the given problem well. This means that the preparation stage has been done, where they have knowledge to solve the problem. In incubation stage, they tried to think of the arrangement in a few minutes. Until the illumination stage, they immediately make some alternative arrangements, apply their idea and made a plane figure arrangements. In verification stage, they look back at their works and then after they believe that the answer is right, they make the other plane figure arrangements. This is where students do creative thinking process stages Wallas well. As stated in the interview, that they have become familiar with the problem relating to the area of plane figure. This is inline with results of previous studies that students who arrived at the expected goals namely creative thinking ability, need to be given a habituation to each stage of creative thinking process. On the other hand, the events that occurred on the student in middle category, they did imperfection in the stages of creative thinking process based on Wallas. There is an emotional aspect that brings students to become unfocused in the creative thinking process. Students can understand the given problem in the preparation stage. But in incubation stage, students think a little longer to finally find an arrangement while they make calculation first in the other sheet. After finding the appropriate plane figure arrangement (reach the illumination stage), they make the arrangement in the answer sheet. When the answer has been completed, they didn’t check the results again and continue to think the other arrangement. This was because they require a longer time to find the arrangement. They have tried to make some arrangements, but the results are not right so they try to change the arrangement to another arrangement. The student’s work in lower ability of creative thinking indicates that students have difficulty in determining the area of plane figure arrangement. Student’s answer sheet only shows streak of calculation, but they haven’t made a specific plane figure arrangement. This is where students really have difficulties in solving mathematic problem. With more intensive scaffolding, teacher will be able to help them. Based on student’s mathematics creative thinking ability in solving mathematics problems, students suggest that their ability are still low. Majority of students did not Creative Thinking Process based on Wallas Model in Solving Mathematics Problem Maharani, Suketiyarno, & Waluya 182 P-ISSN: 2549-4996 | E-ISSN: 2548-5806 fulfill all the stages of creative thinking process well enough. This was because they experienced obstacles to go beyond each stage in creative thinking process. Students in lower category have experienced obstacles at early stage, while students in middle category have experienced obstacles in the verification stage. Only students in upper category who were able to go beyond every stage of creative thinking well. The result of the deeper analysis of student’s work and interviews show that the process of student’s creative thinking in solving mathematics problem based on the model of Wallas are divided into three categories. It can be seen in Table 1. Analysis from the interviews showed that each student of three categories have different characteristics in each stage of Wallas model. This result shows that students in upper category do not have a difficulty in every stage of creative thinking process. They can solve the problem clearly and can provide various ways to answer. Students also can explain the solution clearly and detailly. Students in middle category in illumination stage show that they always try to solve a problem in any variety of ways. But sometime they make an error when applied their idea and they change or make a new solution, when in fact they can correct the solution without looking for the new solution. Table 1. Creative thinking process in solving mathematics problems based on wallas model Wallas’s Stage Preparation Incubation Illumination Verification Upper Category of Creative Thinking Ability Students understood the problems and could communicate information obtained well and use their own language Middle Category of Creative Thinking Ability Students were able to understand the problems, but in communicating information obtained students still use the problems language Lower Category of Creative Thinking Ability Students didn’t understand the problems and less able to communicate any information obtained Students tend to stop for a moment to observe a given problems and remember the material that they learn previously Students applied the idea to solve the problem with confidence and have the correct solution Students tend to stop for a moment to observe the given problems and try to make scribble on a blank paper Students applied the idea to solve the problem and the solution was generally true Students tend to stop a bit longer and think about what to do Students rechecked Students the solution. When rechecked finding an error they solution. IJEME, Vol. 1, No. 2, September 2017, 177-184 Students didn’t clear in implementing the idea to solve the problem and the solution obtained was wrong Student didn’t the recheck the When solution because IJEME ISSN: 2549-4996 try to correct the finding an error solution until get the they tend to right solution replace or make other solution 183 they feel they did the correct solution, so they didn’t correct the solution that still wrong Students in lower category from the beginning in preparation stage, they already have difficulties. They do not understand with the given mathematics problems and tasks. Beside, they do not have any information or knowledge that can be used to solve the problems. Consequently, students are unclear in implementing the idea to solve the problem and the solution obtained is also wrong. This result is in accordance with Potur & Barul (2009) that state students differ from one another in levels of motivation, backgrounds, attitudes towards teaching and learning, as well as the different responses to the classroom environment and certain instructional practices. So, students have different process in creative thinking too. In this study, students have differences in their creative thinking process. CONCLUSION Creative thinking ability is an ability that is important for everyone, not just when studying at school, but also when face with the world of work. In mathematics learning especially in geometry for junior high school, teachers and researchers can develop learning involves divergent thinking so as to enhance the student’s creative thinking ability. Teachers can provide more opportunities for students to explore answers or solution that can lead to a creative thinking process. Based on the model of Wallas (2014), has four stages of creative thinking process that include preparation, incubation, illumination, and verification. Results of this study, students were given a mathematics problem to solve in order to measure the ability of creative thinking. The results showed 1) 23,33% of students only reached preparation stage, called the low category, 2) 60% of students reached illumination stage though to arrive at this stage students takes a long time, called middle category, and 3) 16,67% of students have completed up to verification stage, called upper category. Students in lower category have difficulty to solve a mathematics problem. Therefore they needs more intensive guidances from teachers to assist them in performing each stage in the creative thinking process. For students in middle category teachers need to provide a little guidance and motivation to students when they did an error in solving the problem until they find the right solution. Students who have in upper category have been through the creative thinking process fluently, so teachers need to provide further enrichment materials. REFERENCES Aldous, C.R. (2007). Creativity, problem solving and innovative science: Insights from history, cognitive psychology and neuroscience. International Education Journal, 8(2), 176-186. Bloom. B.S., et al. (1956). Taxonomy of Educational Goals. Canada: David McKay Company, Inc. Bonaci, C.G., Mustata, R.V. & Lenciu, A. (2013). Revisiting bloom’s taxonomy of educational objectives. The Macrotheme Review, 2(2), 1-9. Creative Thinking Process based on Wallas Model in Solving Mathematics Problem Maharani, Suketiyarno, & Waluya 184 P-ISSN: 2549-4996 | E-ISSN: 2548-5806 Huitt, W. (2011). Bloom et al.’s Taxonomy of the Cognitive Domain. Educational Psychology Interactive. Valdosta, GA: Valdosta State University. IACBE. (2014). Bloom’s Taxonomy of Educational Objectives and Writing Intended Learning Outcomes Statements. USA: The International Assembly for Collegiate Business Education. Krathwohl, D.R. (2002). A revision of Bloom’s Taxonomy: An overview. Theory Into Practice, 41(4), 212-218. Kwon, O.N., Park, J.S., & Park, J.H. (2006). Cultivating divergent thinking in mathematics through an open-ended approach. Asia Pacific Education Review, 7(1), 51-61. Maharani, H.R., & Sukestiyarno. (2015). Learning analysis based on humanism theory and mathematics creative thinking ability of students. International Journal of Education and Psychology, 1(2), 11-18. Pehkonen, E. (1997). The state-of-art in mathematical creativity. ZDM, 29(3), 63-67. Potur, A.A., & Barkul, O. (2009). Gender and creative thinking in education: A theoretical and experimental overview. Journal of ITU A| Z, 6(2), 44-57. Runisah, Herman, T., & Dahlan, J.A. (2017). Using the 5E learning cycle with metacognitive technique to enhance students’mathematical critical thinking skills. International Journal on Emerging Mathematics Education, 1(1), 87-98. Sari, E.P. & Sukestiyarno. (2014). Pemanfaatan media lingkungan dalam pembentukan karakter kemandirian siswa pada pembelajaran matematika model PBL pendekatan konstruktivisme. Proceedings of the National Conferenfe on Conservation for Better Life (NCCBL), Semarang State University, pp. 350-357. Wallas, G. (2014). The Art of Thought. England: Solis Press. Wessels, H. (2014). Levels of mathematical creativity in model-eliciting activities. Journal of Mathematical Modelling and Application, 1(9), 22-40. Widiastuti, M.M.H., & Sukestiyarno. (2014). Pemanfaatan lingkungan sekolah guna menunjang pembelajaran matematika dengan model cycle 7E dalam rangka membentuk karakter rasa ingin tahu siswa dan pemecahan masalah. Proceedings of the National Conferenfe on Conservation for Better Life (NCCBL), Semarang State University, pp. 331-338. Wulandari & Sukestiyarno. (2015). Development of ATONG based reference module for school geometry subject and analysis of mathematical creative thinking skills. Paper of the International Conference on Mathematics, Science, and Education (ICMSE), Semarang State University. IJEME, Vol. 1, No. 2, September 2017, 177-184 International Journal on Emerging Mathematics Education (IJEME) Vol. 1, No. 1, March 2017, pp. 1-10 P-ISSN: 2549-4996, E-ISSN: 2548-5806, DOI: http://dx.doi.org/10.12928/ijeme.v1i1.5507 Development of Student Worksheets to Improve the Ability of Mathematical Problem Posing 1,2Harry Dwi Putra, 2Tatang Herman, 1,2Utari Sumarmo 1STKIP Siliwangi, Jl. Terusan Jend. Sudirman No. 3 Baros, Cimahi Tengah 40526 2Universitas Pendidikan Indonesia, Jl. Dr. Setiabudhi No. 299, Bandung 40154 Email: [email protected] Abstrak Aktivitas mengajukan masalah (problem posing) dan menyelesaikannya merupakan bagian penting dari matematika, sehingga problem posing merupakan aktivitas yang sangat berperan dalam berpikir matematis dan menjadi bagian yang penting dalam pemecahan masalah. Jenis penelitian ini adalah research and development yang bertujuan untuk mengembangkan lembar kerja siswa mengenai aturan pencacahan dengan pendekatan scientific disertai strategi what if not. Metode penelitian terdiri dari: studi pustaka, observasi, wawancara; pengembangan bahan ajar, penilaian dari dua ahli, dan uji coba terbatas pada siswa kelas XI di SMAN 2 Cimahi, SMAN 3 Cimahi, dan SMAN 4 Cimahi. Berdasarkan hasil penelitian, diperoleh lembar kerja siswa yang memuat aktivitas mengamati, menanya, mencoba, menalar, dan menyimpulkan. Pada aktivitas menanya menggunakan strategi what if not dengan cara merubah data, menambah data, mengubah data dengan pertanyaan yang sama, atau mengubah pertanyaan dengan data yang sama. Dalam lembar kerja disajikan masalah kontekstual yang sesuai dengan pengalaman siswa. Setelah dilakukan uji coba menggunakan lembar kerja, peningkatan kemampuan mathematical problem posing siswa pada ketiga sekolah berada pada kriteria sedang. Kata Kunci: lembar kerja siswa, pendekatan scientific, strategi what if not, mathematical problem posing Abstract Activities posing and solving a problem are important parts in mathematics, so the problem posing is an activity that was instrumental in mathematical thinking and become an important part in solving the problem. This type of research is research and development that aims to develop student worksheets through scientific approach with what if not strategy. The research method consists of stages: literature study, observation, interviews, development of student worksheets, assessment of two experts, and restricted trial to the eleventh grade students in SMAN 2 Cimahi, SMAN 3 Cimahi, and SMAN 4 Cimahi. Based on the findings, it was concluded that the student worksheets through scientific approach with what if not strategy load some activities including observing, questioning, trying, reasoning, and concluding. The activity of questioning using what if not strategy involves changing data, adding data, changing data with the same question, or changing question with the same data. The student worksheets also presented some contextual problems in accordance with the experience of students. After tested using worksheet, the improvement of student mathematical problem posing in the three schools was in moderate criteria. Keywords: student worksheets, scientific approach, what if not strategy, mathematical problem posing How to Cite: Putra, H.D., Herman, T., & Sumarmo, U. (2017). Development of student worksheets to improve the ability of mathematical problem posing. International Journal on Emerging Mathematics Education, 1(1), 1-10. http://dx.doi.org/10.12928/ijeme.v1i1.5507. INTRODUCTION The heart of mathematics is to pose problems and solve them (Brown & Walter, 1990). Therefore, the problem posing activity become an important role in Received December 28, 2016; Revised February 23, 2017; Accepted February 27, 2017 2 P-ISSN: 2549-4996 | E-ISSN: 2548-5806 mathematical thinking and become an important part on problem solving. Based on some findings, Winograd (1997) suggested that giving assignments to the students to create questions could improve the ability to solve problems and their attitudes toward mathematics. In line with this finding, English (1998) stated that the problem posing could improve thinking skills, problem solving, attitude, confidence in solving problems, and generally contribute to the understanding of mathematical concepts. To acquire the ability of problem solving, students should have many experiences in solving various problems (Herman, 2007). Mathematics learning in senior high school should load tasks to formulate mathematical problems based on various situations, both inside and outside mathematics, organize and find conjecture, and learn to generalize and extend the problem through problem posing (NCTM, 2000). To develop the ability of mathematical problem posing, we required the ability to creating new questions, creating new opportunities, and looking at old questions from a new angle (Ellerton & Clarkson, 1996). This opinion is supported by Silver (2013) who states that the relationship between creativity and posing problems does not stand alone but affects each other. After made questions, the students were asked to check the accuracy of the questions answered. These activities could train the students to think reflectively. Problem posing is not only seen as a purpose of learning but also as a tool that can provide experiences for students to find and to create mathematical problems. This experience is still a bit possessed by students. According to the findings of Van Harpen & Sriraman (2013) advanced high school students had trouble posing good quality and novel mathematical problem. Joo & Han (2016) revealed that some senior high school students had difficulties in posing problem or limited understanding of that. Igor (2016) found that the problem posing performance of senior high students and the teachers was the low quality of the created problems. Therefore, the ability of problem posing should be trained to the students in order to develop the ability to think, to be skillful to troubleshoot, and expand the understanding of the concept. If the teacher presents a challenging problem and requires students to think, then the situation can provide greater opportunities for students to develop critical thinking. In the curriculum of 2013 emphasizes on a contextual approach becomes the process approach as strengthening of the use of scientific approach. The scientific approach is one of the approach that facilitates students to acquire knowledge and skills based on a scientific method: observing, questioning, reasoning, trying, and forming networks (concluding, presenting, and communicating). Students are directed to process knowledge, discover and develop their own concepts with regard to the subject matter so that it provides an opportunity for students to cultivate high-level thinking skills (Kemdikbud, 2013). Brown & Walter (1990) stated that generally mathematical thinking begins with given statements so that we just trained to resolve the issue of the statement. However, it should be better if we give another statement rather than just accept the statement. In mathematical thinking, posing a problem is better than just finishing that problem. One of the learning strategies that can develop the high-level ability of mathematical thinking is what if not strategy. This strategy can develop mathematical problem posing abilities through activities of students in analyzing problems, contrasting conditions on the problem, and checking the correctness of completion. Based on the explanation above, it seems that the ability of mathematical problem posing for students is so important that teachers need to develop student worksheets through appropriate learning approaches and strategies in order to IJEME, Vol. 1, No. 1, March 2017, 1-10 IJEME ISSN: 2549-4996 3 improve the ability of the high-level mathematical thinking. The research questions of this study are formulated as follows: How to develop student worksheets through a scientific approach with what if not strategy to improve the students’ ability of mathematical problem posing? What is the impact of the use of student worksheets through a scientific approach with what if not strategy on the students’ ability of mathematical problem posing? RESEARCH METHOD This research is a research and development that produces student worksheets through scientific approach with what if not strategy to improve the ability of mathematical problem posing for students in senior high school. The procedures of this research include (Sukmadinata, 2012): a field survey to obtain information from the teachers in a certain school about mathematical problem posing abilities of the students. Literature review to examine the student worksheets that will be developed through a scientific approach and strategy of what if not. Drafting of student worksheets that will be assessed by a team of experts. The worksheets that had been validated were then tested to eleventh grade students from three schools, which have high, medium, and low criteria of senior high school namely SMAN 2 Cimahi, SMAN 3 Cimahi, and SMAN 4 Cimahi. Research procedure shown in the following Figure 1. Figure 1. The Procedure of Learning Materials Development Research Development of student worksheets to improve the ability of mathematical problem posing Putra, Herman, & Sumarmo 4 P-ISSN: 2549-4996 | E-ISSN: 2548-5806 RESULTS AND DISCUSSION Based on literature review, it can be obtained some indicators of the ability of mathematical problem posing to the material of enumeration rules. In making question dealing with the ability of mathematical problem posing for enumeration rules pertaining to matter, it refers to the following indicators (Sumarmo, 2015). Draw up a new question regarding to the problem of enumeration rules. Express a problem into another form that has the same meaning with the material of cyclic permutation. Ask questions of a series of mathematical semi-structured information with respect to the ability to think reflectively on the material of permutations. Specify the questions about the material of combination into question parts. Ask questions before, during, and after solving problems with regard to the material of chances. According to interviews with teachers in SMAN 2 Cimahi, SMAN 3 Cimahi, and SMAN 4 Cimahi, students still had difficulty in solving the problems of permutations and combinations within the rules enumeration. They were confused on using permutation or combination formula to solve the problems. Presentation of the material from the book makes students follow the example of completion of the given problem, so that when the questions were changed, they become confused to define the concepts to use whether it is permutation or combination. In lessons, students rarely ask questions. That was only teachers who posed the question to the students. This shows that the students' ability to pose a problem was undeveloped. One of the appropriate approaches is scientific approach with what if not strategy. In the opinion Brown & Walter (1990) to develop the ability of problems posing, we can use a what if not strategy by changing the data in question, adding data on the problem, changing the data with the same question, or changing the question with the same data on the problem. Based on observations and interviews conducted to teachers at those three schools, it can be concluded that the worksheets need to be developed through a scientific approach with what if not strategy to improve the ability of mathematical problem posing of high school students. The initial design of student worksheets includes three parts. The front part consists of a title page, preface, and a table of contents. Displays the title pages of the student worksheets are made. The body section consists of a crosshead of the enumeration rules that represent the content of the chapter: rule multiplication, permutations, combination, Binomial Newton, and Chance. At the beginning of the presentation of the material given the problems that exist in daily life of the students were resolved by the given instructions to find the concept. The conclusion sections, students give reflection and evaluation of the presentation material on worksheets that they used to work on in order to make improvements. Student worksheets were adjusted to the scientific approach that includes activities to observe, ask, try, reason, and concludes. These activities can vary the sequence. Figure 2. Observation Activity on Worksheets IJEME, Vol. 1, No. 1, March 2017, 1-10 IJEME ISSN: 2549-4996 5 In the observation, problems were presented as an introduction to the concepts to be learned. In the figure 2, students observe the problems regarding the bank officer who want to print the costumer’s queue number consisting of three numbers of 1, 2, 3, and 4. The question is how much the queue number choices can be made from the first three digits? How much the queue number choices can be made from the four numbers are available? Figure 3. Try Activity on Worksheets In the stage of a trying, students were asked to solve problems in worksheets. Students are given instructions in solving problems. A lot of the order queue number consisting of three numbers of 1, 2, and 3 are 6 composition. Whereas, a lot of the order number of the queue consisting of three numbers of 1, 2, 3, and 4 are 24 order. Based on the instructions, students try to adjust the answer. When students answered not in accordance with the instructions, they will reflect on their mistakes. Figure 4. Reasoning Activity on a Worksheets In the reasoning activity, students are directed to find an answer through the concept of factorial. Students examine the truth of the answers by providing an explanation. The number 6 can be converted into a pattern of 3 2 1. Any number Development of student worksheets to improve the ability of mathematical problem posing Putra, Herman, & Sumarmo 6 P-ISSN: 2549-4996 | E-ISSN: 2548-5806 3 2 1 . Based on the concept 1 3 2 1 3! of factorial, 3 2 1 pattern can be written as 3! so . Karena 1! equal to 1 1! 3! 3! . If students can explain the truth of the answers, it means that (3 3)! so 1! (3 3)! the students already understand the concept of factorial. divided by 1 the result is the number itself, so 3 2 1 Figure 5. Questioning Activity on a Worksheets In the questioning activity, students ask two more questions about these problems. Then, students choose one question and answer it. Students can use what if not strategy to ask questions. When a bank officer would like to print the costumer’s queue number consists of 4 numbers from 1, 2, 3, 4, and 5. The questions are: 1. How much the queue number choices can be made from the first four digits? 2. Is it true that a lot of the order that can be made from 5 digits available is 120 compositions? Figure 6. Concluding Activity on a Worksheets IJEME, Vol. 1, No. 1, March 2017, 1-10 IJEME ISSN: 2549-4996 7 In the conclusion activity, students make inferences about the concept was studied. Students understand that many compositions can be formed from 3 different numbers of 3 numbers available are called permutations 3 elements are indicated by 3! the symbol P(3,3) 3 P3 P33 3! . Many compositions can be formed from 3 (3 3)! different numbers of 4 number available are indicated by the symbol 4! P(4,3) 4 P3 P34 . In conclusions, a lot of permutations of k elements is k ! (4 3)! and a lot of permutations of k elements take from n elements available is n! . P (n, k ) n Pk Pkn (n k )! Assessment of student worksheets according to a team of experts from the aspect of the contents had fulfilled the five elements in the scientific approach: observing, questioning, trying, reasoning, and concluding in teaching the concept of problem. From the display of student worksheets, it looks simple. Display of student worksheets had already developed a mathematical problem posing abilities of students in asking questions about new problem or developing a given problem. Based on the results of the assessment team of experts, these materials were already eligible tested on students. Here is presented the assessment of the experts. Tabel 1. Feasibility test result of the student worksheets Feasibility Criteria (%) No. Experts Content Display Language 1. Validator 1 83,33 85,94 81,67 2. Validator 2 79,19 82,81 80,00 Mean 81,25 84,38 80,83 Based on Table 1, it seems that the average ratings of student worksheets by both the validator based on the criteria of content and language is between 63.50%83.00% that means it is feasible. While the criteria of display is above 83,00% which means it is more feasible. Therefore, worksheets are feasible to be tested on students. Student worksheets that had been assessed shown to teachers at three schools to get suggestion for further development. The worksheets can be tested, which is in eleventh grade students in Mathematics and Science classes of SMAN 2 Cimahi, SMAN 3 Cimahi, SMAN 4 Cimahi. Student worksheets that had been validated are given to students in learning activity nine meetings. Students were given an explanation regarding to the presentation of worksheets so that they can solve the problems properly. There are five scientific activities that must be done by students: look into the subject, ask questions about the problems with the strategy of what if not, try to solve the problem, reason against the troubleshooting steps, and make conclusions from the material they had learned. After finishing studying all the material of enumeration rules in student worksheets, they were asked to fill a scale of opinion regarding to the presentation of the worksheets they had learned. Percentage score of opinion scale of students from three schools toward the student worksheets were 67.86%, 65.79% and 65.14%. These results were between 61.00% - 80.00% which means that students had positive opinion toward worksheets. Students agree that the worksheets contain the task of asking questions. Student worksheets can increase their confidence to draw the questions and solve them. Table Development of student worksheets to improve the ability of mathematical problem posing Putra, Herman, & Sumarmo 8 P-ISSN: 2549-4996 | E-ISSN: 2548-5806 2 below presented the results of the pretest, posttest, and enhancement (N-gain) of the students. Ideal Maximum score is 14. Tabel 2. Result of pretest dan posttest of mathematical problem posing ability School Total Pretest Posttest N-gain Criteria SMAN 2 Cimahi 31 4,31 10,73 0,67 Moderate SMAN 3 Cimahi 35 4,51 10,48 0,63 Moderate SMAN 4 Cimahi 37 4,29 10,16 0,60 Moderate Based on Table 2, it is shown that an improvement of the ability of mathematical problem posing of the students from the third grade is moderate. Students were not accustomed to drafting a new question, stating the problem in another form with the same meaning, asking questions with regard to determining and verifying the relevant data, scanting details into the form of parts, as well as asking questions before, during, and after troubleshooting. Through this student worksheets, it can be said that the ability of mathematical problem posing of students was increased quite well. CONCLUSION Based on the results of the assessment team of experts and the restricted trial of student worksheets, it can be concluded that student worksheets contained the activities of observing, questioning, trying, reasoning, and concluding in accordance with the principles of the scientific approach. What if not strategy was applied to the activities of questioning, by changing the data, adding data, or changing the question. The designs of student worksheets consist of three parts. Beginning consists of a title page, preface and table of contents. The content section containing the title of the chapter on the rules of the enumeration and contextual problems that exist in the students’ life. Closing as a reflection and evaluation of the material presented on student worksheets for the next revision. Student worksheets could increase the ability of mathematical problem posing in the moderate criteria. ACKNOWLEDGEMENT The author would like to extend thanks to the chair and staffs of the Ministry of Research Technology and Higher Education (KEMENRISTEK DIKTI) who has helped funding this research. REFERENCES Brown, S.I., & Walter, I. (1990). The Art of Problem Posing (2nd ed.). Hillsdale, NJ: Lawrence Erlbaum Associates. Ellerton, N.F., & Clarkson, P.C. (1996). Language Factor in Mathematics Education. Alphen aanden Rijn: Kluwer Academic Publisher. English, L.D. (1998). Children's problem posing within formal and informal contexts. Journal for Research in Mathematics Education, 29(1), 83-106. Herman, T. (2007). Pembelajaran berbasis masalah untuk meningkatkan kemampuan berpikir matematika tingkat tinggi siswa sekolah menengah pertama. Educationist, 1(1), 47-56. Igor, K. (2016). Considerations of aptness in mathematcial problem posing: students, teacher, and expert working on billiard task. Far East Journal of Mathematical Education, 16(3), 243-260. IJEME, Vol. 1, No. 1, March 2017, 1-10 IJEME ISSN: 2549-4996 9 Joo, H., & Han, H. (2016). The analysis of middle school students' problem posing type and strategies. The Mathematical Education, 55(1), 73-89. Kemdikbud. (2013). Pendekatan Scientific (Ilmiah) dalam Pembelajaran. Jakarta: Pusbagprodik. Kontorovich, I. (2016). Considerations of aptness in mathematcial problem posing: students, teacher, and expert working on billiard task. Far East Journal of Mathematical Education, 16(3), 243-260. NCTM. (2000). Principles and Standards for School Mathematics. Reston, VA: National Council of Teacher Mathematics. Silver, E.A. (2013). Problem posing reseach in mathematics education: looking back, looking around, and looking ahead. Educational Studies in Mathematics, 83(1), 157-162. Sukmadinata, N.S. (2012). Metode Penelitian Pendidikan. Bandung: PT Remaja Rosdakarya. Sumarmo, U. (2015). Mathematical Problem Posing: Rasional, Pengertian, Pembelajaran, dan Pengukurannya. Retrieved from STKIP Siliwangi: http://utarisumarmo.dosen.stkipsiliwangi.ac.id/files/2015/09/Problem-Posing-MatematikPengertian-dan-Rasional-2015.pdf Van Harpen, X.Y., & Sriraman, B. (2013). Creativity and mathematical problem posing: an analysis of high school students' mathematical problem posing in China and the USA. Educational Studies in Mathematics, 82(2), 201-221. Winograd, K. (1997). Ways of sharing student-authored story problems. Teaching Children Mathematics, 4(1), 40-49. Development of student worksheets to improve the ability of mathematical problem posing Putra, Herman, & Sumarmo 10 IJEME, Vol. 1, No. 1, March 2017, 1-10 P-ISSN: 2549-4996 | E-ISSN: 2548-5806 ISSN 2087-8885 E-ISSN 2407-0610 Journal on Mathematics Education Volume 11, No. 1, January 2020, pp. 59-76 ELEMENTARY PRESERVICE TEACHERS’ KNOWLEDGE, PERCEPTIONS AND ATTITUDES TOWARDS FRACTIONS: A MIXED-ANALYSIS Roslinda Rosli1, Dianne Goldsby2, Anthony J. Onwuegbuzie3, Mary Margaret Capraro2, Robert M. Capraro2, Elsa Gonzalez Y. Gonzalez4 1Universiti Kebangsaan Malaysia, UKM Bangi, Malaysia A&M University, College Station, Texas, United States 3Sam Houston State University, Huntsville, Texas, United States 4University of Houston, Texas, United States Email: [email protected] 2Texas Abstract Previous research has shown knowledge, perceptions, and attitudes are essential factors during mathematics classroom instruction. The current study examined the effects of a 3-week fraction instructional unit using concrete models, problem-solving, and problem-posing to improve elementary preservice teachers’ knowledge, perceptions and attitudes towards fractions. A quasi-experiment design was implemented to gather data via closed-ended, open-ended, and essay tasks from a convenience sampling of 71 female elementary preservice teachers during pre- and post-assessments. The study discovered that the select preservice teachers were weak in the content knowledge specifically on unit-whole, part-whole, equivalent area, arithmetic operations, and ordering fractional values. In contrast, the incorporation of concrete models, problem-solving and problemposing was effective in improving the preservice teachers’ level of pedagogical content knowledge, perceptions and attitudes towards fractions. Implications of the results and suggestions are discussed. Keywords: Elementary School, Problem Posing, Teacher Preparation Program, Preservice Teachers, Mixed Methods Abstrak Penelitian sebelumnya menunjukkan bahwa pengetahuan, persepsi, dan sikap merupakan faktor penting dalam pembelajaran matematika. Penelitian saat ini meneliti tentang pengaruh dari pembelajaran pecahan selama kurun waktu 3 minggu menggunakan model konkret, pemecahan masalah, dan problem-posing untuk meningkatkan pengetahuan, persepsi, dan sikap calon guru sekolah dasar terhadap materi pecahan. Sebuah desain eksperimen semu diimplementasikan untuk mengumpulkan data melalui permasalahan dalam bentuk soal closed-ended, open-ended, dan uraian dari pengambilan sampel yang representative dari 71 calon guru wanita sekolah dasar selama pra dan pasca penilaian. Studi ini menemukan bahwa calon guru terpilih lemah pada konten materi pecahan, khususnya pada unit-whole, part-whole, area yang sama, operasi aritmatika, dan nilai pecahan berurutan. Sebaliknya, penggabungan model konkret, pemecahan masalah, dan problem-posing adalah efektif dalam meningkatkan level pengetahuan konten pedagogis calon guru, persepsi, dan sikap terhadap materi pecahan. Implikasi hasil dan saran lebih lanjut dibahas dalam tulisan ini. Kata kunci: Sekolah Dasar, Problem Posing, Program Persiapan Guru, Calon Guru, Metode Campuran How to Cite: Rosli, R., Goldsby, D., Onwuegbuzie, A.J., Capraro, M.M., Capraro, R.M., & Gonzalez, E.G.Y. (2020). Elementary preservice teachers’ knowledge, perceptions, and attitudes towards fractions: A mixedanalysis. Journal on Mathematics Education, 11(1), 59-76. http://doi.org/10.22342/jme.11.1.9482.59-76. Knowing numbers and operations is a cornerstone of mathematics education in the school curriculum (National Council of Teacher of Mathematics [NCTM], 2000). Young students should have acquired a conceptual understanding of number systems, their structures, and properties during classroom instruction (Lamon, 1999). Fractions are considered central concepts for school mathematics but have conventionally been challenging and cumbersome for teachers to deliver as well as for students to learn (Barnett-Clarke, Fisher, Marks, & Ross, 2010; Lin, 2010). An incomplete understanding of fractions 59 60 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 59-76 can eventually affect individuals’ learning in subsequent mathematical areas such as algebra, trigonometry, and calculus and related disciplines (Barnett-Clarke et al., 2010). Numerous previous studies have emphasized the impact of diverse classroom instruction (e.g., traditional, concrete models, web-based, one-on-one) on student learning of fractions (Lin, 2010; Newton, 2008; Osana & Royea, 2011). However, there has been scant research measuring the potential benefit of an instructional practice that integrates concrete models for developing preservice teachers’ conceptual understanding of fractions (Sarama & Clements, 2009). In this study, we focused on measuring the integration of Mathematics TEKS Connection module with concrete model instructional practices in facilitating preservice teacher knowledge construction, perceptions and attitudes towards fractions during the teacher preparation programs. A body of literature has documented that a majority of in-service teachers and preservice teachers have limited profound knowledge for teaching mathematics (Ball, 1993; Hill, 2010; Ma, 1999; Newton, 2008; Timmerman, 2004). In addition, White, Way, Perry and Southwell (2005) and Llinares (2002) stated preservice teachers' belief or perceptions is reflected in their action or attitudes during teaching and learning that influence the classroom instructional practices. Despite this evidence, little is known about the nature of knowledge for teaching fractions, perceptions and attitudes that teachers should have nurtured during teacher preparation, the place where teachers should have acquired their teaching repertoire. Teacher Knowledge Ball (1993) and Ma (1999) stated that teachers’ content knowledge could be a possible aspect affecting classroom instruction. Meanwhile, Shulman (1986) noted content knowledge alone is not sufficient; pedagogical content knowledge is also significant for making the learning of mathematical concepts understandable. Shulman (1986) pointed out that content knowledge is solely an amount of mathematics knowledge that one should have for teaching a particular concept. It includes the conceptual and procedural understanding of specific mathematical ideas (Shulman, 1986). Many researchers argue the importance of teacher knowledge in making teaching and learning of mathematical content meaningful. When teachers do not possess in-depth knowledge of a particular concept (Ball, Thames, & Phelps, 2008) and do not know how to represent the idea and to make it comprehensible and understandable (Shulman, 1986), they often fail to deliver the concept for students’ understanding (Barnett-Clark et al., 2010). In contrast, pedagogical content knowledge includes the way a teacher represents fractions to facilitate student learning by using appropriate models, analogies, illustrations, examples, explanations, and demonstrations (Shulman, 1986). Also, teachers must be aware of students’ knowledge of fractions and know the strategies for reorganizing students’ understanding appropriately (Shulman, 1986). Rosli, Goldsby, Onwuegbuzie, Capraro, Capraro, & Gonzalez, Elementary preservice teachers’ knowledge … 61 Perceptions and Attitudes Previous research showed preservice teachers entered teacher preparation programs with preexisting views grounded mainly on their background and positive or negative experiences in learning mathematics (Timmerman, 2004; White et al., 2005). These preexisting perceptions are resistant to change and difficult to break (Kane, Sandretto, & Heath, 2002). White et al. (2005) claimed an intervention is needed to stop a cycle of negativity perceptions and attitudes towards mathematics especially for preservice teachers who in turn influence the formation of student attitudes. According to White et al., negative perceptions could create negative teaching ways, that can successively affect students’ beliefs, attitudes and learning outcomes. Thus, teacher educators should perceive preservice teacher perceptions and attitudes expressly for utilizing the potential ways for knowledge construction and stimulating learning. Multiple Tasks and Concrete Models Appropriate instructional materials and mathematical tasks in classrooms are indeed crucial for helping students grasp abstract concepts for knowledge construction (Cramer & Wyberg, 2009; Sarama & Clements, 2009). The incorporation of problem-solving and problem-posing tasks can help teachers to enhance students’ understanding of fraction with the assistance of concrete models such as fraction strips, and paper-folding. For instance, when students are asked to generate a problem for this division of fractions, 3 4 1 8 ÷ = 𝑛 (Sowder, Philipp, Armstrong, & Schappelle, 1998), they can use a concrete model to discover fraction size (Empson, 2002) and the relationships between the 1 4 1 and the 8. The teacher can examine students’ understanding by asking students to explain the meaning of 3 4 1 ÷ 8 as illustrated in Figure 1. Based on the representational model of Figure 1, students should be able to make connections to their prior knowledge about fractional numbers and find the answer to the problem. Figure 1. Models for Division of Fraction 62 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 59-76 Teachers may assist students’ learning during the process of operating the models. A student 1 3 must be able to grasp the concepts and ponder the underlying association on six sets of 8 in 4. Based on the necessary knowledge, the student could think of a fraction situation that fits the given equation and then teachers would ask the students to share their situations with others in the class. METHOD In utilizing both quantitative and qualitative data and rigorous methodological research techniques, we adapted close-ended and open-ended problem-solving and problem-posing tasks to shed light on the research question: What was the effect of the instructional unit of developing conceptual understanding of fractions with concrete models on the levels of elementary preservice teachers’ content knowledge, pedagogical content knowledge, perceptions and attitudes towards learning and teaching fractions? The pragmatist approach (Onwuegbuzie, Johnson, & Collins, 2009) drove the study that involved administering two pre- and post-assessments containing close-ended items and open-ended questions. Pragmatist researchers believe that there are many useful ways of seeking knowledge, including mixing quantitative and qualitative data and methods (Onwuegbuzie et al., 2009). A quasi-experimental research method was utilized for gathering both quantitative and qualitative data that focused on a onegroup pretest-posttest design (Shadish, Cook, & Campbell, 2002). The qualitative component of the study was based on the constructivist-naturalistic paradigm (Lincoln & Guba, 1985). We attempted to assess the preservice teachers’ understanding and attitudes about fractions by examining the open-ended responses and seek “individual and collective reconstructions that may unite around consensus” (Onwuegbuzie et al., 2009, p. 122). It was hoped that we could assess the treatment integrity (e.g., fidelity score) of the fraction instruction using both quantitative and qualitative instruments (Collins, Onwuegbuzie, & Sutton, 2006) to reduce any implementation biases (Onwuegbuzie, 2003). Besides, mixing quantitative and qualitative data can provide significant enhancement of the findings for generalizability purposes (Collins et al., 2006). The study employed the use of a convenience sampling technique to select the participants who were willing to be part of the study (Collins, Onwuegbuzie & Jiao, 2007). The instructor, Dr X who is an experienced professor, has been teaching elementary methods courses for more than ten years at one of the public universities in Texas. The method course was compulsory for the completion of the undergraduate degree with certification in the EC-Grade 6 Core Subjects/Generalist program. The participants were 71 female preservice teachers who were pursuing a Texas teacher certification in elementary school classrooms. They were from two class sections of the experienced professor in the Fall semester. The course comprised of one hour and 15 minutes of 16-week face-to-face meetings together with an online module through the Blackboard Learning System (eLearning) of the university. Concerning ethical considerations, permission was granted to perform the study over the Institutional Rosli, Goldsby, Onwuegbuzie, Capraro, Capraro, & Gonzalez, Elementary preservice teachers’ knowledge … 63 Review Board (IRB) of the university and an information sheet was provided to the participants informing them that the study would run as regular class session. The times for both class sections were back to back and the professor taught the same content and used similar instructional strategies. Close-ended items from the Learning Mathematics for Teaching [LMT] (2008) were utilized for examining content knowledge and pedagogical content knowledge. Only 11 close-ended were selected based on content-related validity to fractional concepts (Measure A) under investigation. In addition, we adopted five close-ended items on geometry (Measure B) to examine the non-equivalent dependent variable for improving the design. This instrument was made available to participants using an online survey tool, Qualtrics, and was piloted to preservice teachers in the other class sections of the methods course. The internal consistency scores (Cronbach’s alpha) were .74 for fractions and .73 for geometry using the Statistical Program for Social Science (SPSS) software version 21.0 (IBM Corp., 2013). Additionally, nine open-ended items were selected from Sowder et al., (1998), five questions were used to measure content knowledge and four were utilized to assess pedagogical content for teaching fractions. Two professors reviewed these open-ended questions to determine the contentrelated validity such as the relevancy and appropriateness of the tasks (Collins et al., 2006). The openended instrument was piloted and no changes were made to the instrument and the administration time of the present study. The participants received three weeks of classroom instruction (each week for 4 hours) on basics fraction concepts including addition, subtraction, multiplication, and division. The classroom instruction was comprehensively concentrated on concrete models that helped preservice teachers discover and construct basics mathematics knowledge based on the constructivist theory. Four to five preservice teachers were grouped into a small station (table) and concrete models such as tangrams, fraction bars, fraction circles, fraction towers, fraction strips, and pattern blocks were made available in every class session. During class instruction, Dr. X adopted a meaningful teaching approach through demonstrations and problem-posing activities that involved preservice teachers’ use of modeling to develop conceptual knowledge of fractions. Preservice teachers were given various fraction problems to create a scenario for and model with manipulatives or drawings with their partners. Problems were such as as 1 2 1 + 4, 2 3 1 −4, 1 2 3 3 1 1 × 4 , 4 × 2 , 4 ÷ 2 , and 1 2 ÷ 4. In addition to classroom instruction, preservice teachers engaged and participated actively in online modules, which utilized multiple instructional strategies including web-based activities, videos, and readings. The modules were developed and field-tested that followed the Mathematics Texas Essential Knowledge and Skills (TEKS) Connection [MTC Project]. It was a part of the Texas Math Initiative and represented a partnership between the Texas Education Agency (TEA) and the university. With three different bands (K-4, 5-8, 9-12), it covered some specific and vital mathematics concepts (e.g., place value, fractions) for each grade level. The modules were made available for future teachers use from http://mtc.tamu.edu/home.htm?intro-pre.htm. 64 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 59-76 The fraction assessments were administered sequentially online during class periods in October 2011. A week before the professor began the instruction on fractions, the preservice teachers were asked to access the Qualtrics survey tool and respond to 16 multiple-choice items by choosing their best options. Then, during one class period, a graduate assistant administered the open-ended tasks wherein participants were required to write their responses on an answer sheet. The fraction instruction took place during three weeks wherein unstructured observations were made and field notes were taken to capture all potentially relevant phenomena. The graduate assistant was an observer-as-participant (Johnson & Christensen, 2012) wherein the primary role was to collect related information (e.g., setting, interactions, and subtle factors) concerning fraction instruction (Erlandson, Harris, Skipper, & Allen, 1993). A week after Dr. X completed the fraction instruction, the preservice teachers were required to login to the Qualtrics instrument and answer the same items as on the pre-assessment. Similarly, the graduate assistant administered the same open-ended assessment to the participants. The close-ended items were scored based on their correctness, one point for a exact answer or zero point for an inappropriate one. Data were stored in the SPSS software (SPSS Inc., 2007) and points for items were totaled up for content knowledge (i.e., Item 1, 3, 4, 6, 7, 10, and 11) and pedagogical content knowledge (Item 2, 5, 8, and 9). Next, the open-ended items were transformed quantitatively where each pre- and the post-written response was assessed and coded into a numerical value (Teddlie & Tashakkori, 2006) based on the degree of accomplishment, zero (lowest) - four points (highest). For inter-rater agreement percentage, two independent raters graded seven identical scripts and attained an 83.3% agreement. The raters discussed to resolve any disagreement. Based on the data collection, a few statistical analyses were run and effect sizes for pre-post contrast were calculated. Then, we analyzed the written responses and observation data through a constant comparison analysis (Glaser, 1965) for a more in-depth and better understanding of the elementary preservice teachers’ strategies (Lincoln & Guba, 1985) for solving fractional tasks. The essay parts were unitized using the QDA Miner 3.2 (Provalis Research, 2009) following the classical content analysis method (Onwuegbuzie & Teddlie, 2003). The written essays were examined, underlined into chunks, and unitized into smaller significant parts (codes). The codes that were frequently appear were counted to symbolize essential concepts of preservice teachers’ perceptions and attitudes towards fractions. RESULTS AND DISCUSSION Based on the quantitative results and qualitative findings, we hoped to make external (statistical) generalizations regarding the level of fractional understanding to the entire population from where the participants were conveniently drawn. In the present study, the differences from pretest to posttest scores on the problem-solving and problem-posing tasks were utilized to observe the effect of the fraction instruction that focused on concrete models. These employed mixed data analyses to measure the change in the levels of elementary preservice teachers’ content knowledge, pedagogical content Rosli, Goldsby, Onwuegbuzie, Capraro, Capraro, & Gonzalez, Elementary preservice teachers’ knowledge … 65 knowledge, perceptions and attitudes regarding learning and teaching fractions. Analysis of Quantitative Data This quasi-experimental design utilized a one-group pretest-posttest design to assess the causeand-effect relationship of fraction instruction (treatment) and preservice teachers’ knowledge and belief about fractions. For reducing threats to validity of the experimental design, we adapted six geometry items as the nonequivalent dependent variable (measure B) in addition to 11 fraction items for measure A. The results showed the preservice teachers’ scores of fraction knowledge (measure A) changed statistically significantly, t(59) = 3.50, p = .001 with a mean gain effect size of .32 from pretest to posttest possibly because of the treatment of fraction instruction. No statistically significant change for geometry knowledge (measure B) was noted from pretest (M = 5.13) to posttest (M = 5.05). According to Shadish et al. (2002), this indication is useful for ruling out the possibility of a cause-and-effect relationship in a research study. The results of the close-ended assessment included 60 completed responses from the select preservice teachers. Eleven answer scripts were eliminated due to an invalid/incomplete pretest or posttest. The t-test value indicates that the mean score for fraction knowledge on the posttest was significantly greater than the mean score on the pretest. Specifically, the results from the close-ended items revealed statistically significant differences in means scores for both preservice teachers’ content knowledge [t(59) = 2.14, p = .037] and pedagogical content knowledge [t(59) = 2.87, p = .006]. The standardized mean gain effect sizes were 0.19 and 0.38 respectively. The results indicate that the preservice teachers’ content knowledge and pedagogical content knowledge improved during instruction as measured by using close-ended items. A review of the item analysis showed that preservice teachers scored a lower percentage for content knowledge specifically on Items 4, 6, 7 10, and 11 with less than 40% answering correctly on both pretest and posttest. Also, we found that preservice teachers had the most difficulty on Item 4 with only 10% correct on the pretest and posttest. They were not able to interpret and to analyze different situations given in the question that related to the unit-whole of a fraction. The results supported Lamon’s (1999) argument that most of the classroom instruction failed to present the unit-whole fraction that contributes to students’ misunderstanding of the concept. However, the majority of preservice teachers were able to answer correctly the part of Items 2, 5, and 8 that involved their knowledge of problem posing. Besides, the most significant differences between the percentage of scores on the pretest and posttest were found on Items 6, 2 (item b only), 8 (all sub-items), and 9. Analysis of Qualitative Data As mentioned in the previous section, the qualitative data were mainly collected from preservice teachers’ responses to the open-ended assessment consisting of five fraction content knowledge items and four fraction pedagogical content knowledge items. Based on 71 completed responses, each item 66 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 59-76 was assessed and transformed into a numerical value (0-4 points) that can be analyzed statistically; then the subtotals were calculated for content knowledge and pedagogical content knowledge. Overall, the Wilcoxon signed-rank test revealed that a 3-week instruction did not elicit a significant improvement statistically in the level of elementary preservice teachers’ content knowledge [Z = -2.25, p = .024, r =.21]. Indeed, the median content knowledge score was 2.0 both pre- and post-assessment. The statistical analyses of each item under content knowledge were utilized and the Bonferroni adjustment with α = .05/5 = .01 was applied in controlling for the familywise error rate (Jaccard & Guilamo-Ramos, 2002). Even though scores mostly were higher on the post-assessment, no statistically significant difference was evident in respect to solving arithmetic operations of fractions (Item 2), recognizing a fractional part-whole (Item 3), when finding an equivalent area (Item 4), and ordering fractional values (Item 5). Instead, the study revealed an impressive result where the post-assessment score was statistically significantly lower than pre-assessment [Z = -3.54, p < .001] for identifying a unit whole (Item 1) with effect size, r = .30. In both assessments, about 58% to 77% preservice teachers did not conceptually understand unit wholes—they were not able to recognize that the shaded part of two pizzas 5 8 eaten was (item 1). Instead, they believed that the two pizzas were a separated unit whole, supported Lamon’s (1999) and Ball’s (1993) argument that many individuals tended to refer to a single item (e.g., a single pizza) as a unit (one). Similarly, the Wilcoxon test analyses were performed for multiple comparisons among items under pedagogical content knowledge for adjusted α values of .05/4 = .0125. The results indicated that the select participants gained much input from the 3-week fraction instruction as measured by their pedagogical content knowledge with all items showing statistically significant differences between the assessment results (p ≤ .001). Specifically, the preservice teachers indicated statistically significantly 3 greater improvement when recognizing the accurate representation, 2 × 4 for Item 2 with Z = 4.33 and 3 an effect size, r = .36. When they were asked, “A recipe calls for 4 the cup of flour. How much flour is needed if the recipe is doubled?” during the pre-assessment, many mentioned 2 × 3 4 and 3 4 × 2 were the same representations and accurate. They stated that because of the commutative property, multiplication is reciprocal thus resulting in the same final answer. Nevertheless, after the instruction, they were able to differentiate that both representations were not identical. Preservice teachers attained statistically significantly higher scores on the post-assessment for Item 1, 3, and 4 with similar effect size, r = .27. Through a closer examination of the written responses, rich information was revealed about preservice teachers’ pedagogical content knowledge on different concepts. For instance, for Item 1 on problem posing, when creating a scenario problem on pre3 4 1 8 3 4 assessment to match the division fraction ÷ = 𝑛, many confused it with ÷ 8 = 𝑛. Here are the examples of the fraction scenarios created from the same preservice teacher in the study: Rosli, Goldsby, Onwuegbuzie, Capraro, Capraro, & Gonzalez, Elementary preservice teachers’ knowledge … 67 There are 24 students on the class roll. Monday at school, only ¾ of the class was present. The teacher needed to divide the class into eight groups. How many students are in each group? (Pre-assessment) How many 1/8 ft long strips of ribbon can be cut from a ribbon that is ¾ ft long? (Postassessment) The responses were similar to the previous results (Item 2) that showed many preservice teachers in the present study could not differentiate the underlying fractional concepts between two arithmetic operations. To build preservice teachers’ understanding of fractions, Dr. X allocated extra time presenting and using various concrete examples for creating scenarios during class instruction. Preservice teachers spent time practicing and generating their scenario problems based on the given fraction operation. As a result, they were able to pose meaningful and accurate scenarios during postassessment. For Item 3 and 4 that represented classroom situations for pedagogical analysis, the majority of preservice teachers were having some difficulties in analyzing the contexts and in providing brief descriptions supporting their arguments. For example, only a small number of preservice teachers could identify a missing unit from the children’s responses in Item 3. For this reason, preservice teachers had difficulties deducing the size of chunks the children considered in the case of cola. The participants were unable to analyze the children’s thinking and were difficult to make inferences based on the children’s unlabeled responses. However, other participants considered students’ thinking, understanding and misunderstanding when responding to this open-ended item and focused on the number that children provided in much detail. They attempted to predict children’s thinking with some drawings, assuming a case with 12 or 24 colas, and working backward to solve the problem. Item 4 4 focused on the conversation among three children who were comparing two fractions 4 and 4 with their 8 representation. Initially, preservice teachers’ responses failed to include the vital concept of unit-whole and the role of a common unit. However, during post-assessment, they were able to point out that the children struggled to recognize which one was a more substantial value using the fraction model, failing to understand that the size of rectangles must be the same (equal unit-whole) before comparing the fractions. For instance, a preservice teacher responded: Student 2 had the right idea that if student one had made the whole rectangles the same instead of the fractional part of a whole, he would have seen that 4/4≠4/8. However, student 3 is correct in that four parts out of 4 take up the entire rectangle while four parts of 8 take up half of a rectangle, which shows two different fractions. The students definitely could have used fraction bars to make the whole shapes even, but students 2 and 3 seemed to have understood without seeing the representation. They knew that the whole was represented with four parts for one fraction and 8 for the other. They knew 68 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 59-76 the parts of the whole were being compared. Observation during Fraction Instruction From the classroom observation, most of the preservice teachers were not experiencing problems creating scenarios involving the addition and subtraction of fractions. They were able to pose scenarios and check each other for appropriate modeling and terminologies. However, when it came to multiplication and division, preservice teachers demonstrated the most considerable anxiety because they were not able to pose scenarios that fit the algorithmic operations. They struggled to understand the concept clearly, thus were not able to come up with appropriate word problems. For instance, in one of the class sessions preservice teachers were demonstrated and illustrated scenario problems for 1 1 two fraction multiplications 2 × 3 and 3 × 2. They were guided in creating meaningful scenarios to differentiate the underlying concepts between these arithmetic operations that can produce the same 2 answer, 3 but have different meanings. Many were not capable of grasping the idea for the first time and showed their frustration by expressing their anxiety and anxiousness to their group members. Then, they asked Dr. X to show other examples that would hopefully help them build their conceptual understanding. After several explanations and practices with concrete models, most of them were able to create some meaningful scenarios involving multiplication and division even though some still did not fully understand the notion. We believed more time and effort would be necessary for them to build and understand these new concepts. Overall, we found the majority of preservice teachers enjoyed and gained much knowledge from each session of their fraction instruction and the online modules. In class, they actively participated in exploring and experimenting with various manipulatives with group members or partners to build their fractional understanding. One preservice teacher stated, “I believe that manipulatives are the best learning tool for children because they are concrete objects that they can work out the math problems with”. The response supports Piaget’s (1964) argument that hands-on materials or concrete models can help younger children build mental sense and abstract ideas in mathematics. Perceptions and Attitudes towards Fractions After the semester end, sixty-six preservice teachers submitted short written responses that described their perceptions and attitudes towards learning and teaching fractions based on their involvements in the method course. Each written response was coded into meaningful units of data in the form of phrases, sentences, or paragraphs and then these data were grouped into similar emergent themes (Johnson & Christensen, 2012). Eighteen themes were developed from the coding process and were classified into three meta-themes that related to their positive, negative, or unchanged attitudes towards fractions. We categorized preservice teachers under positive attitudes meta-theme when they showed optimistic phrases, statements and words from the written essays. Otherwise, the negative Rosli, Goldsby, Onwuegbuzie, Capraro, Capraro, & Gonzalez, Elementary preservice teachers’ knowledge … 69 responses noted all preservice teachers’ unfavorable endorsement towards learning and teaching fractions. The results from the coding process of the essays revealed that 62 of 66 selected preservice teachers showed some positive remarks about fractions based on their experiences from participating in the activities during the method course. Nine themes with 377 units of data were formed and considered from the responses demonstrating the change in the perceptions and attitudes. Among all the themes, the course material was frequently coded (100 units of data) indicating this might be the most critical concept noted by the majority of preservice teachers. They believed the incorporation of concrete manipulatives, online modules and videos, daily examples, and scenario problems during class instruction helped their learning about fractions. They discovered meaningful experiences in exploring and practicing different approaches to learning and teaching fractions from the method course. Also, most preservice teachers indicated that they felt comfortable with fractions (71 units of data) and their understanding of fractional concepts had increased (57 units of data) as compared to the beginning of the semester. However, they firmly believed that more time, practice, and research (39 units of data) were needed to understand fractional concepts profoundly and to be able to teach fractional concepts to young children effectively. For example, one preservice teacher stated: I feel that my attitude towards fractions has changed for the better. Dr. X did a great job explaining fractions to the class in ways I could understand. She allowed us to use different manipulatives that helped in understanding fractions rather than just using a pencil and paper. Although I am no 100% confident about teaching fractions I feel so much better about it and I know that with a little more practice I will be able to teach others what I learned about fractions in Dr. X's class. Simultaneously, preservice teachers believed the instructor (23 units of data), Dr. X, facilitated the learning of fractional concepts through various teaching aids and scenario problems. Dr. X focused on building the conceptual understanding of fractions with hands-on activities that emphasized the ‘why’ instead of the ‘how’ behind addition, subtraction, multiplication, and division of fractions. Based on preservice teachers’ past experiences, many argued that they were taught fractions with rules and memorization (40 units of data) in elementary and middle school and the learning process was extremely confusing. Also, they mentioned the ‘rote learning’ of fractions through a series of steps and formulas (i.e., algorithms) without an understanding of what was going on. A participant stated, “I have known how to get the correct answer when working with fractions, but until this class I didn’t understand how I got the answer or why it was correct”. It was an unlearning and relearning process for the majority of preservice teachers during the method course with Dr. X. In addition, creating and practicing different scenarios (e.g., problem-solving problems, word problems) involving fractions with concrete models helped preservice teachers understand the concepts better. After the course, they could “see” how portions make sense in their daily lives and would be 70 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 59-76 interested in incorporating the same teaching method into their future classroom (21 units of data). For instance, one preservice teacher discussed: When we started with creating word problems I thought no big deal. However, it became 3 4 1 3 1 3 3 4 difficult to word a problem to show if it was × versus × , because the modeling was completely different even though the answer will be the same. We practiced a lot on fractions, and also though I do not feel like I am an expert in them, I feel confident enough to teach them. I may have to consult my notes from time to time, but isn't that what those are for? I would use my notes mainly in planning out my lessons. Contrary to the many positive responses, some preservice teachers had included an opposite perception throughout their essay. We found four themes that revealed their negative attitudes about the learning and teaching of fractions. Most preservice teachers still felt anxious (64 units of data) about fractions and intimidated by mathematics (14 units of data) as a subject in general. They were nervous about teaching fractions to young children (39 units of data) because the resistance towards learning fractions had been entrenched for years and it was not going to disappear overnight demonstrating their low self-esteem and confidence towards fractions. Therefore, they were still conscious and not sure how to explain the concepts in class. For them, it was difficult and challenging to fully understand fractions especially when creating scenarios for multiplication and division with correct terminologies. Also, we found several preservice teachers were overwhelmed with the methods class (14 units of data) but were able to rationalize the situation, thus tried to motivate themselves and be more positive towards the teaching and learning of fractions. Comment for a preservice teacher: My view of fractions has changed slightly. Before this class, I did not even think about how difficult fractions would be to teach. I got nervous when I heard that it is one of the most challenging subjects to teach. When we started learning about them, I did find that fractions are tough for me to understand. Therefore, I was fearful of teaching them. I think the main thing I am concerned about is the ordering of fractions. For some reason, that gives me the most difficulty. I do believe that after this course, it has gotten a little easier to understand because of the hands-on activities that we did daily. However, I think I will always have fear when teaching fractions. The analysis of essay questions revealed two preservice teachers had entirely negative attitudes after instruction. They realized how complex working with fractions can be and hated fractions because they did not feel adequate in their teaching abilities even after taking a method course. Also, we noticed a preservice teacher claimed that she did not see any changes in her attitudes towards fractions after the instruction. She had been struggling with fractions and how to understand the concept before and after the methods course. She mentioned “It is hard for me to learn in a group setting. I am much better at Rosli, Goldsby, Onwuegbuzie, Capraro, Capraro, & Gonzalez, Elementary preservice teachers’ knowledge … 71 learning things one on one when I cannot comprehend them". The external generalization of the present study is limited to all elementary preservice teachers from a particular university where participants are conveniently drawn and are based on the results taken from the validated measures. However, we emphatically trust the outcomes because we incorporated both quantitative and qualitative research based on a pragmatist approach providing a perfect combination yielding “complementary strengths and nonoverlapping weaknesses” (Johnson & Turner, 2003, p. 299). Notably, the use of both closed and open-ended items uncovered several key findings from multiple perspectives that add to the existing literature on the preservice teachers’ level of knowledge and attitudes towards fractions. Results of the present study advance current understanding of the effect from a 3-week fraction instruction of a mathematics method course in several valuable ways. First, it provides useful insights into the potential benefits of integrating concrete models, problem-solving, and problem-posing activities on building prospective teachers’ profound knowledge of teaching fractions. Second, the study was distinctive because it represents the first study using a mixed-methods research design to more deeply understand the complex phenomena of selected preservice teachers’ content knowledge and pedagogical content knowledge of fractions, and attitudes towards fractions. Seventy-one preservice teachers participated in the study where they complete pre- and postassessments on content knowledge and pedagogical content knowledge of fractions. Also, they submitted an essay about attitudes towards fractions after the 3-week instructional unit. Importantly, the study showed that the incorporation of problem-solving, problem posing, and hands-on activities have the potential to assist instructors in making fraction instruction more meaningful and enjoyable thus eventually helping learners to develop conceptual knowledge of fractions (Thompson, 1994). Some other significant outcomes and issues need to be addressed aside from the inconclusive results as measured by the closed and open assessment tasks. Related to the select preservice teachers’ content knowledge of fractions, results from the descriptive statistics revealed low percentages of preservice teachers achieved the full points for most of the items on both pre-and post-assessments. The results might indicate the participants were weak on the particular fraction concepts being tested. Specifically, we noticed the select preservice teachers were not aware that the unit-whole may include more than one item or may consist of items packaged as one known as a composite unit. The behavior was consistently evident when they were identifying the unit-whole of Item 4 (closed-ended) and Item 1 (open-ended). On pre- and post-assessments, almost 90% of preservice teachers were not able to notice that children were considering a different set of unitwhole on Item 4 (close-ended). Similarly, the picture for Item 1 (open-ended) showed two pizzas with the same or different kinds, which is called a one two-unit and not two one-unit pizzas. However, many 1 preservice teachers assumed a single object (i.e., a single pizza) as a unit (one) and answered 1 4 pizza 5 instead of 8 pizza. Lamon (1999) argued that the concept of the unit has frequently been neglected in 72 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 59-76 classroom discussions. Majority teachers and textbook authors emphasized using the same unit-whole such as one cake in which making students assumed a unit was always a single object (Lamon, 1999). Lamon argued many classroom instructions failed to introduce the vital concept of unit-whole of fractions to students. CONCLUSION While this study revealed the preservice teachers’ lack of content knowledge of fractions, the results showed compelling evidence of their understanding of teaching fractions supported previous research by Ball (1993), Hill (2010), Ma (1999), Newton (2008), and Timmerman (2004). It is noteworthy to see that they demonstrated statistically significant improvement in their level of pedagogical content knowledge even though they received only a three-week fraction instruction unit mainly focusing on problem posing and hands-on activities. We believe the close interaction with the instructor and quality of instruction during the mathematics method course possibly helped preservice teachers develop knowledge about pedagogy and nurture their fraction teaching repertoire (White et al., 2005). Our data warrant further investigation into the relationship among the duration of instruction, the role of instructor, quality of instruction, and level of fractional knowledge they gained. Therefore, a more thorough examination is critical to determine the factors that might affect the level of knowledge before and after instruction. In the post-assessment, many preservice teachers successfully generated and identified meaningful and accurate fraction situations that matched the given arithmetic operations. When reviewing their fraction essays, they mentioned how problem posing and hands-on activities had an impact on their learning and understanding of fractions. The majority of preservice teachers showed positive attitudes towards fractions after receiving the instruction and believed it was an eye-opening session. Still, some expressed their concerns about additional practice and the time needed for developing confidence in teaching fractions. Our results revealed that the select preservice teachers were able to reflect on students’ work and to suggest basic teaching approaches for helping students’ misunderstanding of fraction concepts. However, the responses showed they had limited practical experiences teaching fractions to children. Because the participants would be elementary school teachers later, more exposure working directly with students in a real classroom setting is needed. The experience responding to elementary students would develop their pedagogical skills and improve preservice teachers’ understanding of educational components (Llinares, 2002; White et al., 2005). Taken together, teacher education services are a place for teachers to build expertise, teach skills, values, and understanding in order to become effective teachers in mathematics (Llinares, 2002). Nonetheless, the prior knowledge and behaviors of service teachers have a significant impact on what and how they learn during their teacher training programs (Llinares, 2002). Teacher educators are responsible for providing potential teachers with the in-depth knowledge and experience required to Rosli, Goldsby, Onwuegbuzie, Capraro, Capraro, & Gonzalez, Elementary preservice teachers’ knowledge … 73 effectively teach the basic concepts of fractions. ACKNOWLEDGMENTS The author would like to acknowledge the National University of Malaysia through the GG-2019015 and GGPM-2013-085 grants for providing support for the research work. The open-access publishing fees for this article have been partially covered by the Texas A&M University Open Access to Knowledge Fund (OAKFund), supported by the University Libraries and the Office of the Vice President for Research. REFERENCES Ball, D. L. (1993). Halves, pieces, and twoths: Constructing and using representational contexts in teaching fractions. In Carpenter, T. P., Fennema, E., Romberg, T. A. (Eds.). Rational numbers: An integration of research (pp.157-195). Hillsdale, NJ: Erlbaum. Ball, D. L., Thames, M. H., & Phelps, G. (2008). Content knowledge for teaching: What makes it special? Journal of Teacher Education, 59(5), 389-407. http://dx.doi.org/10.1177/0022487108324554. Barnett-Clarke, C., Fisher, W., Marks, R., & Ross, S. (2010). Developing essential understanding of rational number: Grades 3-5. Reston, VA: The National Council of Teachers of Mathematics. Collins, K. M. T., Onwuegbuzie, A. J., & Jiao, Q. G. (2007). A mixed methods investigation of mixed methods sampling designs in social and health science research. Journal of Mixed Methods Research, 1(3), 267-294. http://dx.doi.org/10.1177/1558689807299526. Collins, K. M. T., Onwuegbuzie, A. J., & Sutton, I. L. (2006). A model incorporating the rationale and purpose for conducting mixed methods research in special education and beyond. Learning Disabilities: A Contemporary Journal, 4(1), 67-100. Cramer, K., & Wyberg, T. (2009). Efficacy of different concrete models for teaching the part-whole construct for fractions. Mathematical Thinking and Learning, 11(4), 226-257. http://dx.doi.org/10.1080/10986060903246479. Erlandson, D. A., Harris, E. L., Skipper, B. L & Allen, S. D. (1993). Doing naturalistic inquiry: A guide to methods. Newbury Park, CA: Sage. Empson, S. B. (2002). Organizing diversity in early fraction thinking. In B. Litwiller & G. Bright (Eds.), Making sense of fractions, ratios and proportions: 2002 Yearbook (pp. 29-40). Reston, VA: National Council of Teachers of Mathematics. Glaser, B. (1965). The constant comparative method of qualitative analysis. Social Problems, 12(4), 436-445. http://dx.doi.org/10.2307/798843. Hill, H. C. (2010). The nature and predictors of elementary teachers’ mathematical knowledge for teaching. Journal for Research in Mathematics Education. 41(5), 513-545. IBM Corp. (2013). IBM SPSS Statistics for Windows, Version 21.0 [Computer software]. Armonk, NY: IBM Corp.). Jaccard, J., & Guilamo–Ramos, V. (2002). Analysis of variance frameworks in clinical child and adolescent psychology: Issues and recommendations. Journal of Clinical Child and Adolescent Psychology, 31(1), 130 –146. http://dx.doi.org/10.1207/S15374424JCCP3101_15. 74 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 59-76 Johnson, B., & Christensen, L. (2012). Educational research: Quantitative, qualitative, and mixed approaches (4th ed.). Boston, MA: Sage. Johnson, R. B., & Turner, L. A. (2003). Data collection strategies in mixed methods research. In A. Tashakkori, & C. Teddlie (Eds.), Handbook of mixed methods in social and behavioral research (pp. 297-319). Thousand Oaks, CA: Sage. Kane, R., Sandretto, S., & Heath, C. (2002). Telling half the story: A critical review of research on the teaching beliefs and practices of university academics. Review of Educational Research, 72(2), 177–228. http://dx.doi.org/10.3102/00346543072002177. Lamon, S. J. (1999). Teaching fractions and ratios for understanding: Essential content knowledge and instructional strategies for teachers. Mahwah, NJ: Erlbaum. Learning Mathematics for Teaching. (2008). Mathematical knowledge for teaching measures: Rational number. Ann Arbor, MI: Author. Lin, C. (2010). Web-based instruction on preservice teachers’ knowledge of fraction operations. School Science and Mathematics, 110(2), 59-70. http://dx.doi.org/10.1111/j.1949-8594.2009.00010.x. Lincoln, Y., & Guba, E. (1985). Naturalistic inquiry. Newbury Park, CA: Sage. Llinares, S. (2002). Participation and reification in learning to teach. The role of knowledge and beliefs. In G. Leder, E. Pehkonen, & G. Tö rner (Eds.), Beliefs: A hidden variable in mathematics education (pp. 195–210). Dordrecht, The Netherlands: Kluwer. Ma, L. (1999). Knowing and teaching elementary mathematics: Teachers’ understanding of fundamental mathematics in China and the United States. Mahwah, NJ: Erlbaum. National Council of Teachers of Mathematics. (2000). Principles and standards for school mathematics. Reston, VA: Author. Newton, K. J. (2008). An extensive analysis of preservice elementary teachers’ knowledge of fractions. American Educational Research Journal, 45(4), 1080-1110. http://dx.doi.org/10.3102/0002831208320851. Onwuegbuzie, A. J. (2003). Expanding the framework of internal and external validity in quantitative research. Research in the Schools, 10(1), 71-90. Onwuegbuzie, A. J., Johnson, R. B., & Collins, K. M. T. (2009). A call for mixed analysis: A philosophical framework for combining qualitative and quantitative. International Journal of Multiple Research Approaches, 3(2), 114-139. http://dx.doi.org/10.3102/000283120832085110.5172/mra.3.2.114. Onwuegbuzie, A. J., & Teddlie, C. (2003). A framework for analyzing data in mixed methods research. In A. Tashakkori & C. Teddlie (Eds.), Handbook of mixed methods in social and behavioral research (pp. 351–383). Thousand Oaks, CA: Sage. Osana, H. P., & Royea, D. A. (2011). Obstacles and challenges in preservice teachers’ explorations with fractions: A view from a small-scale intervention study. Journal of Mathematical Behavior, 30(4), 333-352. http://dx.doi.org/10.1016/j.jmathb.2011.07.001. Piaget, J. (1964). Part I: Cognitive development in children: Piaget development and learning. Journal of Research in Science Teaching, 2(3), 176-186. http://dx.doi.org/10.1002/tea.3660020306 Provalis Research. (2009). QDA Miner 3.2. User‘s guide. Montreal, QC, Canada: Author. Rosli, Goldsby, Onwuegbuzie, Capraro, Capraro, & Gonzalez, Elementary preservice teachers’ knowledge … 75 Sarama, J., & Clements, D. H. (2009). "Concrete" computer manipulatives in mathematics education. Child Development Perspectives, 3(3), 145-150. http://dx.doi.org/10.1111/j.17508606.2009.00095.x. Shadish, W. R., Cook, T. D., & Campbell, D. T. (2002). Experimental and quasi-experimental designs for generalized causal inference. Boston, MA: Houghton Mifflin. Shulman, L. S. (1986). Those who understand: Knowledge growth in teaching. Educational Researcher, 15(2), 4-14. http://dx.doi.org/10.3102/0013189X015002004. Sowder, J. T., Philipp, R. A., Armstrong, B. E., & Schappelle, B. P. (1998). Middle-grade teachers’ mathematical knowledge and its relationship to instruction: A research monograph. New York, NY: State University of New York. Teddlie, C., & Tashakkori, A. (2006). A general typology of research designs featuring mixed methods. Research in the Schools, 13(1), 12-28. Thompson, P. W. (1994). Concrete materials and teaching for mathematical understanding. Arithmetic Teacher, 41(9), 556-558. Timmerman, M. (2004). The influences of three interventions on prospective elementary teachers’ beliefs about the knowledge base needed for teaching mathematics. School Science and Mathematics, 104(8), 369-382. http://dx.doi.org/10.1111/j.1949-8594.2004.tb18003.x. White, A. L., Way, J., Perry, B., & Southwell, B. (2005). Mathematical attitudes, beliefs and achievement in primary pre-service mathematics teacher education. Mathematics Teacher Education and Development, 7, 33-52. 76 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 59-76 ISSN 2087-8885 E-ISSN 2407-0610 Journal on Mathematics Education Volume 11, No. 1, January 2020, pp. 17-44 PROSPECTIVE MATHEMATICS TEACHERS’ COGNITIVE COMPETENCIES ON REALISTIC MATHEMATICS EDUCATION Rezan Yilmaz Ondokuz Mayis University, Faculty of Education, Samsun, Turkey Email: [email protected] Abstract Realistic Mathematics Education (RME) is based on the idea that mathematics is a human activity; and its main principle is to ensure the transition from informal knowledge to formal knowledge through contextual problems. This study aims at revealing how RME is configured in the minds of prospective mathematics teachers and their cognitive competency in that sense. For that purpose, at the end of the process, in which the approaches used in mathematical education including RME are examined and interpreted, 32 prospective teachers were asked various open-ended questions. Moreover, they were expected to pose contextual problems that could be used in RME. After analysing the obtained data via qualitative research techniques, it is seen that the majority of the prospective teachers possesses theoretical knowledge on RME. However, it is also observed that their ability to present its differences and similarities with other approaches and to pose contextual problems suitable to RME has been decreased. Keywords: Realistic Mathematics Education, Prospective Mathematics Teacher, Cognitive Competency, Contextual Problem Abstrak Pendidikan Matematika Realistis (PMR) didasarkan pada gagasan bahwa matematika adalah aktivitas manusia; dan prinsip utamanya adalah memastikan transisi dari pengetahuan informal ke pengetahuan formal melalui masalah kontekstual. Penelitian ini bertujuan untuk mengungkapkan bagaimana PMR dikonfigurasi dalam pikiran calon guru matematika dan kompetensi kognitif mereka dalam pengertian itu. Untuk tujuan itu, pada akhir proses, 32 calon guru ditanyai berbagai pertanyaan terbuka, yang mana pendekatan yang digunakan dalam pendidikan matematika termasuk PMR diujikan dan ditafsirkan. Selain itu, mereka diharapkan untuk dapat mengemukakan masalah kontekstual yang dapat digunakan dalam pendekatan PMR. Setelah menganalisis data yang diperoleh melalui teknik penelitian kualitatif; terlihat bahwa mayoritas calon guru memiliki pengetahuan teoritis tentang PMR. Namun, pengamatan lebih jauh terkait kemampuan mereka untuk menyajikan perbedaan dan persamaan antara pendekatan PMR dengan pendekatan yang lain dan pengajuan masalah kontekstual yang sesuai dengan PMR, telah berkurang. Kata kunci: Pendidikan Matematika Realistik, Calon Guru Matematika, Kompetensi Kognitif, Masalah Kontekstual How to Cite: Yilmaz, R. (2020). Prospective mathematics teachers’ cognitive competencies on realistic mathematics education. Journal on Mathematics Education, 11(1), 17-44. http://doi.org/10.22342/jme.11.1.8690.17-44. The studies conducted in mathematical education, which is highly impacted by theoretical and methodological framework of psychology (Sriraman & English, 2010), have paved the way for the innovative ideas on learning including new learning outcomes, new types of learning processes, and new instructional methods that are both wanted by society and currently stressed on in psychological and educational theory (Simons, Van der Linden, & Duffy, 2000; Yackel, Gravemeijer, & Sfard, 2011). These ideas occur in a manner that is totally different from behavioral approach studies (Aubrey & Riley, 2016) which traditional education methods are based on. And their approaches can be fundamentally categorized as cognitive approaches and constructivism. Cognitive theories focus on 17 18 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 17-44 the conceptualization of one’s learning process and emphasize how information is received, organized, stored and retrieved in one’s mind (Davis, 1990; English, 1995; Ertmer & Newby, 1993; Jonassen, 1991; OECD, 2003). Despite being viewed as a branch of cognitivism as a result of expressing learning as a cognitive activity (Ertmer & Newby, 1993), constructivism is based on the idea that the mind filters the input received from life in order to create its own reality (Jonassen, 1991a) and the information is not simply received but actively constructed by the person himself/herself (Driscoll, 2005). The different perspectives in education led to radical changes in objectives and nature of mathematics as well as many other areas. This change in mathematics education particularly emphasizes that mathematics learning can occur only when students discover things through actual experience and structured problem-solving procedures and finally by means of interaction process among students and/or teachers (Kwon, 2002). In addition to this, many countries aiming at raising productive and innovative individuals in the 21st century made radical changes in their curriculum with the purpose of attaching much more importance to the quality of their mathematics education. Essentially, various countries such as UK, USA, Singapore, Finland and Australia believe that mathematics enables the improvement of thinking capacity as well as its use in real life in a critical, creative and logical manner. These countries have begun to pursue the goal of raising individuals with problem solving, reasoning, and connecting skills, as well as productive disposition and conceptual comprehension (Cai & Howson, 2013; Stacey, 2005). It is believed that among these skills, the problem solving is the key skill in the 21st century (Barell, 2010). Moreover, it is asserted that the teaching process can be implemented for students through case-based or problem-based learning methods by presenting authentic problem situations called as real-world problem (Barell, 2010; Barrows, 1986). Conceptual comprehension, on the other hand, is a process that takes place in coordination with such skills (Wu, 1999) and involves the understanding of concept, relation and operation (National Research Council, 2001). This new perspective shares many common points with the theoretical perspective of Realistic Mathematics Education (RME) (Bray & Tangney, 2016; Kwon, 2002) as discussed by Freudenthal (1973). RME focuses on mathematization that is actualized through the re-invention of formal mathematics. In RME, the student is informally guided by the teacher in a class-interaction, thus is encouraged to utilize self-developed models in order to solve and interpret the experientially real contextual problems (Dawkins, 2015; Gravemeijer, 1994, 1999; Treffers, 1991). In this regard, it is believed that RME contributes to the formation of the targeted skills such as problem solving, reasoning and connecting; as well as to conceptual comprehension (Borko & Putnam, 1996; Hadi, 2002). Teachers’ Pedagogical Knowledge (PK), which includes knowledge the teachers have on curriculum and teaching methodology as well as on how to teach them (Lianghuo, 2014), and Pedagogical Content Knowledge (PCK) that is referred to as “special amalgam of content and Yilmaz, Prospective mathematics teachers’ cognitive competencies on realistic mathematics education 19 pedagogy” (Shulman, 1987), are important determinants of instructional quality that impact students’ learning gains and motivational development (Baumert & Kunter, 2013). And also they have vital importance in gaining the targeted skills (Borko & Putnam, 1996). This situation causes many results, which could affect teacher education and causes major changes in prospective teachers' knowledge (Battista, 1994; Putnam & Borko, 2000; Szydlik, Szydlik, & Benson, 2003). However, teachers’ efforts to change their PK and practices are usually not reflected on their tendencies at skill-based applications in the targeted curriculum (Anderson, White, & Wong, 2012). Although, it is difficult to evaluate teachers’ PCK (Beswick & Goos, 2012), understanding the pedagogical theories underlying the radical changes about mathematics education and detecting effective teaching strategies are significant in terms of being able to professionally assessing teachers (Sowder, 2007). Furthermore, besides prospective teachers' knowledge of general and specific approaches; what is essentially important is whether they comprehend when, where, how, and why they should use these approaches, rather than their variety (Feiman-Nemser, 2001; Tsamir, 2008). However, many prospective teachers are not ready to use them the way they should (Herman & Gomez, 2009). Therefore, it is believed that it is important to know to what extent prospective teachers have cognitive competences about these approaches, in order to be able to choose the approach that will be used depending on place and time and to implement them in a manner that is suitable for the purpose. In a general sense, the study aims at identifying prospective teachers' cognitive competency regarding the approaches before moving onto practice from theory. In this regard, the study mainly focuses on RME in terms of the approach and the answer to the question of "How are the cognitive competencies of prospective mathematics teachers related to RME?" has been sought. Thus, the subproblems of "How do prospective mathematics teachers explain RME and its implementation? How do they interpret the similarities and differences with other approaches? How a contextual problem do they pose in compliance with the theoretical structure of RME?" have been discussed. In the below-mentioned theoretical framework, firstly the definition and primary elements of RME have been explained, and information related to its implementation has been presented. Subsequently, the criteria used in determining the cognitive competency and the levels of cognitive competency have also been explained. THEORETICAL FRAMEWORK Realistic Mathematics Education (RME) RME, which is based on the idea that mathematics is a human activity (Freudenthal, 1973) and the idea that student achieves formal mathematics knowledge by using his/her informal knowledge by means of re-inventing under the guidance of a teacher (Treffers, 1991) has a significant place in the studies conducted in the field of mathematical education (e.g. Barnes, 2004; Beswick, 2011; Bray & Tangney, 2016; Makonye, 2014; Rasmussen & King, 2000; Streefland, 1991). In this approach, the informal knowledge in real life is transformed into formal knowledge after being abstracted and is 20 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 17-44 again associated with real life as a result of mathematical implementations (De Lange, 1996). Transformation is believed to be achieved by contextual problems that are experientially real to the students (Gravemeijer, 1999). The process of conceptual mathematization in RME has been given in Figure 1. In this process, mathematical concepts start to develop from the real world and ends with the reflection of the solution back to the real world. Figure 1. Conceptual mathematization (De Lange, 1999) The contextual problems used in RME are mathematical problems presented in the real life situations that children are familiar with, through stories that are fictionalized from the real world. These problems can be a word problem, a game, a drawing, a newspaper clipping, a graph or the combinations of such elements. At the same time, a pure mathematical problem can also be a contextual problem. However, the main point here is to what extend the problem would fulfill the criteria of being experientially real or authentic; and thus would provide a concrete orientation towards a new concept/skill; and would also allow utilization of prior knowledge (De Corte, 1995; Doorman, Drijvers, Dekker, van den Heuvel-Panhuizen, de Lange, & Wijers; 2007; Gravemeijer, 1999). Gravemeijer (1994, 2001) emphasizes the necessity for three main elements while designing education in RME, which includes the following: 1. Guided reinvention through progressive mathematization 2. Didactical phenomenology 3. Self-developed or emergent models Guided reinvention is based on configuring and organizing problems in order to discover rules by revealing mathematical factors in a problem. This research, which is conducted with a strong intuitional component, is considered to be the discovery or reinvention of mathematical conception (De Lange, 1987; Freudenthal, 1973, 1991). In this process, the teacher should design the roadmap to enable students to learn correctly and should provide the students with the opportunity to experience a process that is similar to the discovery process of mathematicians (Gravemeijer, 1994; 2001). By doing so, students have the opportunity to obtain knowledge by themselves (Freudenthal, 1991; Yilmaz, Prospective mathematics teachers’ cognitive competencies on realistic mathematics education 21 Gravemeijer, 1999; Yackel & Cobb, 1996) and they mathematize the contextual problems by solving them (Treffers, 1987; 1991). Guided reinvention through progressive mathematization can be considered as a two-stage process: horizontal and vertical mathematization. Horizontal mathematization is where one uses informal strategies by schematization in order to define and solve contextual problems, in other words transforming real life problems into mathematical problems. Vertical mathematization is to abstract the conception in the world of symbols and solve the problem by adopting different models or find the relevant algorithm by using mathematical language in the light of informal strategies (Freudenthal, 1991; Gravemeijer, 1994; Treffers, 1987, 1991; Van den Heuvel-Panhuzen, 2003). In Figure 2, horizontal and vertical mathematization is described. Figure 2. Horizontal and vertical mathematizations (adapted from Gravemeijer, (1994)) Didactical phenomenology requires working with phenomenon that are meaningful to students in the process of learning mathematics, can be organized by students, are stimulating for the learning process and meet four functions including concept formation, model formation, applicability and practice (Gravemeijer, 1994, 2001; Treffers & Goffree, 1985). Self-developed or emergent model bridges informal knowledge of students with formal knowledge while solving a problem. At the beginning, the student develops a model, which gradually becomes a dynamic and holistic model compatible with his or her own mathematical thinking after generalizing and formalizing processes (Gravemeijer, 2001; Treffers, 1991). Thereby, at the end of this process, which is named as the transformation from 'model-of' to 'model-for', the student obtains a model that enables him/her to achieve mathematical reasoning (Gravemeijer, 1999). Cognitive Competency Cognition refers to the variables with respect to the kind and quantity of information, and the classification of relations among the variables of information (Kraiger, Ford, & Salas, 1993). 22 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 17-44 Cognitive domain in learning contains learnings, in which person's mental sides are in the foreground (Bloom, Engelhart, Furst, Hill, & Krathwohl, 1956; Bloom, 1994). Cognitive competency is a psychological construction which cannot be directly observed but can be inferred from the behavior or performance of an individual on content-relevant tasks (Wang, 1990). Researchers have asserted various taxonomies in order to evaluate this kind of learnings. Some of these taxonomies are Bloom’s Taxonomy (Bloom, et al. 1956), Revised Bloom Taxonomy (Anderson & Krathwohl, 2001), Barrett’s Taxonomy (Barrett, 1976), Pearson-Johnson Taxonomy (Pearson & Johnson, 1978), Webb’s Depth of Knowledge Levels (Webb, 1997, 1999) and Program for International Student Assessment (PISA)’s Competency Levels (OECD, 2003). Bloom’s Taxonomy among such taxonomies, is the first and most widely accepted classification (Granello, 1995) in the subject of cognitive abilities and educational objectives used in education. In a similar way, PISA is implemented and reported in many countries and regarded as a new approach in national and international evaluation (Sadler & Zeidler, 2009). Categories are classified in Bloom Taxonomy from simple to complex and from concrete to abstract acting as prerequisite for one another: Knowledge refers to a person's acts of remembering such as recognizing, expressing when asked, or repeating the characteristics of any object or event from his/her memory; Comprehension means interpreting, assimilating, and expressing the obtained targets without losing their meanings at knowledge level; Application, is when a person implements knowledge by solving the problem in a new situation by making use of his/her learning at knowledge and comprehension level; Analysis refers to cognitively differentiating among the items of a pattern or knowledge in terms of their relationships and organizations; Synthesis means bringing together and creating a whole of the items in such a way that they would bare characteristics such as innovation, originality, and creativity based on certain relationships and rules. Evaluation is when a person decides whether the products created are competent enough by stating justifications (Bloom, et al. 1956; Bloom, 1994; Krathwohl, Bloom, & Bertram, 1973; Krathwohl, 2002). In PISA, OECD (2003) has determined three levels for detecting the competency levels of students in order to define their cognitive activities. These levels include reproduction, connection, and reflection. In reproduction level, already known contents, previously used knowledge, standard algorithms, and elementary formula are used and basic operations are conducted. In connection level, less commonly known contents are interpreted and explained; systems, representation of which are different, are obtained through association; and necessary strategies are chosen and used for extraordinary problem solving. In reflection level, in which comprehension is required; reflection, creativity, and knowledge necessary for solving complex problems are associated; observed results are generalized and justified; and abstraction is carried out. The association between Bloom’s Taxonomy and PISA’s Competency Levels is indicated in Figure 3. Yilmaz, Prospective mathematics teachers’ cognitive competencies on realistic mathematics education 23 Figure 3. The relations of cognitive competency (OECD, 2003; Bloom, et al. 1956) METHOD This research is a qualitative study that has been conducted with the purpose of researching the cognitive competencies of prospective mathematics teachers related to the approaches used in mathematics education. Research design is a case study, which has been evaluated as a case of RME. This study has been conducted in the faculty of education of a state university in the Black Sea region of Turkey with 32 prospective mathematics teachers, 20 of which are female and 12 of which are male. These prospective mathematics teachers are all senior students studying their fourth year of their 5-year education plan. The majority of these prospective mathematics teachers have successfully completed fundamental education courses such as introduction to education, educational psychology, guidance, theories and approaches of learning and teaching, curriculum development and instruction besides pure mathematical courses. All participants were informed about the process of the research. They volunteered to attend the research and gave the researcher the permission to use the data acquired from their interpretations and their posed problems in the manuscript. The prospective mathematics teachers had been taking Methods of Teaching Mathematics lesson during the time this study was conducted. In the lesson instructed by the researcher and lasted for four hours a week, some approaches about mathematical education (e.g. Ausubel’s meaningful learning approach (Ausubel, 1963), Freudenthal’s RME approach (Freudenthal, 1973), Bruner’s discovery learning approach (Bruner, 1961) have been elaborately examined and their implementation in teaching has been interpreted and discussed. In this research, it has been aimed to reveal cognitive status of prospective teachers about RME at the end of their 12-hour experience. In this study, two sessions that were suggested by Selter (1997, 2001) and grounded on the study of Zulkardi (2002) about how preservice teachers have developed the RME learning environment, were taken into consideration: understanding the new approach by providing a theoretical overview and by actually doing mathematics-the mathematical component, designing 24 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 17-44 instructional materials-the didactical component. Thus, cognitive competencies of prospective teachers in the theoretical sense were focused on and the components of the sessions were discussed as “overview of RME theory, doing mathematics and designing contextual problems” (Zulkardi, 2002). During ‘overview of RME theory’ session, firstly, the prospective teachers were provided with theoretical information on RME, and then they conducted activities about the approach. Afterwards, the implementation process and effectiveness of the activities were discussed. The prospective teachers stated their comments on the elements of the transition of RME application from real life to mathematics such as the characteristics of contextual problems applicable to horizontal mathematization, the modelings created during its solution, and roles of teacher and students. Then, they continued to discuss about vertical mathematization. During doing mathematics session, prospective teachers were treated as learners while the researcher performed as a teacher and it was aimed that they learn how to teach using with RME. In Appendix, there are some examples of the contextual problems used in the applied activities during these sessions (Altun, 2011; Fauzan, 2002; Feijs, 2005; Wubbels, Korthagen, & Broekman, 1997). During the second session, for instance, Feijs’s (2005) Grand Canyon Problem (problem 4 in appendix) developed in line with the construction of RME learning environment was implemented with prospective teachers. The prospective teachers formed 3- person student groups in real classroom environment and two of the prospective teachers from the group sat on one desk, whereas the other one sat on the desk that is parallel to the other desk. On the paper, which was hung down these parallel desks in order to create an imaginary river in the gap between the two desks, the points where the vision lines of these two people have been drawn with the help of a third person and canyon tables activity have been implemented. In the practice process, discussions were held on how they identify situations that can be seen or not seen by a person and how they perceive the vision lines from these situations. After the implementation, the situation of treating steepness of vision line as phenomena was examined according to the didactical phenomenology criterion where the main idea is familiarity and appeal for them. At this point, the mathematical content, which requires the ratio of a right triangle formed by an angle, was associated with the concept of steepness, and the state of abstraction resulting in the concept of tangent was also discussed. In addition, the self-developed or emergent models’ criterion of the approach was also discussed, considering the models that they can suggest for solution of the problem. During designing contextual problem session, it was aimed to relate the context and the concept in a learning environment. In this manner, prospective teachers could learn how to design contextual problems to use in RME. So, they were asked to pose a contextual problem, for which they were given an adequate period of time and which would enable them to conduct horizontal mathematization. Yilmaz, Prospective mathematics teachers’ cognitive competencies on realistic mathematics education 25 After these sessions, they were asked some open-ended questions about RME and the problems they posed; moreover, they were expected to write down their opinions about these questions. The questions asked to the prospective teachers are as follows: 1. What is RME? Please explain. 2. How is RME applied? How is this approach different from the other approaches you have learned? Please explain. 3. Pose a (contextual) problem that you could apply in RME, and explain for which conception's mathematization, this problem will be used. Please, evaluate the approach, on which the problem you posed will be used, by considering different approaches. The written answers received from the prospective teachers and their contextual problems were analyzed, through the descriptive analysis method so as to determine their cognitive competencies about RME. In descriptive analysis, primarily a framework is established for analysis; the data are processed in accordance with thematic framework; findings are described and interpreted (Patton, 1990). Due to that reason; PISA Competency Clusters (OECD, 2003) and Cognitive Domain Taxonomy (Bloom, et al. 1956), which are displayed in Figure 3 below in order to reveal the cognitive competencies of prospective mathematics teachers, were used as the foundation for the data analysis. In this process, the relevant categories and data related to sub-categories that belong to each category have been processed and common themes were determined (Creswell, 1998; Patton, 1990). In the process of analyzing the data, the data collected from the first part of the question asked to the prospective teachers have been processed in the reproduction-knowledge category; the data collected from the second part of the first question have been processed in the reproduction-comprehension category; the data obtained from the first part of the second question have been processed in the connection-application category; the data collected from the second part of the second question have been processed in the connection-analysis category; the data retrieved from the first part of the third question have been processed in the reflection-synthesis category; and the data collected from the second part of the third question have been processed in the reflection-evaluation category. Furthermore, the sub-categories within each category have been composed as adequate ones in and deficient (or incorrect) ones in the related category. In order to ensure the reliability of the study, two researchers, who have completed their PhD in the field of mathematics education and are experts in qualitative studies, firstly individually analyzed the data then it was discussed among the members until they reached a consensus on overall categories, sub-categories and themes. So, the use of multiple experts, as well as the use of their evaluations has led to conformability of the data. The credibility has been increased through the data obtained from the contextual problems consisting of the written answers given by the participants to the open-ended questions. For transferability, in order to help applying the findings to the other contexts, the description of the context was delivered in a clear and detailed manner. In other words, 26 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 17-44 the participants, the approaches examined and the activities conducted in the classroom were elaborated. Besides, in order to ensure that the results can be conveyed into similar media, the obtained findings have been supported with the quotations and detailed descriptions have been made (Berg, 2001; Lincoln & Guba, 1985; Yıldırım & Şimşek, 2005). RESULTS AND DISCUSSION The contextual problems and answers given by 32 prospective teachers participating in the study were examined and evaluated; furthermore, common themes were created based on their cognitive competencies. The categories and sub-categories created for determining common themes are displayed in Table 1. Moreover, frequencies of categories and sub-categories are also demonstrated in that table. The findings on cognitive competencies of the prospective teachers have been explained within each category by directly quoting them. Table 1. Frequencies and percentages of categories and sub-categories of prospective teachers' competencies Category Knowledge Reproduction Sub-category Having knowledge about RME 87.5 4 12.5 27 84.375 5 15.625 26 81.25 6 18.75 18 56.25 Deficient or incorrect analysis of RME 14 Posing an original (or an unfamiliar) contextual 11 problem Deficient or incorrect posing about contextual 21 problem 43.75 Deficient or incorrect knowledge about RME Applying knowledge about RME Deficient or incorrect application about RME Connection Analysis Synthesis Reflection Evaluation % 28 Making explanation or interpretation about RME Comprehension Deficient or incorrect comprehension about RME Application Frequency Analyzing knowledge about RME Making valuation by passing a judgment No answer or deficient evaluation 34.375 65.625 5 15.625 27 84.375 Reproduction Knowledge In the first part of the 1st question, prospective teachers were asked to answer to define RME approach. Thereby, it was aimed to reveal the level of knowledge of the prospective teachers. The 28 (%-87.5) of 32 prospective teachers have correctly answered to the question 'what is RME?'. An Yilmaz, Prospective mathematics teachers’ cognitive competencies on realistic mathematics education 27 example from the answers given by the participants is stated below: “We come across with mathematics in many parts of our everyday lives and we have to do the math. RME starts with this idea. In RME, a problem about real life are presented to students before teaching them the subject, and students solve this problem by making use of their previous knowledge through their own models. Thus, the student achieves a mathematical expression” (3rd PT). It was observed that 4 (12.5%) of the participants have deficiently or incorrectly defined RME. There were no participants, who did not answer to the question. One example from one of the prospective teachers, who deficiently answered to the question, is as follows: “Enabling them comprehend mathematics by giving them examples from life. How can we make life easier by using mathematics in our life. This is based on this” (13th PT). The prospective teacher, who deficiently answered the question, mentioned the association of the approach with real life and said that mathematics could be learned through real life examples. However, the fact that she did not mention contextual problems, which are considered to be necessary for the transition from real life to mathematics and is the focus of the approach, demonstrates that the prospective teacher does not have adequate knowledge about the approach. The answer received from one of the prospective teachers, who incorrectly answered the question, is as follows: “RME is an approach that could help us solve the problems we may face in life. It is teaching students a subject by giving examples of the events we may experience in our lives about the subject” (31th PT). Even though the prospective teacher talks about real life problems, his/her '...teaching students a subject by giving examples of events from life...' statement indicates that he/she has inaccurate knowledge on the roles of the student and teacher and also on the application of the approach. Recalling the knowledge is vital in terms of making learning meaningful and solving a problem, thereby this knowledge can be used in more complex assignments (Anderson & Krathwohl, 2001). Moreover, Ayer (1936) discusses that when something is defined, it should contain the definition itself or its synonymous or expressions with equivalent meanings. Examining the answers received by the prospective teachers; majority of the prospective teachers have knowledge on RME. Comprehension In the 1st question, which was asked in order to understand whether the prospective teachers comprehended RME or not, they were expected to explain the approach. Their explanations suggest that 27 (84.375%) of the participants have correctly comprehended RME. One of the teachers, who correctly comprehended RME, explained it as the following: 28 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 17-44 “The mathematical conception, which is intended to be taught to the student, is taught through a problem that could be experienced in real life. The student tries to solve this problem, which is created by preparing the necessary setting, through his/her own knowledge and modelings. In a sense, how mathematicians discover mathematics in the events, which they come across with in real life, is experienced by the student. They are introduced to mathematical knowledge by solving the problem. Thus, this problem should be compatible with real life and should enable students reach mathematical conceptions” (24th PT). The explanations suggest that 5 (15.625%) of the prospective teachers didn’t comprehend RME. One of these prospective teachers (16th PT) could not adequately comprehend it even though he/she correctly defined it. The other four prospective teachers are in the category of those who incorrectly or deficiently answered the question at the knowledge level. The prospective teacher, who could not comprehend the conception, explained it as the following: “In this approach, real life problems are asked to the student. We solve the problem and teach them the subject by saying this problem is its application in mathematics” (16th PT). The statements of the prospective teacher, suggest that it should be begin with real life problems. According to the approach, the problem should be solved through a model by the student himself/herself by discussing it with his/her group friends. However, his/her statements about the teachers' solving the problem for students and conducting the mathematization indicate that the prospective teacher incorrectly comprehended the approach. If we consider explaining as conveying the meaning to the other person and interpretation as realizing broad understandings (OECD, 2009); it is possible to say that both actions refer to comprehension. Thus, the answers given via explanations or interpretations indicate that the majority of the prospective teachers could significantly comprehend the approach. Connection Application Examining the answers received by the prospective teachers for the first part of the 2nd question about the application of RME, 26 (81.25%) of them correctly answered to the question about the application of this approach. One of the prospective teachers explains the application of approach as follows: “Firstly, a real life problem, which could enable student to reach a conception at the end, is given to the student. The student tries to actively solve this problem through the model he/she desires by discussing. The teacher helps them as a guide. At the end, the Yilmaz, Prospective mathematics teachers’ cognitive competencies on realistic mathematics education 29 student reaches mathematics through the model he/she created and is introduced to the desired conceptions. This part, in other words transition from real life to mathematics is called horizontal mathematization. Afterwards, the operations conduced within mathematics are called vertical mathematization. For example, everybody solved the snake problem we applied in classroom through their own desired models and discussion; and everybody was introduced to geometrical progression without even realizing. While doing so, we discussed and made associations in the classroom; and the example was modelled through expressions such as exponential numbers or common factors. This part was horizontal mathematization. Then we continued with vertical mathematization through conducting operations about geometrical progression. What is important in the approach is that students create their own models, discover mathematics through teachers' guidance, and obtain knowledge” (18th PT). The above-quoted prospective teacher pointed out the three main factors, which are required in RME applications and stated by Gravemeijer (1994, 2001) including reinvention through progressive mathematization, didactical phenomenology, and self-developed or emergent models. The explanations suggest that he/she correctly evaluated the RME applications conducted in classroom; and coordinated between the approach and its application. It was observed that 6 (18.75%) of prospective teachers incorrectly evaluated the application of RME. 5 of these prospective teachers fall under the category of those who incorrectly comprehended the approach; while 1 of them is in the category of those who correctly comprehended the approach. The prospective teacher, who correctly comprehended the approach but made incorrect statements about its application, made the following statement: “The student is asked real life problems. Afterwards, the student tries to solve this. For example, in the problem about Canyon, various lines were drawn in order to see the river. From what perspective these lines should be coming and the distance between the lines and the river were discussed and tackled. We thought of their ratios and associated them with tangent conception. In conclusion, the teacher makes associations in order to enable the student to solve the problem; and the student solves the problem by bringing them together” (30th PT). In the process of transition from real life to mathematics which is called mathematization, full process of transmitting to mathematical model from original problem situation is referred as modelling (Blum & Niss, 1991). However, in RME, modelling should be conducted by the student (Gravemeijer, 2001; Treffers, 1991). The statements made by the prospective teacher, which expressed that associating should be done by the teacher while solving the problem, suggest that 30 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 17-44 he/she is unable (or incorrectly) coordinate between the approach and its in-class application. Analysis In the second question asked to prospective teachers, they were asked to compare RME to other approaches besides discussing about its application. 18 (56.25%) of the prospective teachers correctly analyzed the approach and evaluated its similarities and differences with other approaches. Some of the comments they have made are as follows: “The student reaches at a mathematical conclusion through his/her own knowledge by himself/herself. Since student is in the center of learning and the teacher guides; it is similar to discovery learning. However, in discovery learning, it is not necessary to begin with real life problems. In other words, a student can learn through discovery in a setting, which could enable generalizing a conception, and conduct applications after discovering the conceptions. In fact, in general, this application and constructivist approaches are quite different from traditional education. All of them are focused on the process and student-centered. The students themselves make sense of it. The main difference in RME is the environment is a stimulant and real life problems are what we begin with” (3rd PT). Their explanations suggest that 14 (43.75%) of the prospective teachers are inadequate at analyzing the approach, thus their comparisons of approach to other approaches are sometimes incorrect and sometimes deficient. 6 of these prospective teachers are those, who already incorrectly applied the approach, and other 8 are among those who correctly applied it. One of the students, whose comparisons of other approaches are deficiently, explains it as: “I think there is not a huge difference between them. All of them are student-centered. Some of them create a problem while some of them do the same through an activity instead of a problem. In conclusion, they all end the same” (14th PT). Constructivism, which deals with how knowledge emerges and is based on cognitive psychology, takes the relationships among complex problem solving and cognitive structures and behaviours as basis (Noddings, 1990) and is based on the idea that knowledge is not directly taken from the teacher but structured actively by the learner (Lesh, Doerr, Carmona, & Hjalmarson, 2003; Von Glasersfeld, 1987). RME fundamentally share similarities with various learning and teaching theories of constructivism, which can be considered to be a theory of knowledge. In discovery learning, which is founded by Bruner (1961) and is one of the constructivist approaches, the learner is not informed of target knowledge or the conception, and learning setting is prepared by ensuring proper circumstances (Alfieri, Brooks, Aldrich, & Tenenbaum, 2011). Learning occurs through the assumptions made by organizing patterns and examples, which are based on conceptualizations and generalizations from simple to complex; and conducting researches in an intuitive and systematic Yilmaz, Prospective mathematics teachers’ cognitive competencies on realistic mathematics education 31 manner (Jacobsen, Eggen, & Kauchak, 1993) furthermore, in this process teacher's duty is to guide students (Hammer, 1997; Svinicki, 1998). In RME, even though the student is not informed of target knowledge; basic differences in main principles and learning setting play a major role in differentiating approaches form one another. Primarily, basic differences in RME are based on the idea that human knowledge structures the knowledge and mathematical intuitions and procedures are invented and not discovered (Freudenthal, 1973). Thus, even though 'guided reinvention', which is one of the main principles of RME, stresses on guidance in the process; it is still different from the other approaches from that perspective. Moreover, the meaning of guidance in RME refers to the facilitative role of teacher for reinvention while enabling scaffolding instead of making explanations to the students (Hamzah & Bustang, 2014). Essentially, RME and social constructivism, which focuses on the impact of the setting, are concerned with whether student is active; creativity; problem solving; reality of contexts; mathematical reality; and emergence of mathematical objects. However, in RME it is necessary to not only motivate students with everyday life contexts; but also to associate with experimentally real contexts and use them as the starting point for progressive mathematization (Gravemeijer, 2001). Examining the views of the prospective teachers on differences of RME approach from other approaches; the fact that more than half of them mentioned these basic differences suggests that they can analyze at connection-building level in terms of cognitive competency. Reflection Synthesis The prospective teachers were asked to pose a contextual problem (4th question) that they could use in RME, in order elaborately examine the cognitive status of the prospective teachers about RME. Thereby, above the theoretical and practical knowledge they have about the approach; it was aimed to find out how they bring this knowledge together in a different manner and integrate it with associations. Examining the problem they posed, 11 (34.35%) of the prospective teachers produced a contextual problem, in which they could apply the approach; whereas 21 (65.625%) of them did not pose such a problem. Some of the contextual problems are stated below: When these problems are examined, it becomes visible that the students can solve these problems and mathematize new conceptions through the models they created. “Ayşe will meet her friends and go to a movie this weekend. However, she cannot decide what to wear. When she looks at her closet she sees alternatively blouses, skirts, pairs of shoes, and socks with different colors. How many different outfit combinations can she create by doing these pieces in her closet?” Conception: Counting rule-factorial (2nd PT). 32 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 17-44 “Uncle Ahmet wants to cover his rectangular garage with square tiles. He measures the size of his garage and goes to buy the tiles from a construction market. However, Uncle Ahmet believes that the less number of tiles, the less it will cost him. For that job, what dimension of tiles Uncle Ahmet should buy”? Conception: The greatest common factor (6th PT). “A medical aid helicopter taking off from the emergency aid base brings aid to those injured in a traffic accident. If the pilot knows the distance from emergency aid base to the hospital, the distance from the scene of accident to the hospital and the angle between them, how can he find the distance between the emergency aid base and the scene of accident?” Conception: Cosine Law (12th PT). The two problems stated below, which were posed on the mathematization of proportion conception by the prospective teachers, are inappropriate for the approach. Although these problems exist in real life, the modeling that will be conducted in the process of solution will not help the mathematization of any conception. Moreover, it is believed that these problems can be an exercise question, which could be applied on taught proportion-ratio conception. “Ezgi's watch loses time 5 minutes every hour. At 10.00 o'clock Ezgi agreed to meet her friend Mehmet at 17.00. What time Ezgi should be there according to her time, in order to meet Mehmet punctually at the time they agreed to meet?” (26th PT). “6 workers worked at a construction and completed the building in 20 days. These workers want to make a planning for their next job. According to this, if 6 workers work together again; in how many days can they finish 5 buildings?” (8th PT). Similarly, even though the problem stated below can exist in real life; it cannot be qualified as a contextual problem. “How many pieces can we cut the cake by 5 moves?” (29th PT). It is believed that the below-mentioned problem cannot be evaluated as a contextual problem, which could serve as a real life problem and help teaching a conception; it rather aims at achieving a mathematical relation by making students generalize through different cases. “Four children play with one hoop each. Later, the children pile hoops over one another and start to examine their junction points. What is the maximum number of junction points for these hoops?” (23th PT). Posing a problem is a significant component for problem solving and good mathematical problems can be posed by good mathematics teachers (Kilpatrick, 1987). The person in the process of posing a problem is actively engaged in challenging situations that involve them in exploring, Yilmaz, Prospective mathematics teachers’ cognitive competencies on realistic mathematics education 33 questioning, constructing, and refining mathematical ideas and relationships (English, 2003). In light of this view, it is believed that most of the prospective teachers have carried out the activities mentioned in the process of problem posing. However, a problem should point out the types of realistic thinking, which characterizes out-of-classroom problem solving, in order to be realistic (Verschaffel, Greer, & De Corte, 2000). Also, it should bring out a variety of mathematical interpretations and solution strategies which serve as a basis for progression to a more formal and sophisticated mathematics, and should support students’ mathematisation process (Widjaja, 2013). For example, even though a problem such as “Do the medians of a triangle intersect at a single point” is concrete; it is still far from everyday life problems, thus cannot be considered realistic. Hence, all contextual problems are not realistic. In order to evaluate a problem as realistic, it has to be real or experienced by the person in an interesting manner (Wubbels, Korthagen, & Broekman, 1997). Similarly, if a person wants to pose a problem about a farm, using sentences such as 'imagine a cow in a sphere form' will not make the problem realistic (Greer, 1997). Taking into account of all of these ideas, it is observed that more than half of the problems posed by the prospective teachers are not qualified enough to serve for RME and only 34% of them fit for the purpose. Thus, it can be accepted that only the students in this percentage have cognitively reached the synthesis stage at reflection-construction level. Evaluation In the 4th question asked to the prospective teachers, they were expected to evaluate the application of the problem they posed and interpret it according to the other approaches. The 5 (15.625%) of the prospective teachers, who posed a relevant problem, have carried out an evaluation; whereas the other 27 (84.375%) of them failed to either pose a relevant problem or to carry out an evaluation. The evaluation made by a prospective teacher, who is believed to have posed a relevant problem to RME is as follows: “I aimed to enable the students to achieve the greatest common factor in my problem. The teacher could try to make student comprehend the conception through explaining its meaning. In other words, if it were given through presentation, the student would be provided with prepared knowledge. Furthermore, since the student would not be active; he/she may be bored and mathematics would be far from real life for him/her. I believe that the things a person does or achieves on his/her own is always more valuable and not forgotten. Therefore, for a student to solve a problem, a real problem, and to meet a new conception will be more meaningful for him/her. We could also give this knowledge through discovering numbers and the correlations among factors, without using a problem. This may be effective, too. However, I believe that it will be more permanent for students to reach at knowledge through real life events, discussion and creating their own models” (6th PT). 34 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 17-44 According to Hiebert and Carpenter (1992), understanding is defined as making connections between ideas, facts or procedures, and occurs through recognizing relationships between pieces of knowledge. Consolidated knowledge enables the use of this knowledge in various cases in a proper and confident manner (Wubbels, Korthagen, & Broekman; 1997). Also, if knowledge is consolidated; the evaluations about this knowledge will be conducted in a meaningful manner. In that sense, it can be concluded that the evaluations of prospective teachers through the problems they posed are insufficient. CONCLUSION Enabling students to learn mathematics in a meaningful manner and to acquire various targeted skills during the process can be only made possible by placing education of mathematics teachers in the center. Therefore, it is of vital importance for prospective teachers to be able to prepare appropriate learning environments and to correctly plan the roadmaps for that purpose (Brousseau, 1987; Richards, 1991). In order to achieve that, the prospective teachers must have enough knowledge on the key principles and fundamental philosophy of the approach that they plan on using while preparing such environments, moreover, they must also be able to implement and interpret instructional activities (Gravemeijer & Cobb, 2006). In this study, where prospective teachers’ cognitive structures about the approaches have been examined, RME has been the focused approach and prospective teachers’ cognitive structures about this approach have been evaluated. The study suggests that the majority of the prospective teachers have theoretical knowledge about RME and can generally interpret that knowledge, however, their ability to associate that knowledge with other approaches falls by half. Moreover, the prospective teachers’ skills fell even shorter when they were asked to pose a contextual problem suitable with RME, which was asked of them in order to see their high cognitive competencies. Even though it is necessary to place mathematical connections in the relevant social situations for achieving meaningful learning in mathematics; creating authentic activities is naturally quite difficult (Ainley, Pratt, & Hansen, 2006) and also designing lesson materials, especially finding real life examples that match with the mathematics concepts to be taught is difficult (Zulkadri, 2002). Also, preparing a learning environment that allows for reinvention process, is quite complicated (Gravemeijer, 2008). Furthermore, some of the reasons why these problems emerged may be due to the fact that they do not have adequate contextual knowledge about the nature of some of the conceptions they were supposed to learn; or they could not detach from the constructions in the learning settings, in which they were raised. Also, as Widjaja (2008) mentioned, their prior knowledge about the concept and the nature of their knowledge may cause this result. For that reason, prospective teachers should be trained in a way that reflects what is expected from their teachings (Gravemeijer, 2008). Since this study aims to reveal the cognitive competency of prospective teachers on RME; it is believed that the obtained results suggest ideas on the construction about RME only in the minds of Yilmaz, Prospective mathematics teachers’ cognitive competencies on realistic mathematics education 35 prospective teachers. Hence, reflections of prospective teachers on RME can be conducted through a phenomenological study so as to obtain more elaborated knowledge. Moreover, in the school experience and school practice stages; the application processes of prospective teachers on RME can be examined and in the light of the retrieved results, necessary adaptations can be conducted for real classroom situations. ACKNOWLEDGMENTS I would like to thank the prospective teachers for their voluntary participation. REFERENCES Ainley, J., Pratt, D., & Hansen, A. (2006). Connecting engagement and focus in pedagogic task design. British Educational Research Journal, 32(1), 23-38. https://doi.org/10.1080/01411920500401971. Alfieri, L., Brooks, P.J., Aldrich, N.J., & Tenenbaum, H.R. (2011). Does discovery-based instruction enhance learning? Journal of Educational Psychology, 103(1), 1-18. https://doi.org/10.1037/a0021017. Altun, M. (2011). Eğitim fakülteleri ve matematik öğretmenleri için liselerde matematik öğretimi (Mathematics Education in High Schools). Bursa: Alfa Akademi. Anderson, L.W., & Krathwohl, D.R. (2001). (Eds.) Taxonomy for learning, teaching, and assessing: A revision of Bloom’s taxonomy of educational objectives. Needham Heights, MA: Allyn & Bacon. Anderson, J., White, P., & Wong, M. (2012). Mathematics curriculum in the schooling years. In B. Perry, T. Lowrie, T. Logan, A. MacDonald, & J. Greenlees (Eds.), Research in mathematics education in Australasia 2008–2011 (pp. 219–244). Rotterdam: Sense Publishers. Aubrey, K., & Riley, A. (2016). Understanding and using educational theories. London: Sage. Ausubel, D.P. (1963). The psychology of meaningful verbal learning. New York: Gruene and Stratton. Ayer, A.J. (1936). Language, truth and logic. Oxford: Oxford University Press. Barell, J. (2010). Problem-based learning: The foundation of 21st century skills. In J. Bellance & R. Brandt R (Eds.), 21 st century skills (pp. 174-199). Bloomington: Solution Tree Press. Barnes, H. (2004). Realistic mathematics education: Eliciting alternative mathematical conceptions of learners. African Journal of Research in Mathematics, Science and Technology Education, 8(1), 53-64. https://doi.org/10.1080/10288457.2004.10740560. Barrett, T.C. (1976). Taxonomy of reading comprehension. In R. Smith & T. C. Barrett (Eds.). Teaching reading in the middle grades. Reading, MA: Addison-Wesley Publishing Co. Barrows, H.S. (1986). A taxonomy of problem‐based learning methods. Medical Education, 20(6), 481-486. https://doi.org/10.1111/j.1365-2923.1986.tb01386.x. Battista, M.T. (1994). Teacher beliefs and the reform movement in mathematics education. The Phi Delta Kappan, 75(6), 462-470. Baumert, J., & Kunter, M. (2013). The effect of content knowledge and pedagogical content knowledge on instructional quality and student achievement. In Cognitive activation in the 36 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 17-44 mathematics classroom and professional competence of teachers (pp. 175-205). Boston: Springer. Berg, B.L. (2001). Qualitative research methods for the social sciences (4th ed.). Boston: Allyn & Bacon. Beswick, K. (2011). Putting context in context: An examination of the evidence for the benefits of ‘contextualised’tasks. International Journal of Science and Mathematics Education, 9(2), 367390. http://dx.doi.org/10.1007/s10763-011-9323-y. Beswick, K., & Goos, M. (2012). Measuring pre-service primary teachers' knowledge for teaching mathematics. Mathematics Teacher Education and Development, 14(2), 70-90. Bloom, B.S., Engelhart, M.D., Furst, E.J., Hill, W.H., & Krathwohl, D.R. (1956). Taxonomy of educational objectives: The classification of educational goals. Handbook 1: Cognitive domain. New York: David McKay. Bloom, B.S. (1994). Reflections on the development and use of the taxonomy. In L. W. Anderson & L. A. Sosniak (Eds.). Bloom's taxonomy: A forty-year retrospective (1-61). Chicago: University of Chicago Press. Blum, W., & Niss, M. (1991). Applied mathematical problem solving, modelling, applications, and links to other subjects-state: trends and issues in mathematics instruction. Educational Studies in Mathematics, 22(1), 37-68. https://doi.org/10.1007/BF00302716. Borko, H., & Putnam, R. (1996). Learning to teach. In D. C. Berliner, R. C. Calfee (Eds.). Handbook of educational psychology (673-708). New York: Simon & Schuster Macmillan. Borasi, R., & Fonzi, J. (2002). Professional development that supports school mathematics reform. Foundations series of monographs for professionals in science, mathematics and technology education. Arlington, VA: National Science Foundation. Bray, A., & Tangney, B. (2016). Enhancing student engagement through the affordances of mobile technology: a 21st century learning perspective on Realistic Mathematics Education. Mathematics Education Research Journal, 28(1), 173-197. https://doi.org/10.1007/s13394-015-0158-7. Brousseau, G. (1987). Les differents roles du maitre. Colloque des P.E.N. Angers. In S. A. Martin (Ed.). Reconstructing mathematics pedagogy from a constructivist perspective. Washington, DC: National Science Foundation. Bruner, J.S. (1961). The act of discovery. Harvard Educational Review, 31, 21-32. Cai, J., & Howson, G. (2013). Toward an international mathematics curriculum. In M. A. K. Clements, A. J. Bishop, C. Keitel, J. Kilpatrick, & F. K. S. Leung (Eds.), Third international handbook of mathematics education (pp. 949–974). New York: Springer. Creswell, J.W. (1998). Qualitative inquiry and research design: Choosing among five traditions. Thousands Oaks, CA: Sage. Davis, R.B. (1990). Discovery Learning and Constructivism. Journal for Research in Mathematics Education: Monograph (4): Constructivist Views on the Teaching and Learning of Mathematics, (pp. 93-106+195-21). Association Drive, Reston, VA 22091. Dawkins, P.C. (2015). Explication as a lens for the formalization of mathematical theory through guided reinvention. The Journal of Mathematical Behavior, 37, 63-82. Yilmaz, Prospective mathematics teachers’ cognitive competencies on realistic mathematics education 37 De Corte, E. (1995). Fostering cognitive growth: A perspective from research on mathematics learning and instruction. Educational Psychologist, 30(1), 37-46. https://doi.org/10.1207/s15326985ep3001_4. De Lange, J. (1987). Mathematics, insight, and meaning. The Netherlands, Utrecht: OW&OC. De Lange, J. (1996). Using and applying mathematics in education. In A. J. Bishop et al. (Eds.). International handbook of mathematics education (pp. 49-97). The Netherlands: Kluwer Academic Publishers. Doorman, M., Drijvers, P., Dekker, T., van den Heuvel-Panhuizen, M., de Lange, J., & Wijers, M. (2007). Problem solving as a challenge for mathematics education in the Netherlands. ZDM, 39(5-6), 405-418. https://doi.org/10.1007/s11858-007-0043-2. Dreyfus, T., & Tsamir, P. (2004). Ben's consolidation of knowledge structures about infinite sets. The Journal of Mathematical Behavior, 23(3), 271-300. https://doi.org/10.1016/j.jmathb.2004.06.002. Driscoll, M.P. (2005). Psychology of learning for instruction. (3th Ed.) Boston, MA: Pearson. English, L.D. (1995). Cognitive psychology and mathematics education. In L. D. English, G. S. Halford (Eds.). Mathematics education: Models and processes (pp. 1-20). Lawrence Erlbaum Associates, Mahwah: NJ. English, L.D. (2003). Problem posing in the elementary curriculum. In F. Lester, R. Charles (Eds.). Teaching mathematics through problem solving. Reston, Virginia: National Council of Teachers of Mathematics. Ertmer, P.A., & Newby, T.J. (1993). Behaviorism, cognitivism, constructivism: Comparing critical features from an instructional design perspective. Performance Improvement Quarterly, 6(4), 50-72. https://doi.org/10.1111/j.1937-8327.1993.tb00605.x Fauzan, A. (2002). Applying Realistic Mathematics Education (RME) in teaching geometry in Indonesian primary schools. Unpublished Doctoral Dissertation. Enschede: University of Twente. Feijs, E. (2005). Constructing a learning environment that promotes reinvention. In R. Nemirovsky, A. S. Rosebery, J. Solomon, & B. Warren (Eds.). Everyday matters in science and mathematics: Studies of complex classroom events. Mahwah, NJ: Lawrence Erlbaum. Feiman-Nemser, S. (2001). From preparation to practice: Designing a continuum to strengthen and sustain teaching. Teachers College Record, 103(6), 1013-1055. Freudenthal, H. (1973). Mathematics as an educational task. Dordrecht: Reidel. Freudenthal, H. (1991). Revisiting mathematics education. Dordrecht: Kluwer Academic. Granello, D. H. (1995). Encouraging the cognitive development of supervisees: Using Bloom's taxonomy in supervision. Counselor Education and Supervision, 40(1), 31-46. https://doi.org/10.1002/j.1556-6978.2000.tb01797.x. Granello, D. H. (2001). Promoting cognitive complexity in graduate written work: Using Bloom’s taxonomy as a pedagogical tool to improve literature reviews. Counselor Education and Supervision, 40, 292-307. https://doi.org/10.1002/j.1556-6978.2001.tb01261.x. Gravemeijer, K. (1999). How emergent models may foster the constitution of formal mathematics. Mathematical Thinking and Learning, 1(2), 155-177. https://doi.org/10.1207/s15327833mtl0102_4. 38 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 17-44 Gravemeijer, K. (1994). Developing realistic mathematics education. Utrecht: Utrecht, The Netherlands: CD-β Press. Gravemeijer, K.P.E. (2001). Developmental research: Fostering a dialectic relation between theory and practice. In J. Anghileri (Ed.). Principles and practice in arithmetic teaching (147-161). London: Open University Press. Gravemeijer, K.P.E., & Cobb, P. (2006). Design research from a learning design perspective. In J.Akker, K. Gravemeijer, S. McKenney, & N. Nieveen (Eds.), Educational design research (pp. 45–85). London: Routledge, Taylor Francis Group. Gravemeijer, K. (2008). RME theory and mathematics teacher education. International handbook of mathematics teacher education, 1, 283-302. Greer, B. (1997). Modelling reality in mathematics classrooms: The case of word problems. Learning and Instruction, 7(4), 293-307. https://doi.org/10.1016/S0959-4752(97)00006-6. Hadi, S. (2002). Effective teacher professional development for the implementation of realistic mathematics education in Indonesia. Unpublished Doctoral Dissertation. University of Twente. Hammer, D. (1997). Discovery learning and discovery teaching. Cognition and Instruction, 15(4), 485-529. https://doi.org/10.1207/s1532690xci1504_2. Hamzah, U., & Bustang, B. (2014). Constructivism versus cognitive load theory: in search for an effective mathematics teaching. In Proceeding of International Conference on Research, Implementation and Education of Mathematics and Sciences, May 18-20, 2014. Yogyakarta State University. Herman, P., & Gomez, L.M. (2009). Taking guided learning theory to school. In S. Tobias, D. M. Duffy (Eds.). Constructivist theory applied to instruction: Success or failure? (pp. 62-81). New York, NY: Taylor & Francis. Hiebert, J., & Carpenter, T.P. (1992). Learning and teaching with understanding. In J. Hiebert, T. P. Carpenter, D. A. Grouws (Eds.). Handbook of research on mathematics teaching and learning: A project of the National Council of Teachers of Mathematics (pp. 65-97). New York: Macmillan. Jacobsen, D., Eggen, P., & Kauchak, D. (1993). Methods for teaching: A skills approach (4th ed.). Columbus, OH: Merrill. Jonassen, D.H. (1991). Objectivism versus constructivism: Do we need a new philosophical paradigm? Educational technology research and development, 39(3), 5-14. https://doi.org/10.1007/BF02296434. Jonassen, D.H. (1991a). Evaluating constructivistic learning. Educational Technology, 31(9), 28-33. Kilpatrick, J. (1987). Problem formulating: Where do good problems come from? In A. H. Schoenfeld (Ed.). Cognitive science and mathematics education (123-148). Hillsdale. NJ: Erlbaum. Klein, A.S. (1998). Flexibilization of mental arithmetic strategies on a different knowledge base. Utrecht: Freudenthal Institute. Kraiger, K., Ford, J.K., & Salas, E. (1993). Application of cognitive, skill-based, and affective theories of learning outcomes to new methods of training evaluation. Journal of Applied Pschology, 78(2), 311-328. Krathwohl, D.R., Bloom, B.S., & Bertram, B.M. (1973). Taxonomy of educational objectives, the classification of educational goals. Handbook II: affective domain. New York: David McKay. Yilmaz, Prospective mathematics teachers’ cognitive competencies on realistic mathematics education 39 Krathwohl, D. R. (2002). A revision of Bloom's taxonomy: An Overview. Theory into Practice, 41(4), 212-218. https://doi.org/10.1207/s15430421tip4104_2. Kwon, O.N. (2002). Conceptualizing the Realistic Mathematics Education approach in the teaching and learning of ordinary differential equations. In Proceedings of the International Conference on the Teaching of Mathematics Hersonissos, July 1-6. Crete, Greece. Lesh, R., Doerr, H.M., Carmona, G., & Hjalmarson, M. (2003). Beyond constructivism. Mathematical Thinking and Learning, 5(2&3), 211–233. http://dx.doi.org/10.1080/10986065.2003.9680000. Lianghuo, F. (2014). Investigating the pedagogy of mathematics: How do teachers develop their knowledge?. Imperial College Press, London. Lincoln, Y.S., & Guba, E.G. (1985). Naturalistic inquiry (Vol. 75). California: Sage. Makonye, J. P. (2014). Teaching functions using a realistic mathematics education approach: A theoretical perspective. International Journal of Educational Sciences, 7(3), 653-662. https://doi.org/10.1080/09751122.2014.11890228. National Research Council. (2001). Adding it up: Helping children learn mathematics. J. Kilpatrick, J. Swafford, & B. Findell (Eds.). Mathematics Learning Study Committee, Center for Education, Division of Behavioral and Social Sciences and Education. Washington, DC: National Academy Press. Noddings, N. (1990). Constructivism in mathematics teaching. In R. B. Davis, C. A. Maher, N. Noddings (Eds). Constructivist views on the learning and teaching of mathematics (7-18). Reston, VA: National Council of Teachers of Mathematics. (J for Res in Math Edu. Monograph Number 4). OECD. (2003). The PISA 2003 assessment framework. mathematics, reading, science and problem solving knowledge and skills. Paris: OECD. OECD. (2009). PISA 2006 Technical report. Paris: OECD. Patton, M.Q. (1990). Qualitative evaluation and research methods. (2nd ed.) Newbury Park, CA: Sage. Pearson, P.D., & Johnson, D.D. (1978). Teaching reading comprehension. New York: Holt, Rinehart, & Winston. Piaget, J. (1973). To understand is to invent: The future of education. New York: Grossman. Putnam, R.T., & Borko, H. (2000). What do new views of knowledge and thinking have to say about research on teacher learning? Educational Research, 29(1), 4-15. https://doi.org/10.3102%2F0013189X029001004. Rasmussen, C.L., & King, K.D. (2000). Locating starting points in differential equations: A realistic mathematics education approach. International Journal of Mathematical Education in Science and Technology, 31(2), 161-172. https://doi.org/10.1080/002073900287219. Richards, J. (1991). Mathematical Discussion. In E. von Glasersfeld (Ed.). Radical constructivism in mathematical education (pp. 13-52). Dortrecht: Kluwer. Sadler, T.D., & Zeidler, D.L. (2009). Scientific literacy, PISA, and socioscientific discourse: Assessment for progressive aims of science education. Journal of Research in Science Teaching, 46(8), 909-921. https://doi.org/10.1002/tea.20327. 40 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 17-44 Selter, C. (1997). Instructional design for teacher education. In M. Beishuizen, K.P.E. Gravemeijer, & E.C.D.M. van Lieshout (Eds.), The role of contexts and models in the development of mathematics strategies and procedure (pp. 55-77). Utrecht:CD-B Press/Freudenthal Institute. Selter, C. (2001). Understanding-the underlying goal of teacher education. In M. Van den HeuvelPanhuizen (Ed.), Proceeding of 25th conference international group of psychology of mathematics education, 12-17 July, Utrecht, the Netherlands. Sembiring, R.K, Hadi, S., & Dolk, M. (2008). Reforming mathematics learning in Indonesian classrooms through RME. ZDM, 40, 927-939. https://doi.org/10.1007/s11858-008-0125-9. Simon, M.A. (1995). Reconstructing mathematics pedagogy from a constructivist perspective. Journal for Research in Mathematical Education, 26(2), 114-145. https://doi.org/10.2307/749205. Simons, R.J., Van der Linden, J., & Duffy, T. (Eds.) (2000). New learning. Dordrecht, The Netherlands: Kluwer Academic Publishers. Shulman, L. (1987). Knowledge and teaching: Foundations of the new reform. Harvard educational review, 57(1), 1-23. Sowder, J.T. (2007). The mathematics education and development of teachers. In F. K. Lester Jr. (Ed.), Second handbook of research on mathematics teaching and learning (pp. 157-223). Charlotte, NC: Information Age. Sriraman, B., & English, L. (2010). Surveying theories and philosophies of mathematics education. In B. Sriraman & L. English (Eds). Theory of mathematics education. Advances in mathematics education (pp. 7-32). Berlin Heidelberg: Springer-Verlag. Stacey, K. (2005). The place of problem solving in contemporary mathematics curriculum documents. Journal of Mathematical Behavior, 24, 341–350. https://doi.org/10.1016/j.jmathb.2005.09.004. Streefland, L. (1991). Fraction in Realistic Mathematics Education, A paradigm of developmental research. Dordrecht: Kluwer Academic Publisher. Svinicki, M.D. (1998). Theoretical Foundation for Discovery Learning. Advances in Psychological Education, 20(1), S4-S7. https://doi.org/10.1152/advances.1998.275.6.S4. Szydlik, J.E., Szydlik, S.D., & Benson, S.R. (2003). Exploring changes in pre-service elementary teachers’ mathematical beliefs. Journal of Mathematics Teacher Education, 6(3), 253-279. https://doi.org/10.1023/A:1025155328511 Treffers, A., & Goffree, F. (1985). Rational analysis of realistic mathematics education - The Wiskobas Program. In L. Streefland (Ed). Proceedings of the 9th International Conference for the Psychology of Mathematics Education, July 22-29; Noordwijkerhout. Utrecht, The Netherlands. Treffers, A. (1991). Realistic mathematics education in the Netherlands 1980 - 1990. In L. Streefland (Ed.). Realistic Mathematics Education in primary schools (pp. 11–20). Utrecht: Freudenthal Institute. Treffers, A. (1987). Three dimensions. A model of goal and theory description in mathematics education. Dordrecht: Reidel. Tsamir, P. (2008). Using Theories as Tools in Mathematics Teacher Education, In D. Tirosh & T. Wood (Eds.). Tools and Processes in Mathematics Teacher Education (pp. 221-232). Rotterdam: Sense Publishers. Yilmaz, Prospective mathematics teachers’ cognitive competencies on realistic mathematics education 41 Van den Heuvel-Panhuizen, M. (1996). Assessment and Realistic Mathematics Education. Utrecht: Freudenthal Institute. Van den Heuvel-Panhuzen, M. (2003). The didactical use of models in realistic mathematics education: An example from a longitudinal trajectory on percentage. Educational Studies in Mathematics, 54, 9-35. https://doi.org/10.1023/B:EDUC.0000005212.03219.dc. Verschaffel, L., Greer, B., & De Corte, E. (2000). Making sense of word problems. Swets & Zeitlinger, Lisse. Von Glasersfeld, E. (1987). Learning as a constructive activity, In C. Janvier (Ed.). Problems of representation in the teaching and learning of mathematics (pp. 3-18). Hillsdale, NJ: Lawrence Erlbaum Associate. Wang P.L. (1990). Assessment of Cognitive Competency. In D. E. Tupper and K. D. Cicerone (Eds.) The Neuropsychology of Everyday Life: Assessment and Basic Competencies. Foundations of Neuropsychology (pp. 219-228). Springer, Boston, MA. Webb, N. (1997). Criteria for alignment of expectations and assessments on mathematics and science education. Council of Chief State School Officers and National Institute for Science Education Research Monograph No. 6. Washington, DC: CCSSO. Webb, N. (1999). Alignment of science and mathematics standards and assessments in four states. Council of Chief State School Officers and National Institute for Science Education Research Monograph No.18. Washington, DC: CCSSO. Widjaja, W. (2008). Local instruction theory on decimals: The case of Indonesian pre-service teachers. Doctoral dissertation. Melbourne: The University of Melbourne. Widjaja, W. (2013). The use of contextual problems to support mathematical learning. Journal on Mathematics Education, 4(2), 157-168. http://dx.doi.org/10.22342/jme.4.2.413.151-159. Wu, H. (1999). Basic skills versus conceptual understanding. American Educator, 23(3), 14-19. Wubbels, T., Korthagen, F., & Broekman, H. (1997). Preparing teachers for realistic mathematics education. Educational Studies in Mathematics, 32, 1–28. https://doi.org/10.1023/A:100290052. Yackel, E., & Cobb, P. (1996). Sociomath norms, argumentation, and autonomy in mathematics. Journal for Research in Mathematical Education, 27, 75-190. https://www.jstor.org/stable/749877 Yackel, E., Gravemeijer, K., & Sfard, A. (2011). A journey in mathematics education research. Dordrecht: Springer. Yıldırım, A., & Şimşek, H. (2005). Sosyal bilimlerde nitel araştırma yöntemleri (Qualitative research methods for social sciences). Ankara: Seçkin Yayıncılık. Zulkardi. (2002). Developing a learning environment on realistic mathematics education for Indonesian student teachers. Doctoral Dissertation. Enschede: University of Twente. 42 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 17-44 APPENDIX Problem 1 When the sea snake in the picture becomes 1 month old, a black ring emerges around its body. Each month, a yellow ring emerges in the middle of the black ring; and thereby, two black and one yellow rings appear. In the following months, this continues in the same way. In other words, each black ring is cut in the middle with a yellow ring. How many rings does a 12 month old sea snake have? How can we find the age of a sea snake according to its number of rings; or the number of black and yellow rings of a sea snake of a certain age? (Altun, 2011). Comment: It was aimed to reach geometrical sequence conception through this problem. Problem 2 Ship P is going in a straight line toward point B on the shore, at a constant speed. Ship Q is going in the direction of A at double the speed of ship P. How close do the two ships get? (Wubbels, Korthagen, & Broekman, 1997). Comment: It was aimed to reach vector conception through this problem. Problem 3 The figure below shows two rice fields separated by a road. Both rice fields are planted with the same rice and they are given the same fertilizer. The dots on the figure represent rice clusters. Which rice field produces more rice? (Fauzan, 2002). Comment: It was aimed to reach reallotment-congruent area conception through this problem. Yilmaz, Prospective mathematics teachers’ cognitive competencies on realistic mathematics education 43 Problem 4 This is a photograph of a hiker on the rim of the Grand Canyon looking down trying to see the Colorado River at the bottom of the canyon. Can the hiker see the river below? From which points and perspective should the hiker look in order to see the river? (Feijs, 2005). Comment: It was aimed to discuss tangent conception through this problem. 44 Journal on Mathematics Education, Volume 11, No. 1, January 2020, pp. 17-44 International Journal on Emerging Mathematics Education (IJEME) Vol. 1, No. 1, March 2017, pp. 11-24 P-ISSN: 2549-4996, E-ISSN: 2548-5806, DOI: http://dx.doi.org/10.12928/ijeme.v1i1.5695 Teachers’ Real and Perceived of ICTs Supported-Situation for Mathematics Teaching and Learning 1Maman Fathurrohman, 2Anne L. Porter, 2Annette L. Worthy 1Universitas Sultan Ageng Tirtayasa, Jl. Raya Jakarta Km. 4, Panancangan, Serang, Banten 42124 2University of Wollongong, Northfields Ave, Wollongong NSW 2522, Australia Email: [email protected] Abstrak Tujuan dari penelitian ini adalah untuk mendapatkan informasi yang berkaitan dengan Infrastuktur, Fasilitas, dan Sumber Daya Teknologi Informasi dan Komunikasi (TIK) di Kecamatan Bojonegara, Provinsi Banten, Indonesia. Paper ini menekankan publikasi hasil tersebut karena informasi itu diperlukan untuk penerapan pembelajaran matematika berbasis teknologi. Metode yang digunakan adalah survai. Instrumen yang digunakan adalah kuesioner, pedoman wawancara tidak terstruktur, dan handycam. Selama survei, total 220 paket kuesioner dibagikan kepada guru, tetapi hanya 119 (tingkat respon 54,1%) yang diisi dan dikembalikan. Sebanyak 12 guru telah diwawancarai dengan lima dari wawancara tersebut direkam dalam video. Beberapa kepala sekolah menyambut baik dan mengizinkan peneliti untuk mengunjungi sekolah mereka dan membuat dokumentasi terkait dengan infrastruktur, fasilitas, dan sumber daya TIK di sekolah mereka, sementara yang lain tidak mengizinkan peneliti untuk melakukannya. Berdasarkan survei, berbagai fakta penting telah ditemukan. Disarankan bahwa Teacher-Centered Learning with Technology adalah metode yang paling tepat untuk diterapkan. Kata Kunci: survai, pembelajaran berbasis teknologi, TIK, infrastruktur, fasilitas, sumber daya Abstract Purpose of this research was to gain information related to the real and perceived ICT Infrastructures, Facilities, and Resources in Bojonegara Sub District, Indonesia. This article emphasize the publication of this information because it is needed for implementation of technology-based mathematics teaching and learning. The method used was survey. Instruments of survey were questionnaires, unstructured interview guideline, and handycam. During the survey, total of 220 questionnaire packages were distributed to teachers, however only 119 (response rate 54.1%) of them were filled and returned. A total of 12 teachers were interviewed, with five of these interviews were video recorded. Several head masters welcomed and allowed researcher to visit their schools and make documentation of ICT Infrastructures, facilities, and resources, while the others did not allow the researcher to do that. Based on survey, many important findings have been discovered. It is suggested that the Teachers-Centered Learning with Technology is the most appropriate method of technology-based learning to be implemented. Keywords: survey, technology-based learning, ICT, infrastructure, facilities, resources How to Cite: Fathurrohman, M., Porter, A.L., & Worthy, A.L. (2017). Teachers’ real and perceived of ICTs supported-situation for mathematics teaching and learning. International Journal on Emerging Mathematics Education, 1(1), 11-24. http://dx.doi.org/10.12928/ijeme.v1i1.5695. INTRODUCTION Technology is one aspect in human civilization that spreads quickly from one nation to other nations. The spreads of technology, especially information and communication technology, has influence and change human life. In the field of education, technology has been adopted as can be seen in the daily academic activities, not only in universities but also in Received January 24, 2017; Revised February 16, 2017; Accepted February 23, 2017 12 P-ISSN: 2549-4996 | E-ISSN: 2548-5806 secondary and primary schools. Teaching and learning with technology have been happens in many places because of its benefit on provides advantage in making the process of teaching and learning more effective and efficient. The implementation of technology-based mathematics teaching and learning requires supporting infrastructures, facilities, and resources. For that reason, the conditions of these components, called the ICTs Supported-Situation was needed for implementation of technologybased learning, have to be known. Several initiatives have been conducted to discover ICTs Supported-Situation for Teaching and Learning. The initiative were surveys of, or related to, ICT for education with reports generating recommendations, including recommendation regarding ICT for education. Examples include, The Survey of ICT and Education and country focused surveys in ten countries, Argentina, Brazil, the Philippines, Kenya, India, Morocco, Peru, Senegal, Ukraine, and Vietnam, that also provide data and information about recent ICT conditions in these developing countries, and surveys of India and seven other developing countries in South Asia, Afghanistan, Bangladesh, Bhutan, Maldives, Nepal, Pakistan, and Sri Lanka by Information and Development Program or InfoDev (2007; 2009a;2009b; 2010). The published reports of surveys or in part of survey by the World Economic Forum (2016) and International Development Research Centre (2010) also comprehensively addressed current usage of ICT. Some sections of reports briefly discussed regional and global trends of ICT in education and selected regional ICT initiatives and projects in education. In addition to the surveys several studies related to ICTs Supported-Situation and education. However the surveys are too general to be practically used or refereed for implementation of technology-based mathematics teaching and learning. There is a need to conduct a survey in accessible area for this purpose. Term infrastructure, facility, and resource are defined as following: Infrastructure is defined as the basic physical and organizational structures and facilities needed for the operation of a society or enterprise. Facility is defined as a place, amenity, or piece of equipment provided for a particular purpose. While Resource is defined as a stock or supply of money, materials, staff, and other assets that can be drawn on by a person or organization in order to function effectively. The implementation of technology is relevant to Fathurrohman (2012) suggestion for Addressing the need of mathematics teachers in developing countries. Based on above definitions, researchers defined the objects of study which were needed to be known. For infrastructures were the conditions of ICT infrastructures at the area, the schools, and teachers’ home. For facilities were the computer lab, computer, printer, scanner, internet connection, and handphone at schools and teachers home. For resources were digital materials, CD, DVD, and software related to it. One important thing that needs to be considered was the perspective of teachers related to above components. In other word, not only the real condition, but also the perceived condition, based on teachers’ perspective about these components was needed to be taken into account. According to Oxford Dictionaries (2011), the Real is defined as actually existing as a thing or occurring in fact; not imagined or supposed. While Perceive is defiend as become aware or conscious of (something); come to realize or understand. REAL PERCEIVED Infrastructures Infrastructures Facilities Facilities Resources Resources Figure 1. Objects of Survey IJEME, Vol. 1, No. 1, March 2017, 11-24 IJEME ISSN: 2549-4996 13 The knowledge related to these components would useful to select a suitable technology-based learning method. According to seminal work by Ross (1995: 24) there are five technology-based learning methods available. These methods are based on the concept of learner control. Table 1. Matrix of technology-based learning methods Learning Method Center of Control World of Work Strategies Example Teacher-centered Teachers: The Training sessions, Multimedia learning with teachers directs specific skill presentation, technology the pace and development videotape, distance sequence instruction Integrated learning Machine: A Teaching machines Distributed ILS, system computer network lab-centered ILS and its software direct the learning Electronic Teams or partners: Developmental Local area collaboration The teams teams, joint networks, wide are learning negotiates, goals, research efforts, networks, pacing, and learning teams cooperative sequence of ventures learning Hyperlearning Learner: The Research, market Hypertext learner is in charge analysis, development, of pace and engineering design hypermedia sequence of development, learning multimedia development, network searching Electronic learning Machine and Flight simulators, Virtuality simulations learner: Learning disaster control electronic is in joint control simulations, war simulation games The table revealed five technology-based learning methods. The table also provides what is the suitable condition to implement related method. The implementation of technology-based learning experiences requires information of current condition of real and perceived ICT infrastructures, facilities, and resources in developing countries. This article emphasize publication of this information because it is needed for implementation of technology-based mathematics teaching and learning. In order to get this information, a survey. RESEARCH METHOD Method of this research is survey. Survey is defined as examine and record the area and features to construct a map, plan, or description (Oxford, 2001). In this research, purpose of this survey is to examine and record the real and perceived infrastructures, facilities, and resources to construct a plan for implementation of technology-based mathematics teaching and learning. According to De Vaus (2002) there are five ethical responsibilities towards survey participants: 1) voluntary participation, 2) informed consent, 3) no harm, 4) confidentiality anonymity, and 5) privacy. The responsibilities already considered for this activities. Teachers’ Real and Perceived of ICTs Supported-Situation … Fathurrohman, Porter, & Worthy 14 P-ISSN: 2549-4996 | E-ISSN: 2548-5806 Population and Sample One accessible area and of interest to the researchers was Bojonegara Sub Disctrict, Banten Province, Indonesia. Banten Province is located in Java Island, Indonesia. Respondents of of this research are registered teachers, defined as teachers who have NUPTK (Nomor Urut Pendidik dan Tenaga Kependidikan) at the start of survey. Following is the data related to respondents Table 2. Data of registered teachers in Bojonegara Sub District, Indonesia Number of Number of No Level Percentage Schools teachers 1 Elementary School 22 313 53.50 2 Junior High School 9 194 33.16 3 Senior High School 3 78 13.33 Total 34 585 100.00 To facilitate comparisons and to have a sufficiently large sample the researchers took a higher proportion from the smaller cohorts. Researchers also consider time duration and an access to schools in choosing the number of schools to sample. Following is an estimation of the number each school type. 1. Around 1/7 of number of elementary schools teachers (1/7 x 313 = 45 teachers) 2. Around 1/5 of number of junior secondary schools teachers (1/5 x 194 = 39 teachers) 3. Around 1/2 of number of senior elementary schools teachers (1/2 x 78 = 39 teachers) The teachers were determined based on a simple random sampling scheme within each type. Data and Instruments The collected data including the data related to the supporting infrastructures, facilities, and resources for implementation of technology-based learning. Moreover, because the technology-based learning also related to learning resources and learning design, the data related to these components in teaching and learning experiences also need to be collected. Learning defined as the acquisition of knowledge or skills through study, experience, or being taught, while Design isdefined as purposes or planning that exist behind an action, fact, or object. Started from this, the following definitions are proposed. Resource is defined as a source of help or information, learning is defined as the acquisition of knowledge or skills through study, experience, or being taught, while Mathematical defined as relating to mathematics. Based on above definitions, Mathematical Learning Resources are sources of information that can be used to acquisition of knowledge or skills related to mathematics through study, experience, or being taught. In this article, Mathematical Learning Resources defined as sources of information, represented in a variety of media and format that can be used to assist student learning through study, experience, or being taught, as defined in national curricula, to acquisition of knowledge or skills related to mathematics Learning Designs are planning that exist behind actions or facts of the process acquisition of knowledge or skill through study or experience. Term learning design is IJEME, Vol. 1, No. 1, March 2017, 11-24 IJEME ISSN: 2549-4996 15 variously defined by different authors as a process of, and for, designing learning experiences. The purposes of learning design are to documents and describe learning activity that other teachers can understand it and use it (in some way) in their own context. Specifically, Learning Design defined can be considered in two ways: (1) as a process designing learning experiences and (2) as a product, that is, the outcome or artifact of the design process (Agostinho, 2009, p. 4). To gather data three types of instruments were used. 1. Questionnaire The questionnaire consists of 27 main questions. Several main questions have one or two child questions for further deep information or clarification. The topic of questions is based on focus of survey as described in Table below 2. Unstructured Guideline for Interview Unstructured guideline for interview constructed to follow the flow of discussion with teachers. Interview were conducted by following the flow of discussion. Sometime researcher found a new interesting topic while conducting discussion with teachers. 3. Handycam Handycam used to take pictures and videos related infrastructures, facilities, and resources available in the area. For each teacher, gender, working experience, academic degree & field of study, and training in ICT was assessed in relation to ICT Condition at school & home, and the learning designs & resources they had access to or experiences. Validity of data gathered is ensured through triangulation. Three sources of data based on different instruments for one same of object are used in triangulation. RESULTS AND DISCUSSION Bojonegara is a Sub District of Banten Province, located in a coastal region of Java Island, Indonesia (refer Figure 3). The area, covers 30.30 km2. Figure 2. Location of Bojonegara Sub District, Banten Province, Indonesia Geographically, rural areas dominate Bojonegara Sub District. However ICT technologies have penetrated the area, bring modernity to the people and the coutryside. As illustrated by photos there are many rice paddies, along with clusters of people, houses and buildings, such as government buildings and schools. Teachers’ Real and Perceived of ICTs Supported-Situation … Fathurrohman, Porter, & Worthy 16 P-ISSN: 2549-4996 | E-ISSN: 2548-5806 Figure 3. Survey Photos of Bojonegara Sub District, Banten Province, Indonesia Participants The localised study was conducted October to February years ago, with several weeks of break around the end of December and early of January. Ten headmasters welcomed and allowed the researcher to visit and document ICT infrastructure and facilities in their schools. Only two headmasters did not allow the researcher access to the school or teachers. A total of 220 questionnaire packages were distributed to teachers in elementary, junior and senior secondary school. Over the three school levels a total number of 119 teachers completed and returned the questionnaires, giving a response rate of 54 per cent. The numbers of teachers as respondents is not exactly same as sought when the sample composition was calculated, with less responding from the elementary school and senior secondary school level, and more responding from junior secondary school level as presented in the Table 3. The variation from that intended was reasonable in terms of providing groups near 30 teachers and the responses considered viable numbers in terms of representing school ICT conditions. Table 3. Comparison of expected number and the real number of participants Expected to School level Participated Difference Participate Elementary School 45 41 -4 Junior Secondary School 39 51 + 12 Senior Secondary School 39 27 - 12 Total 123 119 -4 The teachers were considered to be participants if they either fully or partially completed the questionnaire then returned it to the researcher. In addition, a total of 12 teachers (some of them completed questionnaire and some did not) agreed to be interviewed, with five of these interviews video recorded. At each school level the respondents are predominantly female and this is in accord with a greater proportion of teachers being female. Approximately 65 per cent of respondents were female. IJEME, Vol. 1, No. 1, March 2017, 11-24 IJEME ISSN: 2549-4996 17 The teachers’ years of teaching/working experiences is presented in Table 5. Notably, approximately 80 per cent of the respondents had more than three years experience. While 43 per cent have 4 to 10 years working experiences. Indeed only 14.6 per cent of elementary teachers are in their first three years of teaching, although this increases to 20.6 per cent junior secondary and 29.6 per cent of senior secondary teachers. Based on this data, most respondents can be classified as experienced, senior teachers. Table 4. Teachers working experiences School Level Elementary Junior Secondary Senior Secondary Total 0-3 years N % 6 14.6 8 20.5 4-10 years n % 10 24.2 24 61.5 11-20 years N % 7 17.1 6 15.4 21- 30 years n % 10 24.4 1 2.6 >30 years n % 8 19.5 0 0 Total N 41 39 8 29.6 12 44.4 6 22.2 0 11 1 3.7 27 22 20.6 46 43.0 19 17.8 11 10.3 9 8.4 107 Since the participants surveyed and interviewed were predominantly experienced teachers, it is assumed they are more likely to know the state of ICT than the newer teachers and as such the information gathered represents the real conditions in Bojonegara Sub District. Data provided by teachers, regarding the ICT conditions in Bojonegara Sub District was considered the primary data but this was cross checked and validated by the researcher visiting and documenting (photos and video) ICT infrastructure, facilities and resources in the area and schools. As this study is concerned with ICT, the number of teachers trained in relation to ICT was explored as this relates to teachers ICT skills. As can be seen only a very small percentage of teachers and all from the senior secondary school had been trained in the ICT field. Table 5. Teachers experience on attending ICT related training Never 1-2 times 3-5 times > 5 times School Level N % n % N % N % Elementary 30 73.2 8 19.5 1 2.4 2 4.9 Junior 9 23.1 17 43.6 12 30.8 1 2.6 Secondary Senior 11 40.7 13 48.1 2 7.4 1 3.7 Secondary Total 50 46.7 38 35.5 15 14.0 4 3.7 Total N 41 39 27 107 Although the official educational background of teachers is good, their training, attending or participating in ICT related training is still low. In total only 3.7 per cent teachers have more than 5 training experiences related to ICT, and only 14 per cent of them have 3 to 5 training experiences related to ICT. This finding suggests that the teachers ICT skills may be poor. If there are many teachers in Bojonegara Sub District with good ICT skills, it would be because of their self-interest and experiences with ICT, and not because of the official training conducted by schools or the Education Office. This finding in fact can be used as a basis for recommending the Education Office conduct a number of ICT related training programs for teachers in Bojonegara Sub District, Banten Province, Indonesia. Teachers’ Real and Perceived of ICTs Supported-Situation … Fathurrohman, Porter, & Worthy 18 P-ISSN: 2549-4996 | E-ISSN: 2548-5806 Findings of ICTs Supported-Situation There are various levels at which ICT conditions can be examined, in the sub district, at the school, and the home levels. 1. In the sub District In recent years, a substantial amount of ICT infrastructure and facilities have been developed in this region of Banten Province. Base transceiver station (BTS) towers, constructed in this region, now deliver wireless signals for hand phones and internet connection, enhancing communication both within and outside the school district. 2. At the school level In elementary schools, computer and related facilities are only used for administrative purposes, such as writing letters or administration reports; they are not used for teaching and learning. The equipment is supplied by the Education Office for school administration purposes not to support teaching and learning processes. All elementary schools visited during observation had TV and related electric equipment (such as CD/DVD players, refer Figure 7 for a typical set up). In junior and senior high schools, observation revealed that 66.7 per cent of the sampled schools had one computer laboratory for teaching and learning. Teachers through the questionnaire revealed a similar percentage of schools containing at least one laboratory. Where computer laboratories are used the usage is not high. There is no computer laboratory use for teaching and learning at elementary school. Results of interview and photo documentation confirmed junior and secondary schools usually have a computer laboratory. According to the interviews, state schools received these computers, electronic resources, and related devices from the government. Private schools receive a donation from students’ parents or grants from their founders. At least until these data were collected, although there are many private companies (some of them are foreign companies and manufacturing factories) in Bojonegara Sub District, schools in this area received neither grant or benefit from these companies for ICT infrastructure, facilities, and resources. There is an opportunity for the government to encourage companies as part of their Corporate Social Responsibility (CSR) to consider giving educational grants to schools, by giving ICT facilities to schools or by providing support in the form of scholarships for poor students. CSR is mandated for good corporate governance, and should be implemented by these companies, because they are located near enough to schools and rich enough to do it. At elementary schools, computers, printers, and other electronic equipment are only for use for administrative purposes, not for teaching and learning. Students learn in the class with conventional teaching and learning processes involving teacher instruction at the whiteboard. However at junior and secondary school, the computer laboratories are available. Although in some schools the number of computers in the computer laboratory is not more than 20 units, whereas class size is 30 to 40 students, still their condition is good and they can be used for teaching and learning. Observation indicated there was internet access (wireless) in the junior and senior high schools, and although the speed of the internet access was not good, it was still convenient to use for accessing web sites for searching and downloading learning resources from the internet. The researcher also observed one teacher accessing a IJEME, Vol. 1, No. 1, March 2017, 11-24 IJEME ISSN: 2549-4996 19 social network account during the visit using a notebook connected to the internet, a further indication of use of the internet. Notebooks (portable computers) or projectors are available at each school level. All junior secondary teachers had access to a notebook and computer in their school, however approximately 30 per cent of teachers in elementary and senior secondary school reported that they did not have access to these notebooks and projectors. This may have happened due to limitations in the number of notebooks and projectors. Table 6. The use of computer notebooks or projectors for teaching and learning No 1-2 times 3-5 times 6-10 times Total per month per month per month School Level N % N % n % n % N Elementary 29 100.0 0 0.0 0 0.0 0 0.0 29 Junior 22 71.0 5 16.1 3 9.7 1 3.2 31 Secondary Senior 24 92.3 2 7.7 0 0.0 0 0.0 26 Secondary Total 75 87.2 7 8.1 3 3.5 1 1.2 86 In terms of internet access, this facility is only available to teachers in junior and senior secondary schools. Observation confirmed this internet access with the researcher detecting the wifi signal and observing teachers connect their own notebooks to the internet using school provided connections. Table 7. School levels and the teachers’ use of internet for teaching and learning No 1-2 times 3-5 times 6-10 times Total per month per month per month School Level N % N % N % n % N Elementary 41 100.0 0 0.0 0 0.0 0 0.0 41 Junior 26 74.3 7 20.0 1 2.9 1 2.9 35 Secondary Senior 24 88.9 2 7.4 0 0.0 1 3.7 27 Secondary Total 91 88.3 9 8.7 1 1.0 2 1.9 103 Teachers reported using their personal mobile phone in the school for internet access. The interviews, in accord with the questionnaires also reveal that several teachers have an internet connection from their own mobile phone and frequently access the internet using them. They know that the internet is useful and they know how to use it. In addition, 18.8 per cent of teachers frequently access the internet using their mobile phones more than five times a day in the school. Table 8. School level and number of teacher internet accesses with mobile phones No 1-2 times 3-5 times > 5 times Total per day per day per day School Level n % N % N % n % n Elementary 12 85.7 2 14.3 0 0.0 0 0.0 14 Junior Secondary 1 5.9 8 47.1 1 5.9 7 41.2 17 Senior Secondary 6 35.3 6 35.3 3 17.6 2 11.8 17 Total 19 39.6 16 33.3 4 8.3 9 18.8 46 Teachers’ Real and Perceived of ICTs Supported-Situation … Fathurrohman, Porter, & Worthy 20 P-ISSN: 2549-4996 | E-ISSN: 2548-5806 Fifty-one percent of teachers used their mobile phones to access the internet. These finding suggest that teachers are familiar with the internet, and are thus able and interested in accessing internet although they were on working days at school. With these skills and internet access at least at the high school level, it is possible that teachers could be supported to use resources on the internet for teaching and learning. The usability of ICT at schools depends upon it being maintained and serviced during technical problems that can occur during use. For this reason, there should be one or more IT staff, whether officially employed in the school or by another company, such as computer service shop. There is no IT staff at elementary schools nor did visits by IT professionals, while in total 56.1 percent teachers of junior and senior secondary staff state that IT staff is available. In junior and secondary highschool, 24.2 per cent of teachers indicated that IT staff visit only 1 or 2 times per month, while 31.8 cent stated that IT staff visit schools more than 10 times per month. 3. At the teachers’ home In contrast to the availability of technology in schools, the highest percentage of teachers who have computers at home are the elementary teachers (82.5%) with the least number of computers at home reported by junior secondary (72.2%) and then the senior secondary teacher (64%). Further results, in regard to available computer and related devices and services. Table 9. Available computer and related devices & services Elementary Junior Secondary Senior Secondary At home N % N % N % Computer 33 82.5 26 72.2 16 64.0 Printers & Scanners 28 70.0 20 55.6 14 56.0 Internet Access 1 2.5 13 35.1 18 30.8 Personal Website or Blog 0 0.0 2 6.5 5 20.8 Phone internet Access 6 15.0 27 79.4 12 63.2 IT assistance 27 67.5 18 52.9 18 81.8 The number of teachers who use their own computer at home for preparation of teaching and learning is low particularly for elementary teachers, 96.4 percent of whom never use the home computer for teaching and learning. Teachers’ use of their own computers for teaching and learning includes the preparation of learning resources, learning designs, tasks, downloading learning materials from internet. The findings show that the teachers of senior and junior secondary schools are more familiar with the use of computer for teaching and learning than teachers in elementary schools. The use of computer is not in term of programming. At home Table 10. The use of computers for teaching and learning Junior Senior Elementary Total Secondary Secondary n % N % n % % Computer Never 1-2 times per month 3-5 times per month 6-10 times per month > 10 times per month 27 1 0 0 0 IJEME, Vol. 1, No. 1, March 2017, 11-24 96.4 3.6 0.0 0.0 0.0 6 8 5 1 0 30.0 40.0 25.0 5.0 0.0 1 5 1 1 5 7.7 38.5 7.7 7.7 38.5 55.7 23.0 9.8 3.3 8.2 IJEME ISSN: 2549-4996 21 In accord with the questionnaire, those staff interviewed confirmed that the majority of teachers have their own computer at home. Teachers stated that their computer at home is used by their family, usually their children, for many purposes such as for playing games, watching movies, and other fun activities. The price of a set of desktop computer today, in Indonesia normally priced Rp4,500,000 to Rp9,000,000 or around $300 to $600 (all $ means Australian Dollar). Some teachers also indicated that they have a computer notebook, in Indonesia normally priced around Rp4,500,000 to Rp30,000,000 or around $300 to $2,000. This range of prices is affordable for many public servant teachers whose base annual salary for new teachers with no professional teaching experience or service is Rp28,423,200 and senior teachers with 32 years or more professional teaching experience or service is Rp65,026,000 per year. This range is lower than annual salary of teachers in some countries, for example in Australia, fouryear trained teacher start on salary $59,706 and the most experienced classroom teachers earning $89,050. For many public servant teachers in Indonesia, the prices of computer and related equipment are affordable. Approximately 20 per cent teachers bring the computer notebook to the school for their own purposes. Although the number of teachers who have printer or scanner is less than those who own computers, in total the 61.4 per cent of teachers have their own printer or scanner. Despite the large number of teachers (n=55, 61.4%), who have their own printer or scanner, only 31.8 (n=23) percent use it to support teaching and learning. These and other findings suggest there is a potential to support teachers’ use of their home computer for teaching and learning. The number of teachers who have and use internet access and phone internet access for teaching and learning is low particularly for elementary teachers who never use internet access and phone internet access for teaching and learning. Teachers’ use of internet access and phone internet access for teaching and learning are for activities such as downloading learning materials from internet, searching educational articles to learn how to improve their teaching and learning. The findings show that the teachers of senior and junior secondary schools are more familiar with the use of computer for teaching and learning than teachers in elementary schools. Twenty-one per cent of teachers, all but one secondary teacher has internet access at home. The teachers of senior and junior secondary schools are more familiar with the use of internet for teaching and learning with none of the elementary teachers using their home internet for teaching and learning purposes. Of the teachers who have the internet at home only around 25 per cent, use it for teaching and learning purposes, however it is possible that teachers can be supported to take advantage of the resources on the internet for teaching and learning. At home, 48.4 per cent of teachers access the internet using their mobile phone, and the number of them who frequently access the internet using mobile phone is also large, up to 59.1 per cent. Only a small number (7.4%) of teachers have a personal web site or blog, however the use of personal web sites or blogs for teaching and learning is negligible. Only three teachers (15%) of those who had a blog or personal website used these for school purposes. At home IT assistance No schedule 1-2 times per week Everyday available Table 11. IT assistance at home Elementary Junior Secondary Senior Secondary N % n % n % 14 77.8 14 70.0 13 76.5 4 22.2 6 30.0 3 17.6 0 0.0 0 0.0 1 5.9 Total n Total % 41 13 74.5 1 23.6 1.9 Teachers’ Real and Perceived of ICTs Supported-Situation … Fathurrohman, Porter, & Worthy 22 P-ISSN: 2549-4996 | E-ISSN: 2548-5806 In regard to mathematical learning resources and learning designs. Teachers and students can freely use Buku Sekolah Elektronik (BSE) that is Electronic School Books. These electronic books are available in PDF format. There are several books that can be used by students and teachers from Class I, the first stage of formal education in Elementary Schools to Class XII, the last stage of Senior High School before university level. The Ministry of Education and Culture owns the copyright to these books and the contents are evaluated and monitored by the Ministry of Education and Culture. Teachers revealed in the interviews that they did not use them for teaching and learning. Some did not have or know about these books. Teachers might not consider the information in these books to be effective. Several senior and junior secondary schools have their own notebooks. According to interview, teachers stated that around 20 percent of teachers in their schools bring it to the school for their own purposes. Teachers also explained that they and their fellow teachers use the internet to download educational content for use in teaching and learning. The internet is sometimes their alternative source of teaching materials as they locate many internet available resources that are useful for their teaching. Exploring Possibilities for Mathematics Teaching and Learning Teachers were asked about the possibility of them implementing technologybased mathematical learning resources, through the internet or online platform in their school or class. As indicated in Table, 53.1 percent of junior secondary teachers and 23.5 percent teachers of senior secondary schools believe that the implementation is possible, however all, elementary school teachers considered that the implementation of technology-based mathematical learning resources “may be possible but it would be difficult”. These results are confirmed through interview with teachers. However teachers stated that the condition of ICT in their schools is problematic when it comes to the implementation of technology-based teaching and learning. One of teachers in the interviews stated that in general many teachers have good skills and are interested in implementing technology in the teaching and learning, including the integration of the internet to improve the quality of teaching and learning, however ICT infrastructure and facilities, such as the quantity and quality of available computers for teaching and learning, the lack of sufficient supporting equipment, such as projectors, the quality of internet access, software, and financial support required would make proper implementation of technology-based teaching and learning difficult. Other teachers interviewed made similar statements. In general teachers stated that poor ICT conditions is the main problem for the implementation of technology-based teaching and learning in Bojonegara Sub District, although they believe that the teachers have good computer skills, and are able to operate computers and other facilities for teaching and learning. The observer/researcher saw that one of the problems related to a mindset of teachers that the computers, related equipment, and internet access must be available before the implementation of technology-based teaching and learning. This meant that teachers did not try to or know how to optimize their use of the limited ICT for teaching and learning under the current ICT conditions. There were possibilities to support teaching and learning, or in general to improve the learning experiences of students using the current condition of ICT even though the ICT conditions in their school were poor. IJEME, Vol. 1, No. 1, March 2017, 11-24 IJEME ISSN: 2549-4996 23 The researcher/observer was seeking ways maximize the impact of ICT on teaching and learning given the current condition of ICT infrastructure and facilities, such as computers, the related equipment and internet access in schools and teachers’ homes, to improve students’ learning experiences, especially in mathematics, without having to request an additional infrastructure, facilities, and resources that would be costly for schools or government. This thinking lead to the development of tools related to the technology-based teaching and learning, appropriate to current ICT conditions, but tools that could contribute toward improving learning experiences. When examining the desire or willingness for teachers to create and share both resources and learning designs, seventy four per cent of teachers indicated that they had experienced creating or modifying learning resources. For learning designs, only 19.3 percent of teachers have tried to initiate learning designs in the classroom or school. In the case of sharing of learning resources and learning designs, as revealed in Table, 27.4 percent of teachers wish to share their own learning resources, compared to 93.3 percent of teachers who wish to share their learning design. The researcher also asked teachers about whether they had experiences creating or modifying digital learning resources for use through the internet or online. With respect to the use of learning design, 86.4 percent of teachers stated that they wish to use other teachers mathematical learning design, with only a few teachers experienced in creating digital mathematical learning resources. Teachers also wanted to use other teachers’ learning designs, with few teachers creating their own designs; therefore the circulation of a good learning design between teachers appears warranted. This finding became the foundation for the mapping learning design for circulation by one teacher to other teachers and complementing this the mapping of internet accessible learning resources for use by teachers. The above condition is suitable for condition where teachers directs the pace and sequence. The implementation can consider result of empirical study on how the implementation of technology will contribute to students performance Fathurrohman, Porter, and Worthy (2012) This condition also supports the condition where the instruction of teaching and learning, related to presentation for in class sessions, including the target on specific skill development. Based on above discussion, it is suggested that the Teachers-Centered Learning with Technology is the most appropriate method of technology-based learning to be implemented in Bojonegara Sub District, Banten Province Indonesia CONCLUSION Based on the results of observation, several important findings about the physical state and teachers’ awareness of ICT infrastructures, facilities, and resources were gathered. Teachers’ access and use of the internet to gather educational content for use in teaching and learning has been explored. The internet is one of the sources of learning resources. Since the teachers wish to use other teachers learning designs and have little experience in creating learning designs, it is appropriate to develop tools that assist the circulation of good learning designs between teachers. TeachersCentered Learning with Technology is the most appropriate method of technologybased learning to be implemented. Teachers’ Real and Perceived of ICTs Supported-Situation … Fathurrohman, Porter, & Worthy 24 P-ISSN: 2549-4996 | E-ISSN: 2548-5806 REFERENCES Agostinho, S. (2009). Learning Design Representations to Document, Model, and Share Teaching Practice. In Lockyer, L., et al. (2009). Handbook of Research on Learning Design and Learning Objects; Issues, Application, and Technologies. Vol. 1. New York: Information Science Reference. De Vaus, D., A. (2002). Surveys in Social Research (5th Edition). Crows Nest: Allen & Unwin. Fathurrohman, M., & Porter, A. (2012). Addressing the needs of a developing nation: electronic maps of mathematical learning resources accessible via the internet. Journal of Computers in Mathematics and Science Teaching, 31(4), 337-362. Fathurrohman, M., Porter, A., & Worthy, A., L. (2014). Comparison of performance due to guided hyperlearning, unguided hyperlearning, and conventional learning in mathematics: an empirical study. International Journal of Mathematical Education in Science and Technology, 45(5), 682-692. InfoDev. (2007). Survey of ICT for Education in Africa. World Bank. InfoDev. (2009a). Survey of ICT for Education in Caribbean. World Bank. InfoDev. (2009b). Various Country Study Reports and Case Study Working Papers (Argentina, Brazil, Kenya, Korea, Morocco, Nigeria, Philippines, Peru, Senegal, Sudan, Tunisia, Uganda, and Vietnam) in regard to technology and or education. World Bank. InfoDev. (2010). Survey of ICT for Education in India and South Asia Countries. World Bank. International Development Research Centre. (2010). Digital Review of Asia Pacific. Oxford Dictionaries. (2011). Available at: http://oxforddictionaries.com/ Ross, T.W., & Bailey, G.D. (1995). Technology-Based Learning; A Handbook for Principals and Technology Leaders. New York: Scholastic Inc. World Economic Forum. (2016). The Global Competitiveness Report. Geneva. IJEME, Vol. 1, No. 1, March 2017, 11-24 International Journal on Emerging Mathematics Education (IJEME) Vol. 1, No. 1, March 2017, pp. 87-98 P-ISSN: 2549-4996, E-ISSN: 2548-5806, DOI: http://dx.doi.org/10.12928/ijeme.v1i1.5698 Using the 5E Learning Cycle with Metacognitive Technique to Enhance Students’ Mathematical Critical Thinking Skills 1Runisah, 2Tatang Herman, 2Jarnawi Afgani Dahlan 1Wiralodra University, Jl. Ir. H Juanda KM. 03, Karanganyar, Indramayu, Jawa Barat 45213 2 Universitas Pendidikan Indonesia, Jl. Dr. Setiabudhi, No. 229, Bandung 40154 Email: [email protected] Abstrak Penelitian ini bertujuan untuk mendeskripsikan peningkatan dan pencapaian kemampuan berpikir kritis matematis siswa yang menerima Learning Cycle 5E dengan teknik Metakognitif, Learning Cycle 5E, dan pembelajaran konvensional. Penelitian ini menggunakan metode eksperimen dengan desain kelompok kontrol pretest-posttest. Populasi adalah siswa SMP di Indramayu, Indonesia. Sampel terdiri dari tiga kelas siswa kelas VIII dari sekolah level tinggi dan tiga kelas dari sekolah level sedang. Hasil Penelitian mengungkapkan bahwa dilihat dari siswa secara keseluruhan, peningkatan dan pencapaian kemampuan berpikir kritis siswa yang menerima Learning Cycle 5E dengan teknik Metakognitif lebih baik dari siswa yang menerima Learning Cycle 5E dan pembelajaran konvensional. Kemampuan berpikir kritis matematis siswa yang menerima Learning Cycle 5E lebih baik dari siswa yang menerima pembelajaran konvensional. Tidak ada pengaruh interaksi antara model pembelajaran dan level sekolah terhadap peningkatan dan pencapaian kemampuan berpikir kritis matematis siswa. Kata Kunci: kemampuan berpikir kritis matematis, learning cycle 5E, teknik metakognitif Abstract This study aims to describe enhancement and achievement of mathematical critical thinking skills of students who received the 5E Learning Cycle with Metacognitive technique, the 5E Learning Cycle, and conventional learning. This study use experimental method with pretest-posttest control group design. Population are junior high school students in Indramayu city, Indonesia. Sample are three classes of eighth grade students from high level school and three classes from medium level school. The study reveal that in terms of overall, mathematical critical thinking skills enhancement and achievement of students who received the 5E Learning Cycle with Metacognitive technique is better than students who received the 5E Learning Cycle and conventional learning. Mathematical critical thinking skills of students who received the 5E Learning Cycle is better than students who received conventional learning. There is no interaction effect between learning model and school level toward enhancement and achievement of students’ mathematical critical thinking skills. Keywords: mathematical critical thinking skills, 5E learning cycle, metacognitive technique. How to Cite: Runisah, Herman, T., & Dahlan, J.A. (2017). Using the 5E learning cycle with metacognitive technique to enhance students’mathematical critical thinking skills. International Journal on Emerging Mathematics Education, 1(1), 87-98. http://dx.doi.org/10.12928/ijeme.v1i1.5698. INTRODUCTION Various problems in life aspect occur in 21st century. To overcome this problem, critical thinking skills is needed. According to Chukwuyenum (2013), critical thinking had been used as one way to solve the problem in daily life because it involve logical reasoning, interpretation, analysis, and evaluate information so enable us to obtain valid and reliable decision. Based on his idea, someone who has critical thinking skills, will chose problem representation which is most suitable to help solving the problem, Received January 24, 2017; Revised February 24, 2017; Accepted February 27, 2017 88 P-ISSN: 2549-4996 | E-ISSN: 2548-5806 then choosing and using strategy to solve the problem which is backed by reason and proof. Therefore, problem solving and decision which are given will be backed by accurate reason or proof. Critical thinking skills is high order thinking skills, in which the expert defined it in different way. One of them is Ennis (1996) who defined, “Critical thinking is reasonable and reflective thinking that is focused on deciding what to believe or do”. Reflective means consider or thinking again everything faced before making decision. Reasonable means that all beliefs, views, or everything done which are backed by appropriate proof or reason. Critical thinking is needed in various domains, included mathematics. According to Balcaen and Klassen (2007), critical thinking in mathematics is involvement of thinking through mathematical problem and make reasonable assessment about strategy, approach and solution. In accord with that opinion, Glazer (2001) explained that critical thinking in mathematics is ability and disposition to involve prior knowledge, mathematical reasoning, and cognitive strategy to generalize, prove or evaluate unfamiliar mathematical situation reflectively. Based on that definition, critical thinking in mathematics can be defined as ability to integrate prior knowledge, mathematical reasoning, and problem solving strategy to solve mathematical problem reflectively. Furthermore, according to Innabi (2003) critical thinking aspect related with learning material which comprise: concept, generalization, skill, algorithm and problem solving. `The result of study shows that mathematical critical thinking skills (MCTS) development can enhance mathematics achievement (Jacob, 2012; Chukwuyenum, 2013). Similarly, critical thinking skills will encourage students to think independently and solve problem in school or in the context of everyday life (Jacob, 2012). Critical thinking is not limited to reflection, inference, and synthesis the information, enable individual to make reasonable assessment not only in class but in daily life (Beaumont, 2010). Thus, the critical thinking skills, which are developed through learning activities in the school, will be useful for solving various problems, whether the problems are directly related to learning activity or problems are in their daily lives. In other words, through the critical thinking skills, students will be able to consider or choose the important information that can be used to solve problems or to make a reasonable decision. By viewing how important the development of critical thinking skills is, so that critical thinking development become curriculum agenda in the world, particularly in Indonesia. In Kurikulum Tingkat Satuan Pendidikan (KTSP) as stated in Permendiknas No.22 (2006) which is applied in Indonesia, it is implied that mathematics need to be given to students to equip them with logical, analytical, systematical, critical and creative thinking. Furthermore in 2013 curriculum as stated in Permendikbud No. 64 (2013), critical thinking is also stated in learning mathematics. Even though critical thinking skills become one goal in learning particularly in mathematics, several result of study showed that this ability is still low. Hiebert (Lithner, 2008) reported that generally students still use thinking based on memorization than doing reasoning process in solving mathematical problem. The result of Trends in International Mathematics and Science Study (TIMSS) in 2007 and 2011 showed that average score of mathematics subject achievement in 2011 is in 38th rank from 42 participating countries. In TIMSS 2011 students are involved in various cognitive processes to solve the problem (Mullis, et al., 2012). Furthermore, Runisah IJEME, Vol. 1, No. 1, March 2017, 87-98 IJEME ISSN: 2549-4996 89 (2015) based on study toward 8th grade in Indramayu city, Indonesia concluded that students’ mathematical critical thinking skills is still low. One factor which affect the lack of students’ mathematical critical thinking skills is learning process. In Indonesia, teaching practice focused on material content and ignore students’ thinking ability development (Rohaeti, 2010). From inquiry result of Balitbang Depdiknas in 2007, implementation of learning in Indonesia generally still use lecture and ask and answer methods (Balitbang Depdiknas, 2007). Activities can be done to enhance critical thinking skills in mathematics such as comparing, making conjecture, making induction, making generalization, making specialization, making classification, making deduction process, making visualization, ordering, making prediction, making validation, proving, analyzing, evaluating and making pattern, and determining functional relation among variables (Appelbaum, 1999). Futhermore, Beaumont (2010) stated that to enhance students’ critical thinking skills, exercises in the form of task need to be given which require high reasoning to solve it such as task of observing, identifying assumption, challenging material to be understood, task of interpreting, task of discovering and investigating, task of analyzing and evaluating, and task of making decision. Therefore, to enhance mathematical critical thinking skills non routine tasks or problem need to be given which require high reasoning to solve it. One of learning potentially to develop critical thinking skills is the 5E Learning Cycle with Metacognitive technique (LCM). The 5E Learning Cycle (LC) is developed by researcher team of Biological Science Curriculum Study leaded by Bybee. According to Bybee, et al. (2006) LC is influenced by Herbart psychology, John Dewey and Jean Piaget thinking. Piaget with his constructivism principle viewed that knowledge is not a set of fact, concept or rule which are ready to be transferred by teacher. Students should construct that knowledge and give meaning through various experiences in learning. According to Bybee, et al. (2006), LC has five stages namely: engage, explore, explain, elaborate and evaluate. In engage stage, teacher access students’ prior knowledge and help them to involve in new concept which encourage students interest to learn. In explore stage, students are involved in concept exploration activity to produce new ideas. In explain stage, students explain conceptual understanding or process skill which is obtained in earlier stage. In evaluate stage, assessment is done toward students’ understanding and ability and give opportunity to teacher to evaluate students progress to achieve educational aim. Those five stages can trigger students’ critical thinking skill, because it involve prior knowledge, non routine situation, reasoning, cognitive strategy, and involve students in discussion to do exploration. This is in accord with Appelbaum (1999) and Beaumont (2010) argument that condition to critical thinking in mathematics beside should contain non routine situation, it should also use prior knowledge, reasoning, and cognitive strategy. Furthermore, according to Slavin (2011), critical thinking teaching which is effective depend on classroom determination that encourage different point of view acceptance and free discussion. Meanwhile, according to Ergin (Tuna & Kacar, 2013), LC involve high thinking order skill by stimulating students to explore. LC transmit critical thinking skill to students. The 5E Learning Cycle with Metacognitive technique (LCM) is learning model that integrate directly Metacognitive technique in every stage of LC. Metacognition is a term introduced by Flavell in 1976. Flavell (Lioe et al., 2006) stated that metacognition is one’s consciousness about his/her cognitive process and independency to achieve Using the 5E learning cycle with metacognitive technique to enhance students’ … Runisah, Herman, & Dahlan 90 P-ISSN: 2549-4996 | E-ISSN: 2548-5806 the goal. One of Metacognitive technique is self-asking. In this study, question which is made focus on three categories which are adopted from Beeth (Mittlefehldt & Grotzer, 2003) namely intelligibility, wide-applicability, and plausibility. In first category that is intelligibility, the question asked is, “Am I able to understand the concept I learned?” In second category, that is wide-applicability, the question asked is, “What concept that can be used to solve this problem?” or, “Is the concept I learned can be used to solve the problem in another domain or in daily life?” In third category, that is plausibility, the question asked is: “Is problem solution I made can be reliable?”. From the above description, the use of LCM has potencies to develop students' critical thinking skills. However, the application of the model may not have the same effectiveness when it is applied to students who have different academic abilities. In other words, the use of LCM is probably more effective for a particular group rather than it is used in another group of students who have different academic abilities. This is in line with the opinion of Glazer (2001) as it is already described above that in critical thinking, reasoning, cognitive strategies, and the students' prior knowledge play an important role. Therefore, the analysis of the effect of the interaction between learning model and school level toward critical thinking skills is required. It is useful to determine the level of school where the model can be used more effectively, because the two level different schools have difference in a student's academic ability There have been numerous studies on several approaches that is different from the 5E Leaning Cycle to enhance students’ mathematical critical thinking skills (Rohaeti, 2010; Noer, 2010; Kurniati, Kusumah, Sabandar, & Herman, 2015; Yumiati, 2015; Firdaus, Kailani, Nor, & Bakry, 2015). Moreover some studies have been done on the use of the 5E Learning Cycle to enhance students’ mathematical critical thinking skills (Erlian, 2009; Fatimah, 2012; Sofuroh, Masrukan, & Kartono, 2014; Kadarisma, 2015). Meanwhile some studies on Metacognitive empowermen has been done (Schraw (Toit & Kotze, 2009); Camahalan, 2006; Ozcan & Erktin, 2015). However, only a bit studies on the use of the 5E Leaning Cycle with Metacognitive technique to enhance students’mathematical critical thinking skills. Therefore, the present study purposes at examining the use of the 5E Learning Cycle with Metacognitive technique to enhance mathematical critical thinking skills of junior high school students in one of cities in west Java, Indonesia. This research was conducted in Indramayu west Java by considering the problem that is identified by the preliminary study. Preliminary study involved 33 junior high school students (Runisah, 2015) reported that the mean score of MCTS test of students is only 5.19 of the ideal maximum score of 16. This result shows that the critical thinking skills of students in mathematics were still low. Every question on the MCTS test was related to aspects of critical thinking with mathematical content, it includes concepts, generalization, and problem solving RESEARCH METHOD This study is a experimental with pretest-postest control group design (Ruseffendi, 2005) described as follows: A O X1 O A O X2 O AOO IJEME, Vol. 1, No. 1, March 2017, 87-98 IJEME ISSN: 2549-4996 91 Information: A : The selection of a random sample of classes at population X1: The application of The 5E Learning Cycle with Metacognitive technique (LCM) X2 : The application of The 5E Learning Cycle (LC) O: Mathematical Critical Thinking Skills (MCTS) test (pretest-postest). Population and Sample Population in this study is junior high school students in Indramayu city, West Java Province, Indonesia. The sample is eighth grade students amounting to 173 students from two school level, classified as high amounting to 83 students and medium level amounting to 90 students. The selection of that school is done randomly from all Junior High Schools in Indramayu city. Three classes are selected randomly from each school level, one class are taught by LCM, one class are taught by LC, and one class are taught by CL. The determination of school level, based on accreditation score which is valid until the year 2016. Instruments Instrument used in this study consist of mathematical critical thinking skills (MCTS) test, Mathematical Prior Ability (MPA) test, and observation sheet. MCTS test consist of 10 items with ideal maximum score is 40. MCTS test is given to students before and after learning is implemented. MCTS test material is tailored with material given in the time of study, that is material of 8th semester 1 is in accord with curriculum used. Before used, the experts consider MCTS test to fulfill face and content validity. Then try out test is done in limited scale. After being improved, instrument is tested in wide scale. Based on test result, it is obtained that test is valid and reliable with reliability coefficient r = 0.84 and according to Creswell (2012) it is in high category. Meanwhile, material of MPA test was adjusted to the subject matter of Mathematics, which has been studied in the previous semester refered to the curriculum. Based on analysis, it is obtained that test is valid and reliable with reliability coefficient r = 0.83 for MPA test, with objective form and r = 0.64, for MPA with analytical test. MPA test is used for further convince that the MPA at the high school level is better than MPA at medium school level. The magnitude of students’ MCTS achievement is obtained from MCTS posttest score. The formulation developed by Meltzer (2002) is used to calculate the magnitude of enhancement. Whereas the calculation result of gain is interpreted by using classification of gain from Hake (1998). RESULTS AND DISCUSSION This part describes the result of the study and its discussion which is related to the relevant studies and theories. The Enhancement of Students’ MCTS Base on normality test, data were not distributed normally. Therefore KruskalWallis test is used to test mean difference of MCTS enhancement which is presented in Table 1. Using the 5E learning cycle with metacognitive technique to enhance students’ … Runisah, Herman, & Dahlan 92 P-ISSN: 2549-4996 | E-ISSN: 2548-5806 Table 1. Summary of Mean Difference Test of MCTS Enhancement School g Group Chi-Square Sig. Ho Level High LCM; LC; CL 0.68; 0.53; 0.38 30.21 0.000 rejected Medium LCM; LC; CL 0.57; 0.46; 0.30 35.07 0.000 rejected Total LCM; LC; CL 0.62; 0.49; 0.34 57.37 0.000 rejected LCM : The 5E Learning Cycle with Metacognitive Technique LC : The 5E Learning Cycle CL : Conventional Learning. Based on Table 1, it is found out that probability (significance) value is less than significance degree α = 0.05, thus H0 is rejected. Thus in terms of overall and for each school level, it means that at least there is one group who has mean gain which is different from another group. Further, based on Multiple Comparison Between Treatments test at significance degree α = 0.05, in terms of overall and in high school level, MCTS enhancement of students who are taught by LCM is better than students who are taught by LC and students who are taught by CL. MCTS enhancement of students who are taught by LC is better than students who are taught by CL. In medium school level, there is no difference of MCTS enhancement between students who are taught by LCM and students who are taught by LC. However, MCTS enhancement of students who are taught by LCM and students who are taught LC is better than students who are taught by CL. Interaction Effect between Learning Model and School Level toward Students’ MCTS Enhancement The Adjusted Rank Transform test (Leys and Schumann, 2010) is done to find out interaction effect between learning model and school level toward students’ MCTS enhancement. From the calculations, the value of F = 1.20 with a probability value 0.304. Thus, it can be concluded that there is no interaction effect between learning model and school level toward students’ MCTS enhancement. The Achievement of Students’ MCTS The achievement of students’ MCTS is determined based on posttest score. Further, percentage of students’ MCTS achievement can be seen in Figure 1. IJEME, Vol. 1, No. 1, March 2017, 87-98 IJEME ISSN: 2549-4996 93 Figure 1. The Achievement of Students’ MCTS From Figure 1, it can be found out that in terms of overall and for all school levels, MCTS achievement of LCM group is higher than LC group and CL group, MCTS achievement of LC group is higher than CL group. Further, based on normality test, data were not distributed normally. Therefore Kruskal Wallis test is used to test mean difference of MCTS achievement which is presented on Table 2. Table 2. Summary of Mean Difference Test of MCTS Achievement School Level Group Posttest Chi-Square Sig. High LCM; LC; CL 29.31; 24.30; 19.03 24,59 0.000 Medium LCM; LC; CL 25.57; 21.20; 16.57 24.29 0.000 Total LCM; LC; CL 27.30; 22.84; 17.80 44.01 0.000 LCM : The 5E Learning Cycle with Metacognitive Technique LC : The 5E Learning Cycle CL : Conventional Learning. Based on Table 2, it is found out that probability (significance) value is less than significance degree α = 0.05, thus H0 is rejected. Thus in terms of overall and for each school level, it means that at least, there is one group who has mean gain which is different from another group. Further, based on Multiple Comparison Between Treatments test at significance degree α = 0.05, in terms of overall and from high and medium school level, MCTS achievement of students who are taught by LCM is better than students who are taught by LC and students who are taught CL. MCTS achievement of students who are taught by LC is better than students who are taught by CL Using the 5E learning cycle with metacognitive technique to enhance students’ … Runisah, Herman, & Dahlan 94 P-ISSN: 2549-4996 | E-ISSN: 2548-5806 Interaction Effect between Learning Model and School Level toward Students’ MCTS Achievement The Adjusted Rank Transform test (Leys and Schumann, 2010) is done to find out interaction effect between learning model and school level toward students’ MCTS achievement. From the calculations, the value of F = 0.421 with a probability value 0.667. Thus, it can be concluded that there is no interaction effect between learning model and school level toward students’ MCTS achievement. In terms of overall, MCTS enhancement of students who are taught by LCM is 0.62, whereas students who are taught by LC is 0.49 and students who are taught by CL is 0.34. That enhancement is in medium category based on classification of Hake. MCTS enhancement of students is supported by its achievement. MCTS achievement of students who are taught by LCM is 27.30 or 68.3% from ideal maximal score. MCTS achievement of students who are taught by LC is 22.84 or 57.1% from ideal maximal score. Meanwhile, MCTS achievement of students who are taught by CL is 17.80 or 44.5% from ideal maximal score. Furthermore, based on statistic test result MCTS enhancement and achievement of students who are taught by LCM is better than students who are taught by LC and students who are taught by CL. MCTS enhancement and achievement of students who are taught by LC is better than students who are taught by CL. Based on statistic test result, the result obtained in terms of overall and students from high school level has similarity namely MCTS enhancement and achievement of students who are taught by LCM is better than students who are taught by LC and students who are taught by CL. MCTS enhancement and achievement of students who are taught by LC is better than students who are taught by CL. In medium school level, it is concluded that there is no difference of MCTS enhancement between students who are taught by LCM and students who are taught LC in significance degree of 0.05. Nevertheless, MCTS achievement of students who are taught by LCM is higher than students who are taught by LC, MCTS enhancement of students who are taught by LCM is 0.57 and for students who are taught by LC is 0.46. Meanwhile, MCTS enhancement of students who are taught by LCM and students who are taught by LC is better than students who are taught by CL. Futhermore, MCTS achievement of students who are taught by LCM is better than students who are taught by LC and students who are taught by CL. MCTS achievement of students who are taught by LC is better than students who are taught by CL. In general, the result of study show that LCM is better in facilitating students to develop MCTS than LC and CL, and LC is better in facilitating students to develop MCTS than CL. This is possible because in LC students are involved in learning activity actively through discussion to do activities such as comparing, making conjecture, making generalization, making prediction, making validation, analyzing, evaluating, and determining functional relation among variables. Those activities will develop students’ mathematical critical thinking skills. It is proven what has been stated by several expert (Appelbaum, 1999; Beaumont, 2010; Glazer, 2004). In line with it, according to Slavin (2011) the effective teaching of critical thinking depend on determination of classroom which encourage the acceptance of different point of view and free discussion. Furthermore, Ergin (2012) added that the 5E model is the most effective way to involve students in learning. Students involvement in learning will develop their thinking ability among other critical thinking skills. IJEME, Vol. 1, No. 1, March 2017, 87-98 IJEME ISSN: 2549-4996 95 In LC, students are involved in exploration activity toward concept learned, thus students understanding will become deeper. According to Carpenter (Franke & Kazemi, 2001), when individuals learn with understanding, they can use the knowledge to solve new problems. Meanwhile, in CL, teaching and learning activity is more teacher centered. In CL, teacher give concept which is learned directly, students just receive what is delivered by teacher, then students are given problem exercises. Therefore, in CL learning is dominated by teacher. Thus in CL development of critical thinking skills is lacking. In LCM, besides having strengths contained in LC, students’ metacognition is more empowered compared to LC and CL. Students’ metacognition empowerment is done by guiding student to ask themselves and answer it. Therefore, students will try to realize their thinking process. They will think about their experience toward concept, another domain or relation among concepts. This is strongly support development of critical thinking skills that which will be used in solving the problem. This is in accord with Panaoura and Phillippou (2005) that if someone not aware of his/her process and cognitive ability, we will not be able to improve his/her performance. Furthermore, Schraw and Dennison (Panaoura & Philippou, 2005) concluded that students who are skillful in assessing their Metacognitive and aware of their ability to think are better than students who not aware of their mental system mechanism in solving mathematical problem. This study result is in accord with study result of several expert that the use of the 5E Learning Cycle support students’ critical thinking skills in mathematics (Erlian, 2009; Fatimah, 2012; Sofuroh, Masrukan, & Kartono, 2014). Other than, study result of several expert showed that there is positive influence from constructivism based learning toward enhancement of students’ mathematical critical thinking skills (Rohaeti, 2010; Noer, 2010; Kurniati, Kusumah, Sabandar, & Herman, 2015; Yumiati, 2015; Firdaus, et al. , 2015). This study also show that school level factor has significant effect toward achievement and enhancement of students’ MCTS. For each learning model, students in high school level obtain achievement and enhancement of MCTS which is higher than students in medium school level. In other word, students in high school level get more advantage in achievement and enhancement of MCTS than students in medium school level. This occurs because Mathematical Prior Ability (MPA) of students at the high school level is better than MPA of students at medium school Level. MPA of students is one aspect that support the critical thinking skills in mathematics. This is in line with the opinion of Glazer (2001) that the critical thinking in math involves prior knowledge of the students. This study also find that there is no interaction effect between learning model and school level toward enhancement and achievement of students’ MCTS. This is possible because in one class, each discussion group has relatively the same academic ability. Each group consists of students who have the academic ability of high, medium, and low. This condition causes the application of LCM and LC run smoothly on the high school level as well as at the medium school level. Therefore, LCM and LC can be used in medium and high level school, because in whichever level, MCTS enhancement and achievement of students who are taught by LCM will be higher than students who are taught by LC and CL. MCTS enhancement and achievement of students who are taught by LC will be higher than students who are taught by CL. Using the 5E learning cycle with metacognitive technique to enhance students’ … Runisah, Herman, & Dahlan 96 P-ISSN: 2549-4996 | E-ISSN: 2548-5806 CONCLUSION Based on result study, it can be concluded that in terms of overall and in high school level, MCTS enhancement and achievement of students who are taught by LCM is better than students who are taught by LC and students who are taught by CL. MCTS enhancement and achievement of students who are taught by LC is better than students who are taught by CL. In medium school level, there is no difference of MCTS enhancement between students who are taught by LCM and LC, however MCTS enhancement of students who are taught by LCM and students who are taught by LC is better than students who are taught by CL. Whereas, MCTS achievement of students who are taught by LCM is better than students who are taught by LC and students who are taught by CL. MCTS achievement of students who are taught by LC is better than students who are taught by CL. In terms of overall and students in high school level, MCTS enhancement of students who are taught by LCM, LC, and CL is in medium category. In medium school level, MCTS enhancement of students who are taught by LCM and LC is in medium category, but MCTS enhancement of students who are taught by CL is in low category. Thus the application of LCM and LC has not yet gave maximal effect toward students’ MCTS enhancement. Therefore, further research is needed to examine its cause. There is no interaction effect between learning model and school level toward enhancement and achievement of students’ MCTS. Thus, LCM and LC can be used for high school level and medium school level to enhance students’ MCTS. REFERENCES Appelbaum, P.M. (1999). Eight critical points for Mathematics. Retrieved from http://gargoyle.arcadia.edu/appelbaum/8points.html. Badan Penelitian dan Pengembangan Pusat Kurikulum Departemen Pendidikan Nasional (Balitbang Depdiknas). (2007). Kajian kebijakan kurikulum mata pelajaran matematika. Jakarta: Depdiknas. Balcaen, & Klassen (2007). Teaching critical mathematics thinking (mathematical mindedness). Retrieved from http://libocs.lib.sfu.ca:8087/fedcan/index.php/csse/csse2008/paper/view/437/308 Beaumont, J. (2010). A sequence of critical thinking tasks. TESOL Journal,1(4),427-448. Bybee, R.W., Taylor, J. A., Gardner, A., Scotter P.V., Powel, J.C., Westbrook, A., & Landes, N. (2006). The BSCS the 5E instructional model: Origins and effectiveness and applications executive summary. Retrieved from http://bscs.org/sites/default/files/_legacy/BSCS_the 5E_Instructional_ModelExecutive_Summary_0.pdf Camahalan, F. M. G. (2006). Effects of self regulated learning on mathematics achievement on selected southeast asian children. Journal of Instructional Psychology, 33(3), 194-205. Chukwuyenum, A.N. (2013). Impact of critical thinking on performance in mathematics among senior secondary school students in Lagos state. IOSR Journal of Research & Method in Education, 3(5),18-25. Creswell, J.W. (2012). Educational research: planning, conducting, and evaluating. quantitative and qualitative research, 4nd edition. Boston: Pearson Ennis, R.H. (1996). Critical thinking. United States of America: Prentice-Hall, Inc. Ergin, I. (2012). Constructivist approach based 5E model and usability instructional physics. Latin-American Journal of Physics Education, 6(1), 14 – 20. IJEME, Vol. 1, No. 1, March 2017, 87-98 IJEME ISSN: 2549-4996 97 Erlian, B.P. (2009). Pegaruh penggunaan model daur belajar (learning cycle) terhadap kemampuan berpikir kritis dan sikap siswa kelas XI MA Muallimat NW Pancor pada mata pelajaran Matematika. Jurnal Educatio, 2(4). 73-83. Fatimah, N. (2012). Penerapan model learning cycle 5E dalam mata pelajaran matematika untuk meningkatkan kemampuan berpikir kritis siswa SMA. Retrieved from http://repository.upi.edu/, 2012 Firdaus, Kailani, I. Nor, B.B., & Bakry. (2015). Developing critical thinking skills of students in mathematics learning. Journal of Education and Learning, 9(3), 226236. Franke, M.L., & Kazemi, E. (2001). Learning to teach mathematics focus on student thinking. Theory into practice. 40(2), 102 – 109. Glazer, E. (2001). Using web sources to promote critical thinking in high school Mathematics. Retrieved from http://math.unipa.it/~grim/AGlazer79-84.PDF Glazer, E (2004). Technology enhanced learning environments that are conductive to critical thinking in mathematics: Implication for research about critical thinking on the world wide web. Retrieved from http://www.http://lonstar.texas.net/ ~mseifert / crit2.html. Hake, R.R. (1998). Interactive-engagement versus traditional methods: A sixthousand-student survey of mechanics test data for introductory physics courses. Journal of American Association of Physics Teachers, 66(1), 64 -74. Innabi, H. (2003). Aspects of critical thinking in classroom instruction of secondary school mathematics teachers in jordan. Proceedings of The International Conference, Brno, Czech Republik. 124-129. Jacob, S.M. (2012). Mathematical achievement and critical thinking skills in asynchronous discussion forums. Procedia - Social and Behavioral Sciences, 31 (2012), 800 – 804. Kadarisma, G. (2015). Meningkatkan kemampuan berpikir kritis dan logis matematis serta kemandirian belajar siswa SMP melalui learning cycle 5E dan discovery learning. Thesis. Bandung: Sekolah Pascasarjana Universitas Pendidikan Indonesia. Kurniati, Kusumah, Y.S., Sabandar, J., & Herman, T. (2015). Mathematical critical thinking skills through contextual teaching and learning. IndoMS-JME, 6(1), 5362. Leys, C., & Schumann, S. (2010). A nonparametric method to analyze interactions: The adjusted rank transform test. Journal of Experimental Social Psychology, 46(4), 587-700. Lioe, L.T, Fai, H.K., & Hedberg, J.G. (2006). Students’metacognitive problem solving strategies in solving open-ended problems in pairs. Retrieved from http://conference.nie.edu.sg/paper/new converted/aboo 287.pdf. Lithner. (2008). Research framework for creative and initative reasoning. Educational Studies in Mathematics, 67(3), 255-276. Meltzer, D.E. (2002). The relationship between mathematics preparation and conceptual learning gains in physics: a posible “hidden variable” in diagnostic pretest scores. American Journal of Physics, 70(12), 1259-1268. Mittlefehldt, S., & Grotzer, T. (2003). Using metacognition to facilitate the transfer of causal models in learning density and pressure. National Association of Research in Using the 5E learning cycle with metacognitive technique to enhance students’ … Runisah, Herman, & Dahlan 98 P-ISSN: 2549-4996 | E-ISSN: 2548-5806 Science Teaching (NARST) Conference, Philadelphia. Retrieved from ftp://202.83.110.129/Volume_1/from%202TB/FAC%20FOUND%20STUDIES/F YLHR9~3/Subject%20Components/Grey%20matters/R54ZK1~G/Project%20Z ero/www.pz.harvard.edu/Research/UCPpapers/P2NARST03.pdf. Mullis, I.V.S., Martin, O., Michael, Foy, P., & Arora, A. (2012). TIMSS 2011 international result in Mathematics. Retrieved from http://timssandpirs.bc.edu/timss2011/downloads/T11_IR_Mathematics_Fullbo ok.pdf Peraturan Menteri Pendidikan Nasional (Permendiknas) Republik Indonesia No. 22 Tahun 2006 tentang Standar Isi untuk Satuan Pendidikan Dasar dan Menengah. (2006). Jakarta: Departemen Pendidikan Nasional. Peraturan Menteri Pendidikan dan Kebudayaan (Permendikbud) Republik Indonesia Nomor 64 Tahun 2013 tentang Standar Isi Pendidikan Dasar dan Menengah. (2013). Jakarta: Departemen Pendidikan dan Kebudayaan. Noer, S.H. (2010). Peningkatan kemampuan berpikir kritis, kreatif, dan reflektif (K2R) matematis siswa SMP melalui pembelajaran berbasis masalah. Dissertation. Bandung: Sekolah Pascasarjana Universitas Pendidikan Indonesia. Ozcan, Z.C., & Erktin, E. (2015) Enhancing mathematics achievement of elementary school students through homework assignments enriched with metacognitive questions. Eurasia Journal of Mathematics, Science & Technology Education, 11(6), 1415-1427. Panaoura, A., & Philippou, G. (2005). The measurement of young pupils ́ metacognitive ability in mathematics: the case of self-representation and self-evaluation. Proceeding of the Fourth Congress of the European Society for Research in Mathematics Education, Spain, 255-264. Rohaeti, E.E. (2010). Critical and creative mathematical thinking of junior high school student. Educationist Journal, 4(2), 99-106. Runisah. (2015). A Study on critical thinking skills in mathematics on eight grade students. Proceeding of International Seminar on Mathematics, Science, and Computer Science Education (pp. 5-10), Bandung. Ruseffendi, E.T. (2005). Dasar-Dasar Ilmu Pendidikan & Bidang Non Esakta Lainnya. Bandung: Tarsito. Slavin, E.R. (2011). Educational psychology: theory and practice, 9th ed. New Jersey: Pearson Education, Inc. Sofuroh, Masrukan, & Kartono. (2014). Model learning cycle 5E dengan pendekatan scientific untuk meningkatkan disposisi matematis dan berpikir kritis. Journal of Mathematics Education Research, 3(2), 91-97. Toit, S.D., & Kotze, G. (2009). Metacognitive strategies in the teaching and learning of Mathematics. Pythagoras, 70, 57-67. Tuna, A., & Kacar, A. (2013). The effect of the 5E learning cycle model in teaching trigonometry on students’ academic achievement and the permanence of their knowledge. International Journal on New Trends in Education and Their Implications, 4(1), 3-87. Yumiati. (2015). Meningkatkan kemampuan berpikir aljabar, berpikir kritis matematis, dan self-regulated learning siswa SMP melalui pembelajaran CORE (Connecting, Organizing, Reflecting, Extending). Dissertation. Bandung: Sekolah Pascasarjana Universitas Pendidikan Indonesia. IJEME, Vol. 1, No. 1, March 2017, 87-98