摘要:The aim of this research was to assess the classification of science test items of TIMSS grade 8 based on higher order thinking skills (HOTS) and determine whether those classified-science test items can be an assessment tool in science class. Sixteen sample test items of HOTS were chosen from 37 reasoning items of TIMSS 1999, 2003, and 2011; which were 6 of analysing, 6 of evaluating, and 4 of creating. The selected items were tested to 410 ninth grade students in 14 public schools in Jember, Indonesia. Data were analysed by using point-biserial correlation to measure the index of discrimination and degree of difficulty at items of each level of HOTS test. The result revealed that the point-biserial index of discrimination for each item was higher than 0.25. The degree of difficulty of analysing, evaluating and creating test items exhibited a similar trend, which was in good range. Each test item has significant validity. Whilst reliability analysis showed that each test item was acceptable and indicating a high level of internal consistency. In conclusion, the classified science test items of TIMSS are good to use as assessment tools to measure HOTS of students in science class.