News

The Text Analysis and Retrieval course will be on hiatus for one year. This means the course will not be offered in the next academic year (2024/2025).

However, we are excited to announce that when we return in the summer semester of the 2025/2026 academic year, the course will be revamped to include the latest advancements in natural language processing.

We look forward to bringing you an updated and enriched learning experience!

Author: Jan Šnajder
TAR 2024: Important information for...

We're excited to announce the 11th edition of the Text Analysis and Retrieval (TAR) course. If you're interested in search engines, text analysis, statistical natural language processing, and the application of machine learning to natural language processing, then this course is for you. However, since TAR might be a bit different than the courses you took in the past, we ask you to consider the following information before you make your final decision about enrolling in the course.

  1. Due to organizational constraints, we accept a limited number of students. This year, we offer 51 places and the selection among the applicants will be based on the motivation letter and undergraduate grade point average. Four spots will be reserved for Erasmus students.
  2. TAR is taught in English only (level L3). All course material, including tests, will be in English. There is no "Croatian group" and no material in Croatian. If you enroll in this course, we assume that you're accepting these terms and that you have a good enough command of English to follow the lectures and participate in class.
  3. TAR has no midterms and no finals. Instead, you are expected to attend all classes and participate. You can be absent from some of the classes but at the cost of losing points. Note that you need to meet several thresholds, so being absent from too many classes means you risk failing the course and having to enroll again next year. Be aware that the course can only be passed via continuous (in-class) assessments; there are no regular exams for this course.
  4. In the first half of the course, there is weekly prep work (pre-reading and watching video lectures), so you will be expected to prepare for each class. There will be weekly quizzes that cover prep work as well as last week's in-class material. Additionally, you will have several lab assignments to solve during the first half of the course.
  5. In the second half of the course, the classes will revolve exclusively around paper reading sessions. What this means is that you will be asked to read scientific papers (which are in English) published at recent and renowned conferences, summarize the papers, answer key questions about them, and finally participate in discussions, which we'll be having together in class. There will be no way around this: reading sessions are an integral part of the course; we're doing them for you and you only, and you can't make up for them by doing something else. There will be reading quizzes, and if you don't attend the reading, you'll lose points and risk not meeting the threshold. Now, why are we doing all this? Because we firmly believe that being able to read, review, and discuss scientific papers is a tremendously important skill, regardless of whether you intend to pursue an academic career. We're also doing it because we found out it's a much better and more amusing way to engage with the topics we want to cover. And while most students felt that reading and discussing papers was indeed a lot of fun, it is certainly not for everybody.
  6. The central activity of the course is team project work. Projects are done primarily in the second half of the course and revolve around a practical and trendy information retrieval or natural language processing task. You get to choose a topic from a list of topics. Three points deserve a mention here. First, the projects are done in 3-person teams. There is no way around this; you can't do the project on your own, and you have to team up by yourself. Second, the project results will need to be wrapped up in the form of a short scientific paper. You can write the paper in English or Croatian. Third, you'll present your work in a 15-min talk at the end of the course. This again can be done in English or Croatian, and it suffices that one project member presents your work. However, all students must attend all presentations.
  7. Although machine learning is not a formal prerequisite for TAR, taking the course without knowing the ML basics will probably cause much frustration. Here we don't necessarily mean that you should have completed FER's ML course (or you intend to complete it before TAR starts in the summer semester): any other course or self-study that provided you with the basics of ML will be fine. On the other hand, if you absolutely had no prior exposure to ML, we don't advise enrolling in TAR.

If all this sounds like your cup of tea, we'll be happy to have you on board! ;)
In the extended post, you can read about the previous experiences of students who have completed the course.

Author: Jan Šnajder