Announcements

We moved to the English version of our course site:
https://www.fer.unizg.hr/en/course/taar

Author: Josip Jukić
TAR 2023: Important information for...

We're excited to announce the 10th edition of the Text Analysis and Retrieval (TAR) course. If you're interested in search engines, text analysis, statistical natural language processing, and the application of machine learning to natural language processing, then this course is for you. However, since TAR might be a bit different from the courses you took in the past, we ask you to consider the following information before you make your final decision about enrolling in the course.

  1. Due to organizational constraints, we accept a limited number of students. This year, we offer 51 places and the selection among the applicants will be based on the motivation letter and undergraduate grade point average. Four spots will be reserved for Erasmus students.
  2. TAR is taught in English only (level L3). All course material, including tests, will be in English. There is no "Croatian group" and no material in Croatian. If you enroll in this course, we assume that you're accepting these terms and that you have a good enough command of English to follow the lectures and participate in class.
  3. TAR has no midterms and no finals. Instead, you are expected to attend all classes and participate. You can be absent from some of the classes but at the cost of losing points. Note that you need to meet several thresholds, so being absent from too many classes means you risk failing the course and having to enroll again next year. Be aware that the course can only be passed via continuous (in-class) assessments; there are no regular exams for this course.
  4. In the first half of the course, there is weekly prep work (pre-reading and/or watching video lectures), so you will be expected to prepare for each class. There will be weekly quizzes that cover prep work as well as last week's in-class material. Additionally, you will have several lab assignments to solve during the first half of the course.
  5. In the second half of the course, the classes will revolve exclusively around paper reading sessions. What this means is that you will be asked to read scientific papers (which are in English) published at recent and renowned conferences, summarize the papers, answer key questions about them, and finally participate in discussions, which we'll be having together in class. There will be no way around this: reading sessions are an integral part of the course, we're doing them for you and you only, and you can't make up for them by doing something else. There will be reading quizzes, and if you don't attend the reading, you'll lose points and risk not meeting the threshold. Now, why are we doing all this? Because we firmly believe that being able to read, review, and discuss scientific papers is a tremendously important skill, regardless of whether you intend to pursue an academic career. We're also doing it because we found out it's a much better and more amusing way to engage with the topics we want to cover. And while most students felt that reading and discussing papers was indeed a lot of fun, it is certainly not for everybody.
  6. The central activity of the course is team project work. Projects are done primarily in the second half of the course and revolve around a practical and trendy information retrieval or natural language processing task. You get to choose a topic from a list of topics. Three points deserve a mention here. First, the projects are done in 3-person teams. There is no way around this; you can't do the project on your own, and you have to team up by yourself. Second, the project results will need to be wrapped up in the form of a short scientific paper. You can write the paper in English or Croatian. Third, you'll present your work in a 15-min talk at the end of the course. This again can be done in English or Croatian, and it suffices that one project member presents your work. However, all students must attend all presentations.
  7. Although machine learning is not a formal prerequisite for TAR, taking the course without knowing the ML basics will probably cause much frustration. Here we don't necessarily mean that you should have completed FER's ML course (or that you intend to complete it before TAR starts in the summer semester): any other course or self-study that provided you with the basics of ML will be fine. On the other hand, if you have had absolutely no prior exposure to ML, we don't advise enrolling in TAR.

If all this sounds like your cup of tea, we'll be happy to have you on board! ;)
In the extended post, you can read about the previous experiences of students who have completed the course.

Author: Josip Jukić

Project presentations will be divided into two sessions: on Monday, June 6 (A-301), and Tuesday, June 7 (A-211). You can find the schedule in the extended post below. The talks should be strictly 10 minutes long, with an additional 5 minutes for Q&A (and a further 5 minutes for picking up any slack due to technical difficulties). You should attend all presentations in both sessions.

Author: Josip Jukić

It's time for a new adventure. If you find research in NLP compelling, this might be an interesting opportunity.

TakeLab is looking for new interns on multiple projects. Please apply before 27 May 2022 if you wish to join our ranks. You can find out more details and how to apply in this post.

Author: Josip Jukić

We have uploaded the second lab assignment, which you can find in the repository. To set up your assignment, follow the instructions in Instructions.pdf, which is included in the archive. Once you solve the assignment, upload the corresponding Jupyter notebook to FER-Moodle under the Lab assignment 2 activity.

The submission deadline is May 4, 2022, 23:59 CET.

As we don't expect you to solve lab assignments during the midterms, the deadline is after the midterms.

As always, if you have any questions, don't hesitate to contact us via the form.

Author: Josip Jukić

We have organized the first project checkpoint (alpha) on Monday, April 4, and Tuesday, April 5. It will be held in classroom D306 from 16:00 to 18:30 (Monday) and from 11:30 to 14:00 (Tuesday), with 15 minutes allocated per team. Whenever possible, all team members should be present at your assigned time slot. Please follow the time slots given below (in the extended post). To minimize unnecessary waiting for the teams scheduled after you, please don't be late.

We advise you to check out this year's project vademecum before the checkpoint, where you can find general information about the project as well as what is expected for each milestone.

You may switch slots in agreement with other teams, but let us know about it via e-mail (josip.jukic@fer.hr). Likewise, if your entire team can't make your assigned time slot, let us know ASAP.

Author: Josip Jukić