Info 2402 information retrieval technologies course_outline
1. INTERNATIONAL ISLAMIC UNIVERSITY MALAYSIA
COURSE OUTLINE
Kulliyyah Information and Communication Technology
Department Information Systems
Programme Bachelor of Information Technology
Course Title Information Retrieval Technologies
Course Code INFO 2402
Status Department Required
Level 2
Credit Hours 3
Contact Hours 3
Pre-requisites
(if any)
None
Co-requisites
(if any)
None
Instructional
Strategies Lecture
Class Discussion
Course
Assessment
State weightage
of each type of
assessment.
Learning Outcome Method %
1, 2 Assignment (Individual) 15
3 Web-based IR system development
(Group project)
20
3 Tutorials/Quizzes (Individual) 10
1, 2 Midterm 15
1, 2, 3 Final Examination 40
Total 100
1
2. Instructor(s) Mohd Izzuddin Mohd Tamrin
Room: Level 5, KICT
Tel.: 03-6196-6429
Fax: 03-6196-5179
Email: izzuddin@iium.edu.my
Semester
Offered
Semester I & Semester II
Course Synopsis This course covers an overview of information retrieval technologies, IR
models, Web retrieval technologies, relevance feedback, stemming, XML
retrieval, multimedia information retrieval technologies, evaluation of
information retrieval technologies, citation tracker, and retrieval texts.
Course
Objectives
The objectives of this course are to:
1. introduce and expose students to the concepts, models and
technologies for an information retrieval system
2. enable students to evaluate critically the model and technologies
implemented in commercial IR systems and search engines
3. enable students to work as a team to apply the model and
technologies for developing a Web-based information retrieval
system, in the form of a prototype and oral reports
Learning
Outcomes
Upon completion of this course, students should be able to:
1. understand the concepts, models and technologies for an
information retrieval system
2. evaluate critically the model and technologies implemented in
commercial IR systems and search engines
3. work as a team to apply the model and technologies for developing
a Web-based information retrieval system in the form of a
prototype and oral reports
2
3. Content Outlines Weeks Topics Task/Reading
1 Information Retrieval Technologies: An
Overview
- What is Information Retrieval
- Search Engines
- Search Engineers
Croft et. al.,
Chap. 1
2 Architecture of Search Engine
- Text Acquisition
- Text Transformation
- Index Creation
- User Interaction
- Ranking
- Evaluation
Croft et. al.,
Chap. 2
3 Crawl and Feeds
- Crawling the Web
- Crawling Documents and Email
- Document Feeds
- Storing Documents
- Detecting Duplicates
Croft et. al.,
Chap. 3
4 Processing Text
- From Word to Term
- Text Statistics
- Document Parsing
- Link Analysis
- Information Extraction
Croft et. al.,
Chap. 4
5 Ranking with Indexes – Part A
- Inverted Indexes
- Compression
- Auxiliary Structure
Croft et. al.,
Chap. 5
6 Ranking with Indexes – Part B
- Index Construction
- Query Processing
Croft et. al.,
Chap. 5
7 Queries and Interface
- Information Needs and Queries
- Query Transformation and Refinement
- Showing Results
Croft et. al.,
Chap. 6
3
4. 8 Retrieval Models
- Probabilistic Models
- Ranking on Language Model
- Complex Queries
- Machine Learning and IR
Croft et. al.,
Chap. 7
9 Evaluation of information retrieval
technologies
- Logging
- Effectiveness Metrics
- Efficiency Metrics
- Training, Testing and Statistics
Croft et. al.,
Chap. 8
10 Classification and Clustering
- Naïve Bayes
- Support Vector Machine
- Hierarchical Clustering
- K-Means Clustering
- K Nearest Neighbor Clustering
Croft et. al.,
Chap. 9
11 Searching in Social Environment
- Tag and Manual Indexing
- Searching with Communities
- Filtering and Recommendation
- Peer to Peer and Metasearch
Croft et. al.,
Chap. 10
12 Other IR Models
- Structure Based Retrieval Model
- Retrieval Model for Picture and Music
Croft et. al.,
Chap. 11
Presentation for Group Project
Submission of Project Report and
Individual Assignment
References Required
Croft, W.B., Metzler, D., & Strohman, T. (2010). Search engines:
Information retrieval in practice. Boston: Pearson.
Recommended
Answers Corporation. (2008). Answers.com. Retrieved from
http://www.answers.com
Bakkalbasil, N., Bauer, K., Glover, J, & Wang, L. (2006). Three options
for citation tracking: Google Scholar, Scopus and Web of Science.
4
5. BioMed Digital Libraries. 29, 3-7.
Google. (2008). Google. Retrieved from http://www.google.com.my/
IBM. (2008). IMARS. Retrieved from
http://www.alphaworks.ibm.com/tech/imars
Islamicity.com. (2008). Islamicity. Retrieved from
http://www.Islamicity.com
Othman, R., & Noordin, M.F. (2005). Web of knowledge for Quranic
text: A proposed structure. In Mohd. Yusof, Z., Mohd. Noah, S.A.,
Mat Zin, N.A., Salim, J., & Jaafar, A. (Eds.), CAMP ’05: Seminar
Capaian Maklumat dan Pengurusan Pengetahuan (pp. 199-210).
Bangi, Malaysia: Universiti Kebangsaan Malaysia.
Swoogle. (2008). Swoogle. Retrieved from http://swoogle.umbc.edu/
Yahoo! (2008). Yahoo! Retrieved from http://www.yahoo.com
5
6. BioMed Digital Libraries. 29, 3-7.
Google. (2008). Google. Retrieved from http://www.google.com.my/
IBM. (2008). IMARS. Retrieved from
http://www.alphaworks.ibm.com/tech/imars
Islamicity.com. (2008). Islamicity. Retrieved from
http://www.Islamicity.com
Othman, R., & Noordin, M.F. (2005). Web of knowledge for Quranic
text: A proposed structure. In Mohd. Yusof, Z., Mohd. Noah, S.A.,
Mat Zin, N.A., Salim, J., & Jaafar, A. (Eds.), CAMP ’05: Seminar
Capaian Maklumat dan Pengurusan Pengetahuan (pp. 199-210).
Bangi, Malaysia: Universiti Kebangsaan Malaysia.
Swoogle. (2008). Swoogle. Retrieved from http://swoogle.umbc.edu/
Yahoo! (2008). Yahoo! Retrieved from http://www.yahoo.com
5