<!ELEMENT bs-submission (topic-fields, description, topic+)>
<!ATTLIST bs-submission
participant-id CDATA #REQUIRED
run-id CDATA #REQUIRED
paired-run-id CDATA #REQUIRED
task (book-retrieval) #REQUIRED
query (automatic | manual) #REQUIRED
result-type (book) #REQUIRED
retrieval-type (non-specific | book-specific) #REQUIRED
>
<!ELEMENT topic-fields EMPTY>
<!ATTLIST topic-fields
title (yes|no) #REQUIRED
description (yes|no) #REQUIRED
narrative (yes|no) #REQUIRED
>
<!ELEMENT description (#PCDATA)>
<!ELEMENT topic (book+)>
<!ATTLIST topic topic-id CDATA #REQUIRED >
<!ELEMENT book (bookid, rank?, rsv?)>
<!ELEMENT bookid (#PCDATA)>
<!ELEMENT rank (#PCDATA)>
<!ELEMENT rsv (#PCDATA)>
Each submission run must contain the following:
| •@participant-id: | The Participant ID number of the submitting institute. |
| •@run-id: | A run ID, which must be unique across all submissions sent from one organization. Please use meaningful but short names if possible. |
| •@paired-run-id: | The run-id of the run that the current submission is paired with (e.g., if the current run is the book-specific ranking, the paired run-id should be the id of the generic ranking, so that the two runs can be compared with each other). If a single run is submitted, please use "NA". |
| •@task: | Identification of the task, which should just be "book-retrieval". |
| •@query: | Specification whether the search query was constructed automatically ("automatic") or manually ("manual") from the topic. |
| •@result-type: | Specification of the result-type, which should just be set to "book". |
| •@retrieval-type: | Specifies whether the run is a result of generic, "non-specific" IR methods, or "book-specific" IR techniques that make use of book-specific features or algorithms. |
| •topic-fields: | Specification of which topic fields were used for constructing the search query (i.e., title and/or description and/or narrative). |
| •description: | A description of the retrieval approach applied to generate the run. Please add as much detail as you can, as this would help with the comparison and analysis of the results later on. |
Furthermore, a run should contain the search results for each topic, conforming to the following criteria:
| •topic: | Contains the ranked list of books estimated relevant to the given topic, ordered by decreasing relevance. At most 1000 books should be returned for each topic. |
| •@topic topic-id: | Identifies the topic. |
| •book: | Contains information for each book result in the ranking. |
| •bookid: | Each book should be identified using its bookID, which is the name of the directory that contains the XML source of the book (along with the MARC metadata file). |
| •rank/rsv: | The rank position and RSV value can be recorded for each book in the ranking. Please note, however, that the evaluation will likely rely on the actual ordering of results alone (values of the rank and rsv fields may thus be ignored). |
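As an illustration of the format described above, the following is a minimal Python sketch that assembles a book-retrieval run with the standard library. The function name `build_book_retrieval_run`, its parameters, and the shape of the `rankings` input (topic id mapped to a list of `(bookid, rsv)` pairs) are assumptions made for this example and are not part of the guidelines.

```python
import xml.etree.ElementTree as ET

MAX_BOOKS_PER_TOPIC = 1000  # limit stated in the guidelines


def build_book_retrieval_run(participant_id, run_id, rankings, description,
                             paired_run_id="NA", query="automatic",
                             retrieval_type="non-specific",
                             fields=("title",)):
    """Build a <bs-submission> element for the book-retrieval task.

    `rankings` maps a topic id to a list of (bookid, rsv) pairs; this input
    shape is a hypothetical choice for the sketch.
    """
    root = ET.Element("bs-submission", {
        "participant-id": str(participant_id),
        "run-id": run_id,
        "paired-run-id": paired_run_id,  # "NA" when a single run is submitted
        "task": "book-retrieval",
        "query": query,
        "result-type": "book",
        "retrieval-type": retrieval_type,
    })
    # Record which topic fields were used to construct the query.
    ET.SubElement(root, "topic-fields", {
        name: "yes" if name in fields else "no"
        for name in ("title", "description", "narrative")
    })
    ET.SubElement(root, "description").text = description
    for topic_id, scored in rankings.items():
        if not scored:
            continue  # the DTD requires at least one <book> per <topic>
        topic = ET.SubElement(root, "topic", {"topic-id": str(topic_id)})
        # Order by decreasing score and keep at most 1000 books.
        ranked = sorted(scored, key=lambda pair: pair[1], reverse=True)
        for rank, (bookid, rsv) in enumerate(ranked[:MAX_BOOKS_PER_TOPIC], 1):
            book = ET.SubElement(topic, "book")
            ET.SubElement(book, "bookid").text = bookid
            ET.SubElement(book, "rank").text = str(rank)
            ET.SubElement(book, "rsv").text = str(rsv)
    return root
```

Calling `ET.tostring(run, encoding="unicode")` on the returned element gives the XML text. The sketch does not validate against the DTD, so a run built from an empty `rankings` mapping would still lack the required `topic` elements.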
<!ELEMENT bs-submission (topic-fields, description, topic+)>
<!ATTLIST bs-submission
participant-id CDATA #REQUIRED
run-id CDATA #REQUIRED
task (book-ad-hoc) #REQUIRED
query (automatic | manual) #REQUIRED
result-type (element | passage | page) #REQUIRED
>
<!ELEMENT topic-fields EMPTY>
<!ATTLIST topic-fields
title (yes|no) #REQUIRED
description (yes|no) #REQUIRED
narrative (yes|no) #REQUIRED
>
<!ELEMENT description (#PCDATA)>
<!ELEMENT topic (book+)>
<!ATTLIST topic topic-id CDATA #REQUIRED >
<!ELEMENT book (bookid, rank?, rsv?, result+)>
<!ELEMENT result ((path|passage), rank?, rsv?)>
<!ELEMENT bookid (#PCDATA)>
<!ELEMENT path (#PCDATA)>
<!ELEMENT passage EMPTY>
<!ATTLIST passage
start CDATA #REQUIRED
end CDATA #REQUIRED
>
<!ELEMENT rank (#PCDATA)>
<!ELEMENT rsv (#PCDATA)>
Each submission must contain the following:
| •@participant-id: | The Participant ID number of the submitting institute. |
| •@run-id: | A run ID, which must be unique across all submissions sent from one organization. Please use meaningful but short names if possible. |
| •@task: | Identification of the task, which should just be "book-ad-hoc". |
| •@query: | Specification whether the search query was constructed automatically ("automatic") or manually ("manual") from the topic. |
| •@result-type: | Specification of the result-type, which can be either "element", "passage" or "page". An element is an XML element of arbitrary granularity, given by its XPath (see Appendix A). A passage is an arbitrarily sized passage, given by its start and end offsets. A page is an XML element of given granularity, given by its XPath. Result elements/passages must not overlap with any other retrieved element/passage. |
| •topic-fields: | Specification of which topic fields were used for constructing the search query (i.e., title and/or description and/or narrative). |
| •description: | A description of the retrieval approach applied to generate the run. Please add as much detail as you can, as this would help with the comparison and analysis of the results later on. |
Furthermore, a run should contain the search results for each topic, conforming to the following criteria:
| •topic: | Contains the ranked list of books estimated relevant to the given topic, ordered by decreasing relevance. At most 1000 books should be returned for each topic. |
| •@topic topic-id: | Identifies the topic. |
| •book: | Contains information for each book result in the ranking. |
| •bookid: | Each book should be identified using its bookID, which is the name of the directory that contains the XML source of the book (along with the MARC metadata file). |
| •rank/rsv: | The rank position and RSV value can be recorded for each book in the ranking. Please note, however, that the evaluation will likely rely on the actual ordering of results alone (values of the rank and rsv fields may thus be ignored). |
| •result: | For each book, a ranked list of book part results estimated relevant to the topic should be returned. |
| •path/passage: | Book part results may be non-overlapping XML elements, passages or book pages. XML elements and pages are identified by their XPaths, while passages are given by their start and end offsets. For information on XPath, please see Appendix A. |
| •@start/@end: | Define the character offsets of the start and end of a passage within a given book, expressed using XPath (see Appendix A). |
| •rank/rsv: | For each result inside a book, its rank and/or RSV score can be recorded. Please note that the evaluation may rely on the rank order of the books and of the results inside books alone (values of the rank and rsv fields may be ignored). |
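The non-overlap requirement on element and page results can be checked before submission. The sketch below is one possible check, assuming that result paths are simple absolute XPaths made of child steps with positional predicates (as in `/document[1]/page[27]`) and that all paths compared belong to the same book. Under that assumption, two elements overlap when they are the same element or one is an ancestor of the other. The function names, and the element names `section` and `p` used in the usage note, are hypothetical.

```python
from itertools import combinations


def path_steps(path):
    """Split a simple absolute XPath into its location steps."""
    compact = "".join(path.split())  # drop stray whitespace inside the path
    return tuple(step for step in compact.split("/") if step)


def elements_overlap(path_a, path_b):
    """True if the paths name the same element or an ancestor/descendant pair."""
    a, b = path_steps(path_a), path_steps(path_b)
    shorter = min(len(a), len(b))
    # Compare whole steps, so page[2] is not mistaken for a prefix of page[27].
    return a[:shorter] == b[:shorter]


def find_overlaps(paths):
    """Return every overlapping pair among the result paths of one book."""
    return [(p, q) for p, q in combinations(paths, 2) if elements_overlap(p, q)]
```

For example, `find_overlaps` reports `/document[1]/page[5]` together with `/document[1]/page[5]/p[1]`, but not `/document[1]/page[5]` together with `/document[1]/page[14]`. Passage results given by start and end offsets are not covered by this sketch.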
<bs-submission participant-id="25"
run-id="BM25F-Focused-PageLevelRetrieval-With-ToC-BackOfBookIndex-Streams"
task="book-ad-hoc" query="automatic"
result-type="page">
<topic-fields title="yes" description="no" narrative="no"/>
<description>
BM25F using 2 streams extracted from the table of contents and the
back-of-book index sections, indexing and retrieval only at page level,
no relevance propagation
</description>
<topic topic-id="01">
<book>
<bookid>384D10DAEA4E34A8</bookid><rank>1</rank>
<result><path>/document[1]/page[27]</path><rank>1</rank></result>
<result><path>/document[1]/page[122]</path><rank>2</rank></result>
<result><path>/document[1]/page[5]</path><rank>3</rank></result>
...
</book>
<book>
<bookid>5AFEE130174076E3</bookid><rank>2</rank>
<result><path>/document[1]/page[531]</path><rank>1</rank></result>
<result><path>/document[1]/page[14]</path><rank>2</rank></result>
...
</book>
<book>...</book>
...
</topic>
<topic> ... </topic>
</bs-submission>
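A run of this shape can be given a rough structural check before it is submitted. The following Python sketch is an assumption-laden illustration, not an official validator: the function name `check_ad_hoc_run` and the wording of the messages are invented here, and the rules it applies are only those stated in the DTD and field descriptions for the book-ad-hoc task.

```python
import xml.etree.ElementTree as ET

REQUIRED_ATTRS = ("participant-id", "run-id", "task", "query", "result-type")
RESULT_TYPES = {"element", "passage", "page"}
MAX_BOOKS_PER_TOPIC = 1000


def check_ad_hoc_run(xml_text):
    """Return a list of problems found in a book-ad-hoc run (empty if none)."""
    root = ET.fromstring(xml_text)
    problems = []
    for name in REQUIRED_ATTRS:
        if root.get(name) is None:
            problems.append(f"missing attribute @{name}")
    if root.get("task") not in (None, "book-ad-hoc"):
        problems.append("@task should be 'book-ad-hoc'")
    if root.get("result-type") not in RESULT_TYPES | {None}:
        problems.append("@result-type should be element, passage or page")
    topics = root.findall("topic")
    if not topics:
        problems.append("no <topic> elements")
    for topic in topics:
        topic_id = topic.get("topic-id")
        if topic_id is None:
            problems.append("<topic> without @topic-id")
        books = topic.findall("book")
        if not 1 <= len(books) <= MAX_BOOKS_PER_TOPIC:
            problems.append(f"topic {topic_id}: {len(books)} books")
        for book in books:
            bookid = (book.findtext("bookid") or "").strip()
            if not bookid:
                problems.append(f"topic {topic_id}: <book> without <bookid>")
            results = book.findall("result")
            if not results:
                problems.append(f"book {bookid}: no <result>")
            for result in results:
                # Each result needs either a <path> or a <passage> child.
                if result.find("path") is None and result.find("passage") is None:
                    problems.append(f"book {bookid}: <result> lacks <path>/<passage>")
    return problems
```

The check needs a complete, well-formed document, so the elided parts of the example above (`...`) would have to be filled in or removed first. It does not verify element order, rank values, or overlap between results.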
<!ELEMENT bs-submission (source-files, description, book+)>
<!ATTLIST bs-submission
participant-id CDATA #REQUIRED
run-id CDATA #REQUIRED
task (book-toc) #REQUIRED
toc-creation (automatic | semi-automatic) #REQUIRED
toc-source (book-toc | no-book-toc | full-content | other) #REQUIRED
>
<!ELEMENT source-files EMPTY>
<!ATTLIST source-files
xml (yes|no) #REQUIRED
pdf (yes|no) #REQUIRED
jpg (yes|no) #REQUIRED
>
<!ELEMENT description (#PCDATA)>
<!ELEMENT book (bookid, toc-entry+)>
<!ELEMENT bookid (#PCDATA)>
<!ELEMENT toc-entry (toc-entry*)>
<!ATTLIST toc-entry
title CDATA #REQUIRED
page CDATA #REQUIRED
>
Each submission must contain the following:
| •@participant-id: | The Participant ID number of the submitting institute. |
| •@run-id: | A run ID, which must be unique across all submissions sent from one organization. Please use meaningful but short names if possible. |
| •@task: | Identification of the task, which should just be "book-toc". |
| •@toc-creation: | Specification whether the ToC was constructed fully automatically ("automatic") or with some manual aid ("semi-automatic"). |
| •@toc-source: | Specification of whether the ToC was built based only on the table of contents part of the book ("book-toc"), any other part of the book excluding the ToC pages ("no-book-toc"), or based on the full content of the book ("full-content"). If none of these applies, please specify or simply use "other". |
| •source-files: | Specification of the source files used as input, i.e., the XML file (@xml="yes"), the PDF file (@pdf="yes"), and/or the JPEG files (@jpg="yes"). |
| •description: | A description of the approach used to generate the ToC. Please add as much detail as you can, as this would help with the comparison and analysis of the results later on. |
Furthermore, a run should contain the ToC for each book, conforming to the following criteria:
| •book: | Contains the ToC information for each book. |
| •bookid: | Each book should be identified using its bookID, which is the name of the directory that contains the XML source of the book (along with the MARC metadata file). |
| •toc-entry: | Contains details of each entry of the table of contents for a given book. Entries may be nested, e.g., sections in a chapter should be nested within the ToC entry of the chapter. |
| •@title: | The title of the ToC entry (e.g., chapter title). |
| •@page: | The page counter that corresponds to the start of the section represented by the ToC entry. The page counter starts with 1 on the first page of the book (i.e., cover page). Note that this is different from the page number that may be printed in the book itself (which may only start on the first content page and may include different formats, e.g., v, xii, 2-18, etc.). |
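The nesting of ToC entries can be illustrated with a short Python sketch. The input shape (a list of `(title, page, children)` tuples) and the function names are assumptions made for this example. The `page` values are page counters starting at 1 on the cover page, as described above, not the page numbers printed in the book.

```python
import xml.etree.ElementTree as ET


def add_entries(parent, entries):
    """Append nested <toc-entry> elements under `parent`.

    Each entry is a (title, page, children) tuple, where `children` is a
    list of entries of the same shape (hypothetical input format).
    """
    for title, page, children in entries:
        if page < 1:
            raise ValueError("page counters start at 1 on the cover page")
        node = ET.SubElement(parent, "toc-entry",
                             {"title": title, "page": str(page)})
        add_entries(node, children)  # sections nest inside their chapter


def build_book_toc(bookid, entries):
    """Build one <book> element holding the ToC for a single book."""
    book = ET.Element("book")
    ET.SubElement(book, "bookid").text = bookid
    add_entries(book, entries)
    return book
```

For instance, `build_book_toc("384D10DAEA4E34A8", [("Chapter 1", 9, [("Section 1.1", 11, [])])])` yields a `book` element whose single chapter entry contains one nested section entry. The titles and page counters in this call are made up for the illustration.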
| Objectives | Learn about at least two different classic approaches to children's education. |
| Task | You are looking for material on children's education and pedagogy with references to different approaches such as Pestalozzi, Steiner or Montessori. You need material describing their approaches in order to find out commonalities and differences. |
| Motivation | You want to get a better idea of the options available as you are preparing a project proposal dealing with children and e-learning. |
| Context | The information you will be collecting and comparing has to be expressed in a scientific sort of way and allow for comparison. You are looking for books that either present and/or review different approaches or monographs on specific pedagogues' schools. You should concentrate on the use of table of contents as you need books focusing on the topic and providing in-depth discussion of it. |
| Background | You have a good background in e-learning for adults and need to explore the children’s environment. |
| Completion | The task will be considered completed if at least two different approaches to children's education have been found. |
| Success | The task can be considered successful if the material retrieved enables the discussion of different approaches with references to e-learning as part of the project proposal. |
| Objectives | Explore the concepts of emotion and affection. You want to get an understanding of the basic concepts looking at classic literature on the subject. |
| Task | You are looking for material describing theory about affection and emotions in general. |
| Motivation | |
| Context | The information you will be collecting has to be expressed in a scientific sort of way and be used to write your introduction section. You are looking for books that define or review these concepts with scientific rigor. Poems or fiction will not be considered. You should use, when available, the back of the book index to get directly to basic definitions and build a relevant terminology. |
| Background | You have a good background on HCI, but want to explore affective computing. |
| Completion | The task will be considered completed if you provide a basic terminology relevant to the topic. |
| Success | The task can be considered successful if the material retrieved enables the composition of a complete terminology relevant to affective computing. |