damsym:semantic_search_engine
Differences
This shows you the differences between two versions of the page.
| Next revision | Previous revision | ||
| damsym:semantic_search_engine [2026/01/15 16:25] – created fincons | damsym:semantic_search_engine [2026/02/12 12:23] (current) – [Detailed View (“More Details”)] fincons | ||
|---|---|---|---|
| Line 4: | Line 4: | ||
| The semantic search module represents the core functionality of the DaMSym tool. | The semantic search module represents the core functionality of the DaMSym tool. | ||
| - | It is designed to allow users to query multilingual textual corpora — Arabic, Slavic, Greek, Latin, Sanskrit, and Greek + Latin — through an intelligent system based on conceptual similarity between the query terms and the contents present in the texts. | ||
| - | Unlike traditional search engines, which operate on literal matches, DaMSym processes the entered terms by evaluating semantic proximity between concepts. | ||
| - | The result is a list of texts related not only by keyword, but also by meaning and linguistic context, supporting comparative analysis and philological study. | ||
| - | Semantic search is available to all users, including unauthenticated users (Guest), while some advanced functions — such as resource addition, review, or approval — require authenticated access. | ||
| + | It is designed to allow users to query multilingual textual corpora — Arabic, Church Slavonic, Greek, Latin, Sanskrit, and Latin & Greek — through an intelligent system based on conceptual similarity between query terms and the content present in the texts. | ||
| + | |||
| + | Unlike traditional search engines, which operate on literal matches, DaMSym processes the inserted terms by evaluating semantic proximity between concepts. | ||
| + | |||
| + | The result is a list of texts correlated not only by keyword, but also by meaning and linguistic context, thereby supporting comparative analysis and philological study. | ||
| + | |||
| + | Semantic search is available to all users, including unauthenticated users (Guest), while some advanced functions — such as adding resources, reviewing, or approving — require authenticated access. | ||
| + | |||
| + | {{damsym: | ||
| ---- | ---- | ||
| ===== Interface and Structure ===== | ===== Interface and Structure ===== | ||
| - | The search engine interface is organized in a simple | + | The search engine interface is organized in a simple |
| - | All input tools are located | + | |
| + | All input tools are placed | ||
| * the main search bar, | * the main search bar, | ||
| * the “Advanced Search” section, | * the “Advanced Search” section, | ||
| - | * the dynamic | + | * the filters panel. |
| + | |||
| + | The right side of the page is dedicated to displaying search results, which are updated in real time according to the selected settings. | ||
| + | |||
| + | The visual separation between input area and output area ensures clarity and immediacy, allowing users to modify parameters without reloading the page. | ||
| - | The right side of the page is dedicated to displaying results, which are updated in real time according to the selected settings. | ||
| - | The visual separation between the input area and the output area ensures clarity and immediacy, allowing users to adjust parameters without reloading the page. | ||
| All search parameters are interdependent: | All search parameters are interdependent: | ||
| Line 28: | Line 36: | ||
| ===== Supported Languages and Search Domains ===== | ===== Supported Languages and Search Domains ===== | ||
| - | The system supports six main linguistic domains, each with its own rules and metadata. | + | The system supports six main linguistic domains, each characterized by its own internal |
| - | Despite internal differences, | + | |
| + | Despite internal differences, | ||
| ^ Language / Domain ^ Main Characteristics ^ | ^ Language / Domain ^ Main Characteristics ^ | ||
| - | | **Arabic, Greek, Latin, | + | | **Arabic, Greek, Latin, Latin & Greek** | Common filter structure: |
| - | | **Slavic** | Includes a dedicated font selector for correct character rendering. Provides filters for multiple languages or historical/regional events. Does not include | + | | **Church Slavonic** | Includes a dedicated font selection dropdown to ensure |
| - | | **Sanskrit** | Includes only the “Works” filter. No additional parameters are currently available. Automatic transliteration is provided | + | | **Sanskrit** | Includes only the Works filter. No additional parameters are provided. Automatic transliteration is available |
| ---- | ---- | ||
| Line 40: | Line 49: | ||
| ===== Advanced Search ===== | ===== Advanced Search ===== | ||
| - | The Advanced Search | + | The Advanced Search |
| - | For each term, it is possible to assign a numerical weight between 0 and 1, indicating its relative importance within the search. | + | |
| - | Weights can be assigned only to the additional words or phrases entered in the Advanced Search section (up to a maximum of two), while the main word typed in the search | + | |
| - | The sum of the assigned weights must always be equal to 1, and the system automatically recalculates the values to maintain semantic balance. | + | |
| - | Practical example: | + | The main search always represents the core of the search process. |
| - | A user can search | + | |
| - | The semantic engine balances the search according to these proportions, | + | Additional phrases serve exclusively to refine, contextualize, |
| - | This weighting logic makes the search more flexible and suitable for comparative or multidisciplinary studies. | + | |
| + | Each search block contributes to the final result through a numerical weight ranging from 0 to 1, representing its relative importance compared to the other active searches. | ||
| + | |||
| + | When the user activates the Advanced Search section: | ||
| + | |||
| + | * The main search remains mandatory and can be assigned a minimum weight of 0.5 up to a maximum of 0.9. | ||
| + | * A first additional | ||
| + | |||
| + | These values represent an initial suggestion and can be modified by the user. | ||
| + | |||
| + | {{damsym: | ||
| + | |||
| + | The user may choose to add a second additional phrase by clicking the “+” button. | ||
| + | |||
| + | In this case, a new section becomes available, and the user can assign three distinct weights to the three search terms or phrases, starting from the default values proposed by the system. | ||
| + | |||
| + | {{damsym: | ||
| + | |||
| + | If both additional phrases are active, clicking the “–” button allows the user to remove the most recently added one. | ||
| + | |||
| + | The sum of the assigned weights must always be equal to 1. The system automatically recalculates values to maintain the semantic balance of the overall search configuration. | ||
| + | |||
| + | The semantic engine balances the search according to these proportions, | ||
| + | |||
| + | This weighting logic makes the search more flexible and suitable for comparative or multidisciplinary studies. | ||
| + | |||
| + | The weight assigned to each search phrase can be adjusted using a slider or through an input field, which provides arrow controls for changing the value. | ||
| - | Figure 3, Advanced Search | ||
| ---- | ---- | ||
| Line 56: | Line 87: | ||
| ===== Filters and Search Parameters ===== | ===== Filters and Search Parameters ===== | ||
| - | In addition to semantic terms, users can narrow the search | + | In addition to semantic terms, users can narrow the scope of the search |
| - | **Common | + | ==== Common |
| - | * **Chronological range** → selectable via time slider or by manually | + | * **Chronological range (Period)** → available for all languages except Church Slavonic and Sanskrit. The filter is dynamic and automatically adapts to the queried dataset. The range can be selected through a time slider or by manually |
| - | * **Authors** → multiple selection to include | + | * **Authors** → includes a search bar for selecting and filtering by one or more authors. |
| - | * **Works** → multiple selection of specific works or collections. | + | * **Works** → includes a search bar for selecting and filtering by one or more works. |
| - | **Slavic filters** | + | ==== Church Slavonic Filters ==== |
| - | * **Font** → selection of the character type used for text rendering; | + | * **Font** → selection of the character type used for text rendering. |
| - | * **Language** → filter allowing selection of one or more available languages, | + | * **Language** → filter allowing selection of one or more available languages, |
| - | * **Historical | + | * **Historical/Regional Variants** → geographic |
| - | **Sanskrit | + | ==== Sanskrit |
| - | * **Works** → the only filter currently available. | + | * **Works** → multiple selection of specific works or collections. |
| - | All filters | + | There is synchronization between the Authors and Works filters: |
| - | The **Reset** button is also available, allowing | + | |
| + | Consequently, | ||
| + | |||
| + | A Reset button is also available, allowing | ||
| ---- | ---- | ||
| Line 81: | Line 115: | ||
| ===== Search Results ===== | ===== Search Results ===== | ||
| - | Search results are displayed as a list on the right side of the screen. | + | Search results are displayed as a list on the right side of the screen. |
| - | Each row includes: | + | |
| + | Each entry includes: | ||
| * the title of the work or fragment; | * the title of the work or fragment; | ||
| - | * a text excerpt; | + | * an excerpt of the corresponding |
| - | * the **Similarity Score**, a numerical value expressing the degree of semantic similarity between the result and the user’s query; | + | * the Similarity Score, a numerical value expressing the degree of semantic similarity between the result and the user’s query; |
| * the “More Details” button, which opens the detailed metadata view. | * the “More Details” button, which opens the detailed metadata view. | ||
| Results can be sorted according to three criteria: | Results can be sorted according to three criteria: | ||
| - | | + | * Similarity (default) |
| - | | + | * Date |
| - | | + | * Author |
| - | Text highlighting | + | In the results page, no direct text highlighting is applied |
| - | In Greek and Latin corpora, results are displayed without highlighting. | + | |
| - | Figure 4, Search Results | + | By selecting the “More Details” button, the user accesses the detailed metadata view, where a contextualized portion of text is displayed — specifically, |
| + | {{damsym: | ||
| ---- | ---- | ||
| ===== Detailed View (“More Details”) ===== | ===== Detailed View (“More Details”) ===== | ||
| - | By clicking “More Details” | + | By clicking “More Details” |
| - | * title, author, and language; | + | * title; |
| * full or extended text of the fragment; | * full or extended text of the fragment; | ||
| - | * list of metadata (work, period, place, source, etc.), | + | * list of metadata (work, period, place, source, etc.), |
| - | * semantic | + | * semantic |
| - | * **Feedback or Rating** section (visible only to Reviewer users). | + | |
| - | From this same view, authenticated users (Researcher | + | From this same view, authenticated users (Researcher, Reviewer, and WP Lead) may propose corrections directly on the highlighted |
| - | + | ||
| - | Figure 5, More Details | + | |
| + | {{damsym: | ||
| ---- | ---- | ||
| ===== Text Editing and Corrections ===== | ===== Text Editing and Corrections ===== | ||
| - | Within the More Details view, users with the Researcher | + | Within the More Details view, users with the roles of Researcher, Reviewer, and WP Lead can interact with the displayed text and metadata according to controlled procedures. |
| - | * select words or fragments to propose a semantic or philological correction using the **Edit Text (Corrections)** function; | + | In particular: |
| - | * modify existing metadata using the **Edit** buttons next to each metadata field; | + | |
| - | * use the **Add Metadata** button to add new fields or information (e.g. sources, original titles, notes, bibliographic references). | + | |
| - | All modifications are saved as proposals | + | * text modification is allowed exclusively on contextualized portions automatically highlighted by the system according to the search query. These portions can be selected to propose semantic or philological corrections using the Edit Text (Corrections) function; |
| + | * metadata are structured as name–value pairs and can be modified using the Edit buttons displayed next to each field; | ||
| + | * through the Add Metadata button, users may propose the addition of new informational fields (for example, sources, original titles, notes, or bibliographic references). | ||
| + | |||
| + | All modifications | ||
| + | |||
| + | Proposals are visible in the Corrections section of the Dashboard, where: | ||
| + | |||
| + | * Researcher and Reviewer users can modify, delete, submit, | ||
| + | * the WP Lead can evaluate each proposal and approve, reject, or delete it using the dedicated management buttons. | ||
| ---- | ---- | ||
| Line 132: | Line 172: | ||
| ===== Interdependence Between Search and Filters ===== | ===== Interdependence Between Search and Filters ===== | ||
| - | Search and filters in DaMSym do not operate independently, but in a relationship of mutual dependence: | + | Search and filters in DaMSym do not operate independently but in a relationship of dependency: any modification to parameters influences the semantic processing of the query. |
| - | any change in parameters influences the semantic processing of the query. | + | |
| For example, selecting a specific author automatically restricts the semantic context to the subset of texts associated with that author. | For example, selecting a specific author automatically restricts the semantic context to the subset of texts associated with that author. | ||
| - | This dynamic architecture | + | |
| + | This dynamic architecture | ||
| + | |||
| + | ---- | ||
damsym/semantic_search_engine.1768490705.txt.gz · Last modified: by fincons
