Oracle Ultra Search Online Documentation Release 9.2
This section lets you specify query-related settings, such as data source groups, URL submission, relevancy boosting, and query statistics.
Data source groups are logical entities exposed to the search engine user. When entering a query, the user is asked to choose one or more data groups to search from. Use this page to define these data groups.
A data group consists of one or more data sources. A data source can be assigned to multiple data groups. Data groups are sorted first by name. Within each data group, individual data sources are listed and can be sorted by source name or source type.
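The listing order described above can be sketched as follows. This is only an illustration: the `DataSource` and `DataGroup` classes and the `list_groups` function are hypothetical names, not part of Ultra Search, and case-insensitive ordering is an assumption.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class DataSource:
    name: str
    source_type: str  # for example "Web" or "Table"


@dataclass
class DataGroup:
    name: str
    sources: list = field(default_factory=list)


def list_groups(groups, sort_sources_by="name"):
    """Return (group name, [source names]) pairs.

    Groups are ordered by name; sources inside each group are ordered
    by source name or by source type (ties broken by name).
    """
    if sort_sources_by == "name":
        key = lambda s: s.name.lower()
    elif sort_sources_by == "type":
        key = lambda s: (s.source_type.lower(), s.name.lower())
    else:
        raise ValueError("sort_sources_by must be 'name' or 'type'")
    return [
        (g.name, [s.name for s in sorted(g.sources, key=key)])
        for g in sorted(groups, key=lambda g: g.name.lower())
    ]
```

Calling `list_groups(groups, sort_sources_by="type")` keeps the groups in name order and only changes the order of the sources within each group.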
To create a new data source group, do the following:
- Specify a name for the group.
- Assign data sources to the group. To assign a Web or table data source to this data group, choose one or more available Web sources or table sources and click >>. After a data source has been assigned to a group, it cannot be assigned to any other group. To unassign a Web or table data source, choose one or more scheduled sources and click <<.
- Click Finish.
URL Submission Methods
URL submission lets query users submit URLs. These URLs are added to the seed URL list and included in the Ultra Search crawler search space. You can allow or disallow URL submission by query users.
URL Boundary Rules Checking
URLs are submitted to a specific Web data source. URL boundary rules checking ensures that submitted URLs comply with the URL boundary rules of that Web data source. You can allow or disallow URL boundary rules checking.
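As a rough illustration of how a submitted URL might be checked before it joins the seed URL list, the sketch below assumes that a boundary rule is simply a list of allowed domains. That rule format, and the names `within_boundary` and `accept_submission`, are assumptions made for this example; they do not describe how Ultra Search stores or evaluates its rules.

```python
from urllib.parse import urlsplit


def within_boundary(url, allowed_domains):
    """True if the URL's host equals, or is a subdomain of, an allowed domain.

    Treating boundary rules as a domain list is an assumption for this sketch.
    """
    host = (urlsplit(url).hostname or "").lower()
    domains = [d.lower().lstrip(".") for d in allowed_domains]
    return any(host == d or host.endswith("." + d) for d in domains)


def accept_submission(url, allowed_domains, seed_urls, check_boundary=True):
    """Add a user-submitted URL to the seed list unless a boundary check rejects it."""
    if check_boundary and not within_boundary(url, allowed_domains):
        return False
    if url not in seed_urls:
        seed_urls.append(url)
    return True
```

With `check_boundary=False`, every submitted URL is accepted, which mirrors disallowing boundary rules checking on this page.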
Relevancy boosting lets administrators override the search results and influence the order in which documents are ranked in the query result list. This can be used to promote important documents to higher scores, which makes them easier to find.
There are two methods for locating URLs for relevancy boosting: locate by search or manual URL entry.
Locate by Search
To boost a URL, first locate a URL by performing a search. You can specify a hostname to narrow the search. After you have located the URL, click Information to edit the query string and score for the document.
Manual URL Entry
If a document has not been crawled or indexed, then it cannot be found in a search. However, you can provide a URL and enter the relevancy boosting information with it. To do so, click Create, and enter the following:
- Specify the document URL. You must assign the URL to a data source. This document is indexed the next time it is crawled.
- Enter scores in the range of 1 to 100 for one or more query strings. When a user performs a search using the exact query string, the score applies for this URL.
The document is searchable after the document is loaded for the term. The document is also indexed the next time the schedule is run.
With manual URL entry, you can only assign URLs for Web data sources. Users will get an error message on this page if no Web data source is defined.
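The effect of a boost on the result list can be pictured with the sketch below. It assumes that a boosted score replaces the computed score whenever the user's query equals the stored query string exactly (case-sensitive here, which is an assumption), and that a manually entered URL appears in the results even if the search did not find it. The function name `apply_boosts` and the data layout are hypothetical.

```python
def apply_boosts(query, hits, boosts):
    """Re-rank search hits using administrator-defined boosts.

    hits:   list of (url, computed_score) pairs from the search
    boosts: dict mapping (url, query_string) to a score from 1 to 100
    """
    scored = dict(hits)
    for (url, boosted_query), score in boosts.items():
        if boosted_query != query:
            continue  # a boost applies only to the exact query string
        if not 1 <= score <= 100:
            raise ValueError("boost score must be in the range 1 to 100")
        scored[url] = score  # the boost overrides the computed score
    # Highest score first; ties broken by URL for a stable order.
    return sorted(scored.items(), key=lambda item: (-item[1], item[0]))
```

A query that differs from the stored query string in any way leaves the original ranking untouched.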
Note: Ultra Search provides a command-line tool to load metadata, such as document relevance boosting, into an Ultra Search database. If you have a large amount of data, this is probably faster than using the HTML-based administration tool. For more information, see Loading Metadata into Ultrasearch.
Enabling Query Statistics
This section lets you enable or disable the collection of query statistics. The logging of query statistics reduces query performance. Therefore, Oracle recommends that you disable the collection of query statistics during regular operation.
Note: After you enable query statistics, the table that stores statistics data is truncated every Sunday at 1:00 A.M.
Viewing Statistics
If the collection of query statistics is enabled, you can click one of the following categories:
- Daily summary of query statistics
- Top 50 queries
- Top 50 ineffective queries
- Top 50 failed queries
Daily Summary of Query Statistics
This summarizes all query activity on a daily basis. The statistics gathered are:
- Average query time: the average time taken over all queries
- Number of queries: the total number of queries made in the day
- Number of hits: the average number of results returned by each query
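The three daily figures above could be derived from a query log along these lines. The log layout (timestamp, query string, elapsed seconds, number of results) and the function name `daily_summary` are assumptions for illustration; the actual statistics table is not described here.

```python
from collections import defaultdict
from datetime import datetime


def daily_summary(log):
    """Summarize a query log per calendar day.

    log: iterable of (timestamp, query_string, elapsed_seconds, hits) tuples
    """
    by_day = defaultdict(list)
    for timestamp, _query, elapsed, hits in log:
        by_day[timestamp.date()].append((elapsed, hits))

    summary = {}
    for day, rows in sorted(by_day.items()):
        count = len(rows)
        summary[day] = {
            "average_query_time": sum(e for e, _ in rows) / count,
            "number_of_queries": count,
            # "Number of hits" is an average per query, not a total.
            "number_of_hits": sum(h for _, h in rows) / count,
        }
    return summary
```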
Top 50 Queries
This summarizes the 50 most frequent queries that occurred in the past 24 hours.
- Query string: the query string
- Average query time: the average time to return a result
- Number of queries: the total number of queries made in the past 24 hours
- Number of hits: the average number of results returned by each query
- Frequency: the number of queries divided by total number of queries over all query strings
- % ineffective queries: the number of ineffective queries divided by total number of queries over all query strings
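To make the per-string columns concrete, the sketch below computes them from a list of queries already restricted to the past 24 hours. The record layout and the name `top_queries` are hypothetical. The "% ineffective queries" column is left out because this page does not say how a query is classified as ineffective.

```python
from collections import defaultdict


def top_queries(records, limit=50):
    """Rank query strings by how often they were issued.

    records: list of (query_string, elapsed_seconds, hits) tuples
    """
    total = len(records)
    grouped = defaultdict(list)
    for query, elapsed, hits in records:
        grouped[query].append((elapsed, hits))

    rows = []
    for query, items in grouped.items():
        count = len(items)
        rows.append({
            "query_string": query,
            "average_query_time": sum(e for e, _ in items) / count,
            "number_of_queries": count,
            "number_of_hits": sum(h for _, h in items) / count,
            # Share of all queries, over all query strings.
            "frequency": count / total,
        })
    # Most frequent first; ties broken alphabetically.
    rows.sort(key=lambda r: (-r["number_of_queries"], r["query_string"]))
    return rows[:limit]
```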
Top 50 Ineffective Queries
This summarizes the 50 most frequent ineffective queries that occurred in the past 24 hours. Each row in the table describes statistics for a particular query string.
- Query string: the query string
- Number of queries: the total number of queries made in the past 24 hours
- % ineffective queries: the number of ineffective queries divided by total number of queries for that string
Top 50 Failed Queries
This summarizes the top 50 queries that failed over the past 24 hours. A failed query is one where the search engine end-user did not locate any query results.
The columns are:
- Query string: the query string
- Number of queries: the total number of queries made in the past 24 hours
- Frequency: the percentage occurrence of a failed query
- Cumulative frequency: the cumulative percentage occurrence of all failed queries
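One plausible reading of the last two columns is shown below: frequency is a query string's share of all failed queries, and cumulative frequency is the running total of those shares down the ranked list. Both the interpretation and the name `failed_query_report` are assumptions for this sketch.

```python
from collections import Counter


def failed_query_report(failed_queries, limit=50):
    """Build rows of (query string, count, frequency %, cumulative frequency %).

    failed_queries: list of query strings, one entry per failed query
    """
    counts = Counter(failed_queries)
    total = sum(counts.values())
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))

    rows = []
    running = 0
    for query, count in ranked[:limit]:
        running += count
        rows.append((query, count, 100 * count / total, 100 * running / total))
    return rows
```

Under this reading, the cumulative frequency of the last row reaches 100 only when every failed query string fits within the limit.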
Copyright © 2002 Oracle Corporation. All Rights Reserved.