Release 1.2
Commit | Description |
---|---|
Tue Nov 06 14:32:08 CET 2012 by Michael Peter Christen | added solr faceted search support to YaCy search results added solr highlighting / YaCy snippets to YaCy search results - facets are now much more complete - facets are computed and searched much faster - snippet computation is done by solr if solr knows the snippet Changed Files: htroot/AccessTracker_p.java, htroot/Crawler_p.java, htroot/HostBrowser.java, htroot/yacy/search.java, htroot/yacysearch.java, htroot/yacysearchitem.java, htroot/yacysearchlatestinfo.java, htroot/yacysearchtrailer.java, source/net/yacy/cora/federate/solr/responsewriter/OpensearchResponseWriter.java, source/net/yacy/cora/protocol/Domains.java, source/net/yacy/peers/Protocol.java, source/net/yacy/peers/RemoteSearch.java, source/net/yacy/search/index/Fulltext.java, source/net/yacy/search/index/Segment.java, source/net/yacy/search/query/QueryParams.java, source/net/yacy/search/query/RankingProcess.java, source/net/yacy/search/query/SearchEvent.java, source/net/yacy/search/query/SnippetWorker.java, source/net/yacy/search/snippet/ResultEntry.java, source/net/yacy/search/snippet/TextSnippet.java |
Tue Nov 06 00:29:37 CET 2012 by Michael Peter Christen | added the visualization of error-urls to host browser - only visible for admins - a faceted search generates a huge list for all hosts in the host list - the faceted search algorithms had to be modified for that - within the browsing of the directory path, the error cause is written to the url which is presented as error-url - the errors are also accumulated for directory sums Changed Files: htroot/CrawlResults.java, htroot/HostBrowser.html, htroot/HostBrowser.java, htroot/env/base.css, source/net/yacy/cora/federate/solr/connector/MirrorSolrConnector.java, source/net/yacy/cora/federate/solr/connector/MultipleSolrConnector.java, source/net/yacy/cora/federate/solr/connector/RetrySolrConnector.java, source/net/yacy/cora/federate/solr/connector/ShardSolrConnector.java, source/net/yacy/cora/federate/solr/connector/SolrConnector.java, source/net/yacy/cora/federate/solr/connector/SolrServerConnector.java, source/net/yacy/crawler/data/ZURL.java, source/net/yacy/search/index/Fulltext.java |
Mon Nov 05 15:23:03 CET 2012 by Michael Peter Christen | update to web interface structure Changed Files: htroot/ConfigAppearance_p.html, htroot/ConfigLanguage_p.html, htroot/ConfigLiveSearch.html, htroot/ConfigPortal.html, htroot/ConfigProfile_p.html, htroot/ConfigSearchBox.html, htroot/CrawlStartExpert_p.html, htroot/Crawler_p.java, htroot/IndexCreateQueues_p.html, htroot/Ranking_p.html, htroot/Surftips.html, htroot/Table_API_p.html, htroot/env/templates/header.template, htroot/env/templates/simpleheader.template, htroot/env/templates/submenuComputation.template, htroot/env/templates/submenuIndexControl.template, htroot/env/templates/submenuSearchIntegration.template |
Mon Nov 05 03:19:28 CET 2012 by Michael Peter Christen | renovated the way how search results are count. should be correct now... Changed Files: htroot/AccessTracker_p.java, htroot/HostBrowser.java, htroot/IndexControlRWIs_p.java, htroot/yacy/search.java, htroot/yacysearch.java, htroot/yacysearchitem.java, htroot/yacysearchlatestinfo.java, htroot/yacysearchtrailer.java, source/net/yacy/cora/federate/solr/connector/EmbeddedSolrConnector.java, source/net/yacy/peers/Protocol.java, source/net/yacy/peers/RemoteSearch.java, source/net/yacy/peers/graphics/NetworkGraph.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/index/Segment.java, source/net/yacy/search/query/AccessTracker.java, source/net/yacy/search/query/QueryParams.java, source/net/yacy/search/query/RankingProcess.java, source/net/yacy/search/query/SearchEvent.java, source/net/yacy/search/query/SearchEventCache.java |
Fri Nov 02 13:57:43 CET 2012 by Michael Peter Christen | update to HostBrowser: - time-out after 3 seconds to speed up display (may be incomplete) - showing also all links from the balancer queue in the host list (after the '/') and in the result browser view with tag 'loading' Changed Files: htroot/HostBrowser.html, htroot/HostBrowser.java, htroot/IndexCreateQueues_p.java, source/net/yacy/cora/federate/solr/connector/AbstractSolrConnector.java, source/net/yacy/cora/federate/solr/connector/SolrConnector.java, source/net/yacy/crawler/Balancer.java, source/net/yacy/crawler/data/NoticedURL.java, source/net/yacy/search/index/Fulltext.java |
Fri Nov 02 12:29:48 CET 2012 by Michael Peter Christen | migration to solr 4.0.0 Changed Files: .classpath, addon/YaCy.app/Contents/Info.plist, build.xml, defaults/solr/solrconfig.xml, htroot/gsa/searchresult.java, htroot/solr/select.java, lib/apache-solr-core-4.0.0.jar, lib/apache-solr-solrj-4.0.0.jar, lib/lucene-analyzers-common-4.0.0.jar, lib/lucene-analyzers-phonetic-4.0.0.jar, lib/lucene-core-4.0.0.jar, lib/lucene-grouping-4.0.0.jar, lib/lucene-highlighter-4.0.0.jar, lib/lucene-memory-4.0.0.jar, lib/lucene-misc-4.0.0.jar, lib/lucene-queries-4.0.0.jar, lib/lucene-queryparser-4.0.0.jar, lib/lucene-spatial-4.0.0.jar, lib/lucene-suggest-4.0.0.jar, lib/spatial4j-0.3.jar, lib/zookeeper-3.3.6.jar, source/net/yacy/cora/federate/solr/SolrServlet.java, source/net/yacy/cora/federate/solr/connector/SolrServerConnector.java, source/net/yacy/cora/federate/solr/responsewriter/EnhancedXMLResponseWriter.java, source/net/yacy/cora/federate/solr/responsewriter/GSAResponseWriter.java, source/net/yacy/cora/federate/solr/responsewriter/JsonResponseWriter.java, source/net/yacy/cora/federate/solr/responsewriter/OpensearchResponseWriter.java, source/net/yacy/search/index/Fulltext.java |
Thu Nov 01 17:16:43 CET 2012 by Michael Peter Christen | tried to clean up the search process mess Changed Files: htroot/IndexControlRWIs_p.java, htroot/yacy/search.java, htroot/yacysearch.java, htroot/yacysearchitem.java, htroot/yacysearchlatestinfo.java, htroot/yacysearchtrailer.java, source/net/yacy/peers/Protocol.java, source/net/yacy/peers/RemoteSearch.java, source/net/yacy/peers/graphics/ProfilingGraph.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/index/DocumentIndex.java, source/net/yacy/search/index/Segment.java, source/net/yacy/search/query/HeuristicResult.java, source/net/yacy/search/query/QueryParams.java, source/net/yacy/search/query/RankingProcess.java, source/net/yacy/search/query/SearchEvent.java, source/net/yacy/search/query/SearchEventCache.java, source/net/yacy/search/query/SearchEventType.java, source/net/yacy/search/query/SecondarySearchSuperviser.java, source/net/yacy/search/query/SnippetProcess.java, source/net/yacy/search/query/SnippetWorker.java |
Thu Nov 01 10:22:22 CET 2012 by Michael Peter Christen | fixed a problem with local search from solr results: now all results from solr are shown (again) Changed Files: htroot/IndexControlRWIs_p.java, htroot/yacy/search.java, htroot/yacysearch.java, htroot/yacysearchitem.java, htroot/yacysearchlatestinfo.java, htroot/yacysearchtrailer.java, source/net/yacy/cora/sorting/WeakPriorityBlockingQueue.java, source/net/yacy/kelondro/util/FileUtils.java, source/net/yacy/peers/Protocol.java, source/net/yacy/peers/graphics/WebStructureGraph.java, source/net/yacy/search/query/QueryParams.java, source/net/yacy/search/query/RWIProcess.java, source/net/yacy/search/query/SearchEvent.java, source/net/yacy/search/query/SnippetProcess.java |
Wed Oct 31 17:44:45 CET 2012 by Michael Peter Christen | - added a delete button in host browser to delete a complete subpath - removed storage of default collection name - default is now "user" - made stacking of crawl start points concurrently Changed Files: htroot/CrawlStartExpert_p.java, htroot/Crawler_p.java, htroot/HostBrowser.html, htroot/HostBrowser.java, source/net/yacy/kelondro/util/FileUtils.java, source/net/yacy/peers/Protocol.java, source/net/yacy/peers/RemoteSearch.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/index/Fulltext.java |
Sun Oct 28 22:48:11 CET 2012 by Michael Peter Christen | replaced the custom robots.txt loader by the standard http loader Changed Files: htroot/CrawlCheck_p.java, htroot/api/getpageinfo.java, htroot/api/getpageinfo_p.java, source/net/yacy/crawler/Balancer.java, source/net/yacy/crawler/CrawlSwitchboard.java, source/net/yacy/crawler/data/CrawlQueues.java, source/net/yacy/crawler/data/Latency.java, source/net/yacy/crawler/retrieval/HTTPLoader.java, source/net/yacy/crawler/retrieval/Request.java, source/net/yacy/crawler/robots/RobotsTxt.java, source/net/yacy/crawler/robots/RobotsTxtParser.java, source/net/yacy/repository/LoaderDispatcher.java, source/net/yacy/search/Switchboard.java |
Sun Oct 28 19:56:02 CET 2012 by Michael Peter Christen | - removed unnecessary synchronized and deadlock in crawler - removed problem with monitoring object on Balancer.wait - added missing user agent settings Changed Files: source/net/yacy/cora/document/MultiProtocolURI.java, source/net/yacy/cora/protocol/ClientIdentification.java, source/net/yacy/cora/protocol/http/HTTPClient.java, source/net/yacy/crawler/Balancer.java, source/net/yacy/crawler/CrawlStacker.java, source/net/yacy/crawler/data/NoticedURL.java, source/net/yacy/crawler/retrieval/HTTPLoader.java, source/net/yacy/crawler/robots/RobotsTxt.java, source/net/yacy/data/WorkTables.java, source/net/yacy/document/parser/sitemapParser.java, source/net/yacy/interaction/AugmentHtmlStream.java, source/net/yacy/interaction/Interaction.java, source/net/yacy/interaction/contentcontrol/ContentControlImportThread.java, source/net/yacy/kelondro/workflow/AbstractBusyThread.java, source/net/yacy/peers/SeedDB.java, source/net/yacy/peers/operation/yacyRelease.java, source/net/yacy/search/Switchboard.java, source/net/yacy/server/http/HTTPDProxyHandler.java, source/net/yacy/server/serverSwitch.java, source/net/yacy/yacy.java |
Sun Oct 28 13:24:49 CET 2012 by orbiter | update to Balancer algorithm: - create a load list from the current list of known hosts - do not create this list for each Balancer.pop access - create the list from those hosts which have a zero-waiting time - select 1/3 from that list which have the most urls waiting - get hosts from the wainting list in random order - fixes for some delta-time computations - always load all urls from hosts which have never been loaded before Changed Files: htroot/IndexCreateQueues_p.java, source/net/yacy/crawler/Balancer.java, source/net/yacy/crawler/data/Latency.java, source/net/yacy/crawler/data/NoticedURL.java, source/net/yacy/crawler/retrieval/FTPLoader.java, source/net/yacy/crawler/retrieval/HTTPLoader.java |
Thu Oct 25 16:05:04 CEST 2012 by Michael Peter Christen | - added a method for the RasterPlotter to draw arrow endings to lines - replaced the dot in the NetworkGraph with arrows - enhanced the image drawing speed using pre-computed color values - added more attention for OOM cases during very large image painting Changed Files: htroot/AccessPicture_p.java, htroot/WebStructurePicture_p.java, htroot/imagetest.java, source/net/yacy/dbtest.java, source/net/yacy/peers/graphics/Banner.java, source/net/yacy/peers/graphics/NetworkGraph.java, source/net/yacy/peers/graphics/ProfilingGraph.java, source/net/yacy/visualization/ChartPlotter.java, source/net/yacy/visualization/GraphPlotter.java, source/net/yacy/visualization/HexGridPlotter.java, source/net/yacy/visualization/PngEncoder.java, source/net/yacy/visualization/RasterPlotter.java |
Tue Oct 23 19:08:44 CEST 2012 by orbiter | removed warnings Changed Files: htroot/SettingsAck_p.java, source/net/yacy/cora/federate/solr/connector/ShardSelection.java, source/net/yacy/document/Document.java, source/net/yacy/kelondro/blob/TablesColumnIndex.java, source/net/yacy/kelondro/data/citation/CitationReference.java, source/net/yacy/kelondro/data/word/Word.java, source/net/yacy/kelondro/rwi/AbstractIndex.java, source/net/yacy/kelondro/rwi/Index.java, source/net/yacy/kelondro/table/SQLTable.java, source/net/yacy/kelondro/workflow/InstantBusyThread.java, source/net/yacy/peers/Protocol.java, source/net/yacy/peers/RemoteSearch.java, source/net/yacy/search/query/SearchEvent.java |
Thu Oct 18 14:29:11 CEST 2012 by Michael Peter Christen | Refactoring and redesign of data architecture to make URIMetadataRow superfluous. The target is to make a solr document as the core of YaCy documents which would cause that many conversions can be removed. On the way to this target the Equivalence of URIMetadataRow and URIMetadataNode had to be removed to expose the usage of the old URIMetadataRow data structure. This refactoring already removes unneccessary conversions and should make memory usage during indexing lower. Changed Files: htroot/IndexControlRWIs_p.java, htroot/IndexControlURLs_p.java, htroot/api/yacydoc.java, htroot/yacy/crawlReceipt.java, htroot/yacy/transferURL.java, source/net/yacy/cora/document/MultiProtocolURI.java, source/net/yacy/crawler/data/ResultURLs.java, source/net/yacy/document/Document.java, source/net/yacy/kelondro/data/meta/DigestURI.java, source/net/yacy/kelondro/data/meta/URIMetadataNode.java, source/net/yacy/kelondro/data/meta/URIMetadataRow.java, source/net/yacy/kelondro/data/word/WordReferenceVars.java, source/net/yacy/peers/Protocol.java, source/net/yacy/peers/Transmission.java, source/net/yacy/repository/Blacklist.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/index/DocumentIndex.java, source/net/yacy/search/index/Fulltext.java, source/net/yacy/search/index/Segment.java, source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/query/SnippetProcess.java, source/net/yacy/search/snippet/TextSnippet.java |
Wed Oct 17 17:45:41 CEST 2012 by Michael Peter Christen | removed hack which translated Solr documents to virtual RWI entries which had been then mixed with remote RWIs. Now these Solr documents are feeded into the result set as they appear during local and remote search. That makes the search much faster. Changed Files: htroot/yacy/transferURL.java, source/net/yacy/cora/federate/solr/connector/SolrServerConnector.java, source/net/yacy/crawler/data/ResultURLs.java, source/net/yacy/kelondro/data/meta/URIMetadataNode.java, source/net/yacy/kelondro/data/word/WordReferenceRow.java, source/net/yacy/peers/Protocol.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/index/Segment.java, source/net/yacy/search/query/RWIProcess.java, source/net/yacy/search/query/SnippetProcess.java, source/net/yacy/search/ranking/ReferenceOrder.java, source/net/yacy/search/snippet/ResultEntry.java |
Tue Oct 16 18:11:57 CEST 2012 by Michael Peter Christen | - removed dependencies from URIMetadataRow and made direct access to URIMetadataNode which creates the opportunity to access Solr objects directly and use their information richness - lazy initialization of the URIMetadataNode object - should cause less computation and memory usage during search. - removed dead code Changed Files: defaults/solr.keys.list, htroot/Bookmarks.java, htroot/CrawlResults.java, htroot/HostBrowser.java, htroot/IndexControlRWIs_p.java, htroot/IndexControlURLs_p.java, htroot/ViewFile.java, htroot/Vocabulary_p.java, htroot/api/yacydoc.java, htroot/yacy/urls.java, htroot/yacysearch.java, source/net/yacy/crawler/CrawlStacker.java, source/net/yacy/crawler/retrieval/SitemapImporter.java, source/net/yacy/data/ymark/YMarkMetadata.java, source/net/yacy/kelondro/data/meta/URIMetadataNode.java, source/net/yacy/kelondro/data/meta/URIMetadataRow.java, source/net/yacy/kelondro/data/word/WordReferenceRow.java, source/net/yacy/kelondro/data/word/WordReferenceVars.java, source/net/yacy/kelondro/index/RowSet.java, source/net/yacy/peers/Transmission.java, source/net/yacy/peers/graphics/WebStructureGraph.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/index/Fulltext.java, source/net/yacy/search/index/Segment.java, source/net/yacy/search/query/RWIProcess.java |
Tue Oct 16 17:13:18 CEST 2012 by Michael Peter Christen | enhanced the HostBrowser: - showing also outbound links to other domains if there are any - the outbound links browser shows also the link structure image - showing even inbound links if the web structure graph has information about that - removed the left menu and made the HostBrowser a part of the top menu for search - moved the file search also to the top menu - added hover information in the HostBrowser to explain what the click means - because the HostBrowser also links to the Metadata viewer ViewFile, there should be a button to switch back to the HostBrowser: added that also. Changed Files: htroot/HostBrowser.html, htroot/HostBrowser.java, htroot/ViewFile.html, htroot/env/templates/header.template, htroot/env/templates/simpleheader.template, htroot/yacyinteractive.html, source/net/yacy/search/index/SolrConfiguration.java |
Mon Oct 15 13:17:13 CEST 2012 by Michael Peter Christen | - enhanced generation of url objects - enhanced computation of link structure graphics - enhanced collection of data for link structures Changed Files: htroot/CrawlStartScanner_p.java, htroot/Crawler_p.java, htroot/ServerScannerList.java, htroot/WatchWebStructure_p.html, htroot/WatchWebStructure_p.java, htroot/WebStructurePicture_p.java, htroot/api/webstructure.java, source/net/yacy/crawler/CrawlStacker.java, source/net/yacy/crawler/retrieval/HTTPLoader.java, source/net/yacy/data/BookmarkHelper.java, source/net/yacy/document/parser/sevenzipParser.java, source/net/yacy/document/parser/tarParser.java, source/net/yacy/document/parser/zipParser.java, source/net/yacy/kelondro/data/meta/DigestURI.java, source/net/yacy/peers/graphics/WebStructureGraph.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/index/Segment.java, source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/snippet/MediaSnippet.java, source/net/yacy/server/http/HTTPDProxyHandler.java |
Wed Oct 10 11:46:22 CEST 2012 by Michael Peter Christen | removed the option to prevent removal of & parts inside of the MultiProtocolURI during normalform computation because that should always be done and also be done during initialization of the MultiProtocolURI Object. The new normalform method takes only one argument which should be 'true' unless you know exactly what you are doing. Changed Files: htroot/Bookmarks.java, htroot/Collage.java, htroot/CrawlCheck_p.java, htroot/CrawlResults.java, htroot/CrawlStartScanner_p.java, htroot/Crawler_p.java, htroot/IndexControlRWIs_p.java, htroot/IndexControlURLs_p.java, htroot/IndexCreateLoaderQueue_p.java, htroot/IndexCreateParserErrors_p.java, htroot/IndexCreateQueues_p.java, htroot/IndexImportOAIPMH_p.java, htroot/Load_RSS_p.java, htroot/QuickCrawlLink_p.java, htroot/ServerScannerList.java, htroot/SettingsAck_p.java, htroot/ViewFile.java, htroot/ViewImage.java, htroot/Vocabulary_p.java, htroot/api/getpageinfo.java, htroot/api/getpageinfo_p.java, htroot/api/webstructure.java, htroot/api/yacydoc.java, htroot/cytag.java, htroot/rct_p.java, htroot/yacy/crawlReceipt.java, htroot/yacy/transferURL.java, htroot/yacy/urls.java, htroot/yacysearch.java, htroot/yacysearchitem.java, source/net/yacy/cora/document/MultiProtocolURI.java, source/net/yacy/cora/document/RSSFeed.java, source/net/yacy/cora/document/RSSMessage.java, source/net/yacy/cora/protocol/Scanner.java, source/net/yacy/cora/protocol/http/HTTPClient.java, source/net/yacy/crawler/CrawlStacker.java, source/net/yacy/crawler/data/Cache.java, source/net/yacy/crawler/data/CrawlProfile.java, source/net/yacy/crawler/data/CrawlQueues.java, source/net/yacy/crawler/data/ResultImages.java, source/net/yacy/crawler/data/ZURL.java, source/net/yacy/crawler/retrieval/FTPLoader.java, source/net/yacy/crawler/retrieval/FileLoader.java, source/net/yacy/crawler/retrieval/HTTPLoader.java, source/net/yacy/crawler/retrieval/RSSLoader.java, source/net/yacy/crawler/retrieval/SMBLoader.java, source/net/yacy/crawler/robots/RobotsTxt.java, source/net/yacy/data/BookmarksDB.java, source/net/yacy/data/WorkTables.java, source/net/yacy/data/ymark/YMarkCrawlStart.java, source/net/yacy/data/ymark/YMarkTables.java, source/net/yacy/document/Condenser.java, source/net/yacy/document/Document.java, source/net/yacy/document/Parser.java, source/net/yacy/document/TextParser.java, source/net/yacy/document/content/DCEntry.java, source/net/yacy/document/importer/OAIPMHImporter.java, source/net/yacy/document/importer/OAIPMHLoader.java, source/net/yacy/document/importer/ResumptionToken.java, source/net/yacy/document/parser/augment/AugmentParser.java, source/net/yacy/document/parser/html/ContentScraper.java, source/net/yacy/document/parser/html/EmbedEntry.java, source/net/yacy/document/parser/html/ImageEntry.java, source/net/yacy/document/parser/sitemapParser.java, source/net/yacy/interaction/AugmentHtmlStream.java, source/net/yacy/kelondro/data/meta/DigestURI.java, source/net/yacy/kelondro/data/meta/URIMetadataNode.java, source/net/yacy/kelondro/data/meta/URIMetadataRow.java, source/net/yacy/kelondro/data/word/WordReferenceVars.java, source/net/yacy/repository/LoaderDispatcher.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/index/Fulltext.java, source/net/yacy/search/index/Segment.java, source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/query/RWIProcess.java, source/net/yacy/search/query/SnippetProcess.java, source/net/yacy/search/snippet/MediaSnippet.java, source/net/yacy/search/snippet/ResultEntry.java, source/net/yacy/server/http/AugmentedHtmlStream.java, source/net/yacy/server/http/HTTPDProxyHandler.java |
Tue Oct 09 11:48:55 CEST 2012 by Michael Peter Christen | since the solr index is now used for all pages that are indexed locally, there is no need for the RWI index if the index is not transfered to another peer. Therefore the creation of RWI index data is now suppressed if DHT is disabled. This applies for all intranet and portal mode configurations, but not for public robinson modes. A robinson may switch back to public mode and then transmit its data. That means if someone wants to switch never to DHT mode, it would be more appropriate to choose the portal mode. Changed Files: htroot/ConfigNetwork_p.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/SwitchboardConstants.java, source/net/yacy/search/index/DocumentIndex.java, source/net/yacy/search/index/Segment.java |
Mon Oct 08 10:50:24 CEST 2012 by Michael Peter Christen | code cleanup: removed unised methods and made more methods and objects private Changed Files: htroot/imagetest.java, htroot/yacy/transferRWI.java, source/net/yacy/data/ymark/YMarkCrawlStart.java, source/net/yacy/kelondro/blob/ArrayStack.java, source/net/yacy/kelondro/blob/BEncodedHeap.java, source/net/yacy/kelondro/blob/Compressor.java, source/net/yacy/kelondro/blob/HeapReader.java, source/net/yacy/kelondro/blob/HeapWriter.java, source/net/yacy/kelondro/blob/MapHeap.java, source/net/yacy/kelondro/blob/Stack.java, source/net/yacy/kelondro/blob/Tables.java, source/net/yacy/kelondro/blob/TablesColumnIndex.java, source/net/yacy/kelondro/data/citation/CitationReference.java, source/net/yacy/kelondro/data/meta/DigestURI.java, source/net/yacy/kelondro/data/meta/URIMetadataNode.java, source/net/yacy/kelondro/data/meta/URIMetadataRow.java, source/net/yacy/kelondro/data/navigation/NavigationReferenceRow.java, source/net/yacy/kelondro/data/navigation/NavigationReferenceVars.java, source/net/yacy/kelondro/data/word/Word.java, source/net/yacy/kelondro/data/word/WordReferenceRow.java, source/net/yacy/kelondro/data/word/WordReferenceVars.java, source/net/yacy/kelondro/logging/GuiHandler.java, source/net/yacy/kelondro/logging/Log.java, source/net/yacy/kelondro/logging/LogParser.java, source/net/yacy/kelondro/logging/LogalizerHandler.java, source/net/yacy/kelondro/logging/ThreadDump.java, source/net/yacy/kelondro/rwi/ReferenceContainerArray.java |
Sun Oct 07 07:46:55 CEST 2012 by Michael Peter Christen | - redesign of solr query construction - fix for solr boosts and location search - fix for number of search results in local search Changed Files: source/net/yacy/cora/federate/solr/connector/AbstractSolrConnector.java, source/net/yacy/cora/federate/solr/connector/EmbeddedSolrConnector.java, source/net/yacy/cora/federate/solr/connector/MirrorSolrConnector.java, source/net/yacy/cora/federate/solr/connector/MultipleSolrConnector.java, source/net/yacy/cora/federate/solr/connector/RemoteSolrConnector.java, source/net/yacy/cora/federate/solr/connector/RetrySolrConnector.java, source/net/yacy/cora/federate/solr/connector/ShardSolrConnector.java, source/net/yacy/cora/federate/solr/connector/SolrConnector.java, source/net/yacy/cora/federate/solr/connector/SolrServerConnector.java, source/net/yacy/document/content/DCEntry.java, source/net/yacy/kelondro/index/RowCollection.java, source/net/yacy/peers/Protocol.java, source/net/yacy/search/query/QueryParams.java, source/net/yacy/search/query/RWIProcess.java, source/net/yacy/search/query/SearchEvent.java, source/net/yacy/search/query/SearchEventCache.java |
Sat Oct 06 03:34:52 CEST 2012 by Michael Peter Christen | clean-up: removed unused methods in kelondro Changed Files: source/net/yacy/cora/federate/solr/connector/ShardSelection.java, source/net/yacy/kelondro/rwi/AbstractIndex.java, source/net/yacy/kelondro/rwi/IODispatcher.java, source/net/yacy/kelondro/rwi/Index.java, source/net/yacy/kelondro/rwi/IndexCell.java, source/net/yacy/kelondro/rwi/ReferenceContainerArray.java, source/net/yacy/kelondro/rwi/ReferenceContainerCache.java, source/net/yacy/kelondro/rwi/ReferenceIterator.java, source/net/yacy/kelondro/rwi/TermSearch.java, source/net/yacy/kelondro/table/SQLTable.java, source/net/yacy/kelondro/util/BDecoder.java, source/net/yacy/kelondro/util/BEncoder.java, source/net/yacy/kelondro/util/Bitfield.java, source/net/yacy/kelondro/util/ByteArray.java, source/net/yacy/kelondro/util/ByteBuffer.java, source/net/yacy/kelondro/util/ConsoleInterface.java, source/net/yacy/kelondro/util/FileUtils.java, source/net/yacy/kelondro/util/Formatter.java, source/net/yacy/kelondro/util/ISO639.java, source/net/yacy/kelondro/util/OS.java, source/net/yacy/kelondro/util/ReverseMapIterator.java, source/net/yacy/kelondro/util/RotateIterator.java, source/net/yacy/kelondro/workflow/AbstractBusyThread.java, source/net/yacy/kelondro/workflow/AbstractThread.java, source/net/yacy/kelondro/workflow/BusyThread.java, source/net/yacy/kelondro/workflow/InstantBlockingThread.java, source/net/yacy/kelondro/workflow/InstantBusyThread.java, source/net/yacy/kelondro/workflow/WorkflowJob.java, source/net/yacy/kelondro/workflow/WorkflowProcessor.java |
Fri Oct 05 18:54:39 CEST 2012 by sof | Merge remote branch 'origin/master' Changed Files: .classpath, addon/YaCy.app/Contents/Info.plist, build.xml, defaults/solr.keys.list, defaults/yacy.init, htroot/gsa/searchresult.java, lib/dependencies.txt, lib/httpcore-4.2.2.License, lib/httpcore-4.2.2.jar, locales/de.lng, nbproject/project.xml, source/net/yacy/cora/federate/solr/YaCySchema.java, source/net/yacy/cora/federate/solr/responsewriter/GSAResponseWriter.java, source/net/yacy/cora/language/synonyms/SynonymLibrary.java, source/net/yacy/data/ymark/YMarkAutoTagger.java, source/net/yacy/document/Condenser.java, source/net/yacy/document/LibraryProvider.java, source/net/yacy/document/parser/html/ContentScraper.java, source/net/yacy/document/parser/torrentParser.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/index/DocumentIndex.java, source/net/yacy/search/index/Fulltext.java, source/net/yacy/search/index/Segment.java, source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/query/QueryParams.java, source/net/yacy/search/query/SearchEvent.java, source/net/yacy/server/serverObjects.java |
Fri Oct 05 18:54:26 CEST 2012 by apfelmaennchen | Added a parser for audio file tags (e.g. ID3 tags for MP3 files) based on the jaudiotagger library. The parser is disabled by default as it needs to store temporary files for non file:// protocols, which might be disliked. For your local MP3-collection it loads nicely Artist, Title, Album etc. from the audio files meta data. Changed Files: .classpath, build.xml, defaults/yacy.init, lib/jaudiotagger-2.0.4-20111207.115108-15.License, lib/jaudiotagger-2.0.4-20111207.115108-15.jar, source/net/yacy/document/TextParser.java, source/net/yacy/document/parser/audioTagParser.java, source/net/yacy/search/Switchboard.java |
Tue Oct 02 00:02:50 CEST 2012 by orbiter | added a synonyms_t field to solr and a process to read synonym files. This can be used to add another stemming to solr using stemming files that are expressed as synonyms for grammatical alternatives. The synonym/stemming files must have the following form: - each line is a comma-separated list of synonyms - the list of synonyms may be enclosed with {} (like the GSA synonyms file) - the file may contain comments which are lines starting with a '#' The synonym file(s) must be placed in DATA/DICTIONARIES/synonyms/ and are activated by default whenever a synonym file is in place. Then, for each word that is found in a document all synonyms are added to a long text field which is stored into synonyms_t. Processes using the synonyms must query with that field as optional matcher. Changed Files: defaults/solr.keys.list, source/net/yacy/cora/federate/solr/YaCySchema.java, source/net/yacy/data/ymark/YMarkAutoTagger.java, source/net/yacy/document/Condenser.java, source/net/yacy/document/LibraryProvider.java, source/net/yacy/document/parser/torrentParser.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/index/DocumentIndex.java, source/net/yacy/search/index/Segment.java, source/net/yacy/search/index/SolrConfiguration.java |
Fri Sep 28 22:45:16 CEST 2012 by Michael Peter Christen | added new Host Browser to main menu: this new search interface is something completely new for search, but completely common on desktops: browser a web space like one would browse a file system in a file browser. The file listing is created using the search index and a faceted restriction to specific domains. Changed Files: defaults/solr.keys.list, htroot/HostBrowser.html, htroot/HostBrowser.java, htroot/Ranking_p.html, htroot/env/templates/header.template, source/net/yacy/cora/document/MultiProtocolURI.java, source/net/yacy/cora/federate/solr/connector/SolrServerConnector.java, source/net/yacy/search/Switchboard.java |
Fri Sep 28 13:50:13 CEST 2012 by Michael Peter Christen | extended solr connector with a method to retrieve a single facet. Changed Files: source/net/yacy/cora/document/UTF8.java, source/net/yacy/cora/federate/solr/connector/AbstractSolrConnector.java, source/net/yacy/cora/federate/solr/connector/EmbeddedSolrConnector.java, source/net/yacy/cora/federate/solr/connector/MirrorSolrConnector.java, source/net/yacy/cora/federate/solr/connector/MultipleSolrConnector.java, source/net/yacy/cora/federate/solr/connector/RetrySolrConnector.java, source/net/yacy/cora/federate/solr/connector/ShardSolrConnector.java, source/net/yacy/cora/federate/solr/connector/SolrConnector.java, source/net/yacy/cora/federate/solr/connector/SolrServerConnector.java, source/net/yacy/search/Switchboard.java |
Wed Sep 26 13:38:04 CEST 2012 by Michael Peter Christen | - removed ip_s from default profile since that needs a DNS lookup to create an document entry. This makes remote search much slower. - removed synchronization of add method if ip_s is activated to prevent that a user configuration causes bad behavior. The disadvantage of that is, that a index dump can cause data loss if an indexing is running during index dump - catched more exceptions and more NPE - better abstraction in MirrorSolrConnector - slight performance enhancement when only the index count is requested (rows=0 is sufficient to get a total count) Changed Files: defaults/solr.keys.list, htroot/IndexFederated_p.java, source/net/yacy/cora/federate/solr/connector/EmbeddedSolrConnector.java, source/net/yacy/cora/federate/solr/connector/MirrorSolrConnector.java, source/net/yacy/cora/federate/solr/connector/SolrServerConnector.java, source/net/yacy/peers/Protocol.java, source/net/yacy/peers/RemoteSearch.java, source/net/yacy/search/index/Fulltext.java |
Tue Sep 25 21:20:03 CEST 2012 by Michael Peter Christen | refactoring Changed Files: htroot/Bookmarks.java, htroot/CrawlResults.java, htroot/CrawlStartExpert_p.java, htroot/Crawler_p.java, htroot/DictionaryLoader_p.java, htroot/IndexControlRWIs_p.java, htroot/IndexControlURLs_p.java, htroot/IndexFederated_p.java, htroot/Load_RSS_p.java, htroot/PerformanceMemory_p.java, htroot/QuickCrawlLink_p.java, htroot/ViewFile.java, htroot/ViewImage.java, htroot/api/getpageinfo.java, htroot/api/getpageinfo_p.java, htroot/api/schema_p.java, htroot/api/webstructure.java, htroot/gsa/searchresult.java, htroot/solr/select.java, htroot/yacy/transferRWI.java, htroot/yacysearch.java, htroot/yacysearch_location.java, source/net/yacy/cora/federate/SearchAccumulator.java, source/net/yacy/cora/federate/SearchHub.java, source/net/yacy/cora/federate/SearchResult.java, source/net/yacy/cora/federate/opensearch/SRURSSConnector.java, source/net/yacy/cora/federate/solr/Schema.java, source/net/yacy/cora/federate/solr/SolrServlet.java, source/net/yacy/cora/federate/solr/SolrType.java, source/net/yacy/cora/federate/solr/YaCySchema.java, source/net/yacy/cora/federate/solr/connector/AbstractSolrConnector.java, source/net/yacy/cora/federate/solr/connector/EmbeddedSolrConnector.java, source/net/yacy/cora/federate/solr/connector/MirrorSolrConnector.java, source/net/yacy/cora/federate/solr/connector/MultipleSolrConnector.java, source/net/yacy/cora/federate/solr/connector/RemoteSolrConnector.java, source/net/yacy/cora/federate/solr/connector/RetrySolrConnector.java, source/net/yacy/cora/federate/solr/connector/ShardSelection.java, source/net/yacy/cora/federate/solr/connector/ShardSolrConnector.java, source/net/yacy/cora/federate/solr/connector/SolrConnector.java, source/net/yacy/cora/federate/solr/connector/SolrServerConnector.java, source/net/yacy/cora/federate/solr/responsewriter/EnhancedXMLResponseWriter.java, source/net/yacy/cora/federate/solr/responsewriter/GSAResponseWriter.java, source/net/yacy/cora/federate/solr/responsewriter/JsonResponseWriter.java, source/net/yacy/cora/federate/solr/responsewriter/OpensearchResponseWriter.java, source/net/yacy/cora/federate/yacy/CacheStrategy.java, source/net/yacy/cora/federate/yacy/ConfigurationSet.java, source/net/yacy/cora/federate/yacy/Distribution.java, source/net/yacy/cora/federate/yacy/Peer.java, source/net/yacy/cora/federate/yacy/Peers.java, source/net/yacy/cora/federate/yacy/api/Network.java, source/net/yacy/crawler/Balancer.java, source/net/yacy/crawler/CrawlSwitchboard.java, source/net/yacy/crawler/data/CrawlProfile.java, source/net/yacy/crawler/data/CrawlQueues.java, source/net/yacy/crawler/data/ZURL.java, source/net/yacy/crawler/retrieval/RSSLoader.java, source/net/yacy/data/ymark/YMarkAutoTagger.java, source/net/yacy/data/ymark/YMarkCrawlStart.java, source/net/yacy/data/ymark/YMarkMetadata.java, source/net/yacy/document/importer/OAIListFriendsLoader.java, source/net/yacy/document/importer/OAIPMHLoader.java, source/net/yacy/kelondro/data/meta/URIMetadataNode.java, source/net/yacy/peers/DHTSelection.java, source/net/yacy/peers/Dispatcher.java, source/net/yacy/peers/Protocol.java, source/net/yacy/peers/Seed.java, source/net/yacy/peers/SeedDB.java, source/net/yacy/peers/graphics/NetworkGraph.java, source/net/yacy/peers/graphics/OSMTile.java, source/net/yacy/peers/operation/yacyRelease.java, source/net/yacy/repository/LoaderDispatcher.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/index/Fulltext.java, source/net/yacy/search/index/Segment.java, source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/query/QueryParams.java, source/net/yacy/search/query/RWIProcess.java, source/net/yacy/search/query/SearchEvent.java, source/net/yacy/search/query/SnippetProcess.java, source/net/yacy/search/snippet/MediaSnippet.java, source/net/yacy/search/snippet/TextSnippet.java, source/net/yacy/server/serverObjects.java |
Tue Sep 25 21:04:58 CEST 2012 by Michael Peter Christen | refactoring Changed Files: htroot/CrawlResults.java, htroot/CrawlStartExpert_p.java, htroot/IndexFederated_p.java, htroot/PerformanceMemory_p.java, htroot/api/schema_p.java, htroot/gsa/searchresult.java, htroot/solr/select.java, source/net/yacy/cora/services/federated/solr/Schema.java, source/net/yacy/cora/services/federated/solr/SolrServlet.java, source/net/yacy/cora/services/federated/solr/SolrType.java, source/net/yacy/cora/services/federated/solr/YaCySchema.java, source/net/yacy/cora/services/federated/solr/connector/AbstractSolrConnector.java, source/net/yacy/cora/services/federated/solr/connector/EmbeddedSolrConnector.java, source/net/yacy/cora/services/federated/solr/connector/MirrorSolrConnector.java, source/net/yacy/cora/services/federated/solr/connector/MultipleSolrConnector.java, source/net/yacy/cora/services/federated/solr/connector/RemoteSolrConnector.java, source/net/yacy/cora/services/federated/solr/connector/RetrySolrConnector.java, source/net/yacy/cora/services/federated/solr/connector/ShardSelection.java, source/net/yacy/cora/services/federated/solr/connector/ShardSolrConnector.java, source/net/yacy/cora/services/federated/solr/connector/SolrConnector.java, source/net/yacy/cora/services/federated/solr/connector/SolrServerConnector.java, source/net/yacy/cora/services/federated/solr/responsewriter/EnhancedXMLResponseWriter.java, source/net/yacy/cora/services/federated/solr/responsewriter/GSAResponseWriter.java, source/net/yacy/cora/services/federated/solr/responsewriter/JsonResponseWriter.java, source/net/yacy/cora/services/federated/solr/responsewriter/OpensearchResponseWriter.java, source/net/yacy/cora/services/federated/yacy/ConfigurationSet.java, source/net/yacy/crawler/data/ZURL.java, source/net/yacy/kelondro/data/meta/URIMetadataNode.java, source/net/yacy/peers/Protocol.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/index/Fulltext.java, source/net/yacy/search/index/Segment.java, source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/query/QueryParams.java, source/net/yacy/server/serverObjects.java |
Fri Sep 21 16:46:57 CEST 2012 by Michael Peter Christen | refactoring Changed Files: htroot/ConfigAccounts_p.java, htroot/CrawlStartScanner_p.java, htroot/IndexControlRWIs_p.java, htroot/IndexControlURLs_p.java, htroot/SettingsAck_p.java, htroot/Supporter.java, htroot/Surftips.java, htroot/User.java, htroot/ViewProfile.java, htroot/WebStructurePicture_p.java, htroot/api/bookmarks/posts/all.java, htroot/api/bookmarks/posts/get.java, htroot/api/webstructure.java, htroot/rct_p.java, htroot/yacy/hello.java, htroot/yacy/search.java, htroot/yacy/transferRWI.java, htroot/yacysearch.java, source/net/yacy/cora/date/MicroDate.java, source/net/yacy/cora/order/Base64Order.java, source/net/yacy/cora/order/Digest.java, source/net/yacy/cora/order/NaturalOrder.java, source/net/yacy/cora/order/StringOrder.java, source/net/yacy/cora/services/federated/yacy/YaCySchema.java, source/net/yacy/cora/services/federated/yacy/dht/HorizontalPartition.java, source/net/yacy/cora/services/federated/yacy/dht/Partition.java, source/net/yacy/cora/services/federated/yacy/dht/VerticalPartition.java, source/net/yacy/crawler/Balancer.java, source/net/yacy/crawler/CrawlStacker.java, source/net/yacy/crawler/CrawlSwitchboard.java, source/net/yacy/crawler/data/Cache.java, source/net/yacy/crawler/data/CrawlProfile.java, source/net/yacy/crawler/data/CrawlQueues.java, source/net/yacy/crawler/data/NoticedURL.java, source/net/yacy/crawler/data/ResultURLs.java, source/net/yacy/crawler/data/ZURL.java, source/net/yacy/crawler/retrieval/RSSLoader.java, source/net/yacy/crawler/retrieval/Request.java, source/net/yacy/data/BlogBoard.java, source/net/yacy/data/BlogBoardComments.java, source/net/yacy/data/BookmarkDate.java, source/net/yacy/data/BookmarksDB.java, source/net/yacy/data/MessageBoard.java, source/net/yacy/data/UserDB.java, source/net/yacy/data/WorkTables.java, source/net/yacy/data/wiki/WikiBoard.java, source/net/yacy/dbtest.java, source/net/yacy/document/Condenser.java, source/net/yacy/document/WordTokenizer.java, source/net/yacy/document/parser/vcfParser.java, source/net/yacy/kelondro/blob/ArrayStack.java, source/net/yacy/kelondro/blob/BEncodedHeap.java, source/net/yacy/kelondro/blob/BEncodedHeapBag.java, source/net/yacy/kelondro/blob/BEncodedHeapShard.java, source/net/yacy/kelondro/blob/Heap.java, source/net/yacy/kelondro/blob/HeapReader.java, source/net/yacy/kelondro/blob/MapColumnIndex.java, source/net/yacy/kelondro/blob/MapDataMining.java, source/net/yacy/kelondro/blob/MapHeap.java, source/net/yacy/kelondro/blob/ObjectBuffer.java, source/net/yacy/kelondro/blob/Stack.java, source/net/yacy/kelondro/blob/TablesColumnIndex.java, source/net/yacy/kelondro/blob/TablesColumnRAMIndex.java, source/net/yacy/kelondro/data/citation/CitationReference.java, source/net/yacy/kelondro/data/meta/DigestURI.java, source/net/yacy/kelondro/data/meta/URIMetadata.java, source/net/yacy/kelondro/data/meta/URIMetadataNode.java, source/net/yacy/kelondro/data/meta/URIMetadataRow.java, source/net/yacy/kelondro/data/navigation/NavigationReferenceRow.java, source/net/yacy/kelondro/data/word/Word.java, source/net/yacy/kelondro/data/word/WordReference.java, source/net/yacy/kelondro/data/word/WordReferenceRow.java, source/net/yacy/kelondro/data/word/WordReferenceVars.java, source/net/yacy/kelondro/index/BinSearch.java, source/net/yacy/kelondro/index/BufferedObjectIndex.java, source/net/yacy/kelondro/index/IndexTest.java, source/net/yacy/kelondro/index/RAMIndex.java, source/net/yacy/kelondro/index/RAMIndexCluster.java, source/net/yacy/kelondro/index/Row.java, source/net/yacy/kelondro/index/RowCollection.java, source/net/yacy/kelondro/index/RowHandleSet.java, source/net/yacy/kelondro/index/RowSet.java, source/net/yacy/kelondro/rwi/AbstractIndex.java, source/net/yacy/kelondro/rwi/IndexCell.java, source/net/yacy/kelondro/rwi/ReferenceContainer.java, source/net/yacy/kelondro/rwi/TermSearch.java, source/net/yacy/kelondro/table/Relations.java, source/net/yacy/kelondro/table/SQLTable.java, source/net/yacy/kelondro/table/SplitTable.java, source/net/yacy/kelondro/table/Table.java, source/net/yacy/kelondro/util/Bitfield.java, source/net/yacy/kelondro/util/MergeIterator.java, source/net/yacy/kelondro/util/RotateIterator.java, source/net/yacy/kelondro/util/StackIterator.java, source/net/yacy/migration.java, source/net/yacy/peers/DHTSelection.java, source/net/yacy/peers/Dispatcher.java, source/net/yacy/peers/Network.java, source/net/yacy/peers/NewsDB.java, source/net/yacy/peers/NewsQueue.java, source/net/yacy/peers/Protocol.java, source/net/yacy/peers/RemoteSearch.java, source/net/yacy/peers/Seed.java, source/net/yacy/peers/SeedDB.java, source/net/yacy/peers/Transmission.java, source/net/yacy/peers/graphics/NetworkGraph.java, source/net/yacy/peers/graphics/WebStructureGraph.java, source/net/yacy/peers/operation/yacyRelease.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/index/Fulltext.java, source/net/yacy/search/index/Segment.java, source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/query/QueryParams.java, source/net/yacy/search/query/SearchEvent.java, source/net/yacy/search/ranking/BlockRank.java, source/net/yacy/search/ranking/ReferenceOrder.java, source/net/yacy/search/snippet/MediaSnippet.java, source/net/yacy/search/snippet/ResultEntry.java, source/net/yacy/search/snippet/TextSnippet.java, source/net/yacy/server/http/HTTPDFileHandler.java, source/net/yacy/server/http/HTTPDemon.java, source/net/yacy/server/serverSwitch.java, source/net/yacy/utils/CryptoLib.java, source/net/yacy/utils/crypt.java, source/net/yacy/utils/cryptbig.java |
Fri Sep 21 15:48:16 CEST 2012 by Michael Peter Christen | refactoring Changed Files: htroot/AccessGrid_p.java, htroot/AccessPicture_p.java, htroot/AccessTracker_p.java, htroot/AugmentedBrowsingFilters_p.java, htroot/AugmentedBrowsing_p.java, htroot/AugmentedParsing_p.java, htroot/Banner.java, htroot/BlacklistCleaner_p.java, htroot/BlacklistImpExp_p.java, htroot/BlacklistTest_p.java, htroot/Blacklist_p.java, htroot/Blog.java, htroot/BlogComments.java, htroot/Bookmarks.java, htroot/CacheResource_p.java, htroot/Collage.java, htroot/ConfigAccounts_p.java, htroot/ConfigAppearance_p.java, htroot/ConfigBasic.java, htroot/ConfigHTCache_p.java, htroot/ConfigHeuristics_p.java, htroot/ConfigLanguage_p.java, htroot/ConfigLiveSearch.java, htroot/ConfigNetwork_p.java, htroot/ConfigParser.java, htroot/ConfigPortal.java, htroot/ConfigProfile_p.java, htroot/ConfigProperties_p.java, htroot/ConfigRobotsTxt_p.java, htroot/ConfigSearchBox.java, htroot/ConfigUpdate_p.java, htroot/Connections_p.java, htroot/ContentControl_p.java, htroot/ContentIntegrationPHPBB3_p.java, htroot/CookieMonitorIncoming_p.java, htroot/CookieMonitorOutgoing_p.java, htroot/CookieTest_p.java, htroot/CrawlMonitorRemoteStart.java, htroot/CrawlProfileEditor_p.java, htroot/CrawlResults.java, htroot/CrawlStartExpert_p.java, htroot/CrawlStartScanner_p.java, htroot/CrawlStartSite_p.java, htroot/Crawler_p.java, htroot/DemoServlet.java, htroot/DemoServletInteraction.java, htroot/DemoServletRDF.java, htroot/DictionaryLoader_p.java, htroot/Help.java, htroot/IndexControlRWIs_p.java, htroot/IndexControlURLs_p.java, htroot/IndexCreateDomainCrawl_p.java, htroot/IndexCreateLoaderQueue_p.java, htroot/IndexCreateParserErrors_p.java, htroot/IndexCreateQueues_p.java, htroot/IndexFederated_p.java, htroot/IndexImportMediawiki_p.java, htroot/IndexImportOAIPMHList_p.java, htroot/IndexImportOAIPMH_p.java, htroot/IndexShare_p.java, htroot/Load_MediawikiWiki.java, htroot/Load_PHPBB3.java, htroot/Load_RSS_p.java, htroot/MessageSend_p.java, htroot/Messages_p.java, htroot/Network.java, htroot/NetworkPicture.java, htroot/News.java, htroot/PeerLoadPicture.java, htroot/PerformanceConcurrency_p.java, htroot/PerformanceGraph.java, htroot/PerformanceMemory_p.java, htroot/PerformanceQueues_p.java, htroot/PerformanceSearch_p.java, htroot/Performance_p.java, htroot/ProxyIndexingMonitor_p.java, htroot/QuickCrawlLink_p.java, htroot/Ranking_p.java, htroot/RegexTest.java, htroot/RemoteCrawl_p.java, htroot/SearchEventPicture.java, htroot/ServerScannerList.java, htroot/SettingsAck_p.java, htroot/Settings_p.java, htroot/Status.java, htroot/Steering.java, htroot/Supporter.java, htroot/Surftips.java, htroot/Table_API_p.java, htroot/Table_RobotsTxt_p.java, htroot/Table_YMark_p.java, htroot/Tables_p.java, htroot/Threaddump_p.java, htroot/Trails.java, htroot/Triple_p.java, htroot/Triplestore_p.java, htroot/User.java, htroot/ViewFile.java, htroot/ViewImage.java, htroot/ViewLog_p.java, htroot/ViewProfile.java, htroot/Vocabulary_p.java, htroot/WatchWebStructure_p.java, htroot/WebStructurePicture_p.java, htroot/Wiki.java, htroot/WikiHelp.java, htroot/YBRFetch_p.java, htroot/YMarks.java, htroot/YaCySearchPluginFF.java, htroot/api/blacklists.java, htroot/api/blacklists_p.java, htroot/api/bookmarks/get_bookmarks.java, htroot/api/bookmarks/get_folders.java, htroot/api/bookmarks/posts/add_p.java, htroot/api/bookmarks/posts/all.java, htroot/api/bookmarks/posts/delete_p.java, htroot/api/bookmarks/posts/get.java, htroot/api/bookmarks/tags/addTag_p.java, htroot/api/bookmarks/tags/editTag_p.java, htroot/api/bookmarks/tags/getTag.java, htroot/api/bookmarks/xbel/xbel.java, htroot/api/config_p.java, htroot/api/feed.java, htroot/api/getpageinfo.java, htroot/api/getpageinfo_p.java, htroot/api/latency_p.java, htroot/api/schema_p.java, htroot/api/status_p.java, htroot/api/table_p.java, htroot/api/termlist_p.java, htroot/api/timeline.java, htroot/api/trail_p.java, htroot/api/version.java, htroot/api/webstructure.java, htroot/api/yacydoc.java, htroot/api/ymarks/add_ymark.java, htroot/api/ymarks/delete_ymark.java, htroot/api/ymarks/get_metadata.java, htroot/api/ymarks/get_tags.java, htroot/api/ymarks/get_treeview.java, htroot/api/ymarks/get_xbel.java, htroot/api/ymarks/get_ymark.java, htroot/api/ymarks/import_ymark.java, htroot/api/ymarks/manage_tags.java, htroot/api/ynetSearch.java, htroot/autoconfig.java, htroot/compare_yacy.java, htroot/cytag.java, htroot/env/style.java, htroot/gsa/searchresult.java, htroot/imagetest.java, htroot/index.java, htroot/interaction/GetRDF.java, htroot/interaction/PutRDF.java, htroot/interaction/Table.java, htroot/interaction/Triple.java, htroot/interaction_elements/Document_part.java, htroot/interaction_elements/Footer.java, htroot/interaction_elements/Loginstatus_part.java, htroot/interaction_elements/OverlayInteraction.java, htroot/interaction_elements/Tag_part.java, htroot/mediawiki_p.java, htroot/opensearchdescription.java, htroot/osm.java, htroot/rct_p.java, htroot/robots.java, htroot/sharedBlacklist_p.java, htroot/solr/select.java, htroot/ssitestservlet.java, htroot/suggest.java, htroot/test.java, htroot/www/welcome.java, htroot/yacy/crawlReceipt.java, htroot/yacy/hello.java, htroot/yacy/idx.java, htroot/yacy/list.java, htroot/yacy/message.java, htroot/yacy/profile.java, htroot/yacy/query.java, htroot/yacy/search.java, htroot/yacy/transferRWI.java, htroot/yacy/transferURL.java, htroot/yacy/urls.java, htroot/yacyinteractive.java, htroot/yacysearch.java, htroot/yacysearch_location.java, htroot/yacysearchitem.java, htroot/yacysearchlatestinfo.java, htroot/yacysearchtrailer.java, source/net/yacy/cora/ai/example/ConnectFour.java, source/net/yacy/cora/ai/example/Hanoi.java, source/net/yacy/cora/ai/example/SchwarzerPeter.java, source/net/yacy/cora/ai/example/testorder.java, source/net/yacy/cora/ai/greedy/AbstractFinding.java, source/net/yacy/cora/ai/greedy/AbstractModel.java, source/net/yacy/cora/ai/greedy/Agent.java, source/net/yacy/cora/ai/greedy/Asset.java, source/net/yacy/cora/ai/greedy/Attempts.java, source/net/yacy/cora/ai/greedy/Battle.java, source/net/yacy/cora/ai/greedy/Challenge.java, source/net/yacy/cora/ai/greedy/Context.java, source/net/yacy/cora/ai/greedy/ContextFactory.java, source/net/yacy/cora/ai/greedy/Engine.java, source/net/yacy/cora/ai/greedy/Finding.java, source/net/yacy/cora/ai/greedy/Goal.java, source/net/yacy/cora/ai/greedy/Model.java, source/net/yacy/cora/ai/greedy/Role.java, source/net/yacy/cora/ai/greedy/Unirole.java, source/net/yacy/cora/lod/JenaTripleStore.java, source/net/yacy/cora/services/federated/solr/JsonResponseWriter.java, source/net/yacy/crawler/Balancer.java, source/net/yacy/crawler/CrawlStacker.java, source/net/yacy/crawler/CrawlSwitchboard.java, source/net/yacy/crawler/data/Cache.java, source/net/yacy/crawler/data/CrawlProfile.java, source/net/yacy/crawler/data/CrawlQueues.java, source/net/yacy/crawler/data/Latency.java, source/net/yacy/crawler/data/NoticedURL.java, source/net/yacy/crawler/data/ResultImages.java, source/net/yacy/crawler/data/ResultURLs.java, source/net/yacy/crawler/data/ZURL.java, source/net/yacy/crawler/retrieval/FTPLoader.java, source/net/yacy/crawler/retrieval/FileLoader.java, source/net/yacy/crawler/retrieval/HTTPLoader.java, source/net/yacy/crawler/retrieval/ImporterException.java, source/net/yacy/crawler/retrieval/RSSLoader.java, source/net/yacy/crawler/retrieval/Request.java, source/net/yacy/crawler/retrieval/Response.java, source/net/yacy/crawler/retrieval/SMBLoader.java, source/net/yacy/crawler/retrieval/SitemapImporter.java, source/net/yacy/crawler/robots/RobotsTxt.java, source/net/yacy/crawler/robots/RobotsTxtEntry.java, source/net/yacy/crawler/robots/RobotsTxtParser.java, source/net/yacy/data/BlogBoard.java, source/net/yacy/data/BlogBoardComments.java, source/net/yacy/data/BookmarkDate.java, source/net/yacy/data/BookmarkHelper.java, source/net/yacy/data/BookmarksDB.java, source/net/yacy/data/DidYouMean.java, source/net/yacy/data/Diff.java, source/net/yacy/data/ListManager.java, source/net/yacy/data/MessageBoard.java, source/net/yacy/data/Translator.java, source/net/yacy/data/URLLicense.java, source/net/yacy/data/UserDB.java, source/net/yacy/data/WorkTables.java, source/net/yacy/data/list/ListAccumulator.java, source/net/yacy/data/list/XMLBlacklistImporter.java, source/net/yacy/data/wiki/AbstractWikiParser.java, source/net/yacy/data/wiki/WikiBoard.java, source/net/yacy/data/wiki/WikiCode.java, source/net/yacy/data/wiki/WikiParser.java, source/net/yacy/data/ymark/MonitoredReader.java, source/net/yacy/data/ymark/TablesRowComparator.java, source/net/yacy/data/ymark/YMarkAutoTagger.java, source/net/yacy/data/ymark/YMarkCrawlStart.java, source/net/yacy/data/ymark/YMarkDMOZImporter.java, source/net/yacy/data/ymark/YMarkDate.java, source/net/yacy/data/ymark/YMarkEntry.java, source/net/yacy/data/ymark/YMarkHTMLImporter.java, source/net/yacy/data/ymark/YMarkImporter.java, source/net/yacy/data/ymark/YMarkJSONImporter.java, source/net/yacy/data/ymark/YMarkMetadata.java, source/net/yacy/data/ymark/YMarkRDF.java, source/net/yacy/data/ymark/YMarkSMWJSONImporter.java, source/net/yacy/data/ymark/YMarkTables.java, source/net/yacy/data/ymark/YMarkTag.java, source/net/yacy/data/ymark/YMarkUtil.java, source/net/yacy/data/ymark/YMarkXBELImporter.java, source/net/yacy/document/Document.java, source/net/yacy/document/importer/MediawikiImporter.java, source/net/yacy/document/importer/OAIListFriendsLoader.java, source/net/yacy/document/importer/OAIPMHLoader.java, source/net/yacy/document/parser/augment/AugmentParser.java, source/net/yacy/interaction/AugmentHtmlStream.java, source/net/yacy/interaction/Interaction.java, source/net/yacy/interaction/contentcontrol/ContentControlImportThread.java, source/net/yacy/kelondro/blob/Tables.java, source/net/yacy/kelondro/blob/TablesColumnBLOBIndex.java, source/net/yacy/kelondro/data/meta/URIMetadata.java, source/net/yacy/kelondro/data/meta/URIMetadataNode.java, source/net/yacy/kelondro/data/meta/URIMetadataRow.java, source/net/yacy/kelondro/logging/ThreadDump.java, source/net/yacy/kelondro/util/OS.java, source/net/yacy/peers/Network.java, source/net/yacy/peers/PeerActions.java, source/net/yacy/peers/Protocol.java, source/net/yacy/peers/Seed.java, source/net/yacy/peers/SeedDB.java, source/net/yacy/peers/graphics/OSMTile.java, source/net/yacy/peers/operation/yacyRelease.java, source/net/yacy/peers/operation/yacySeedUploadFile.java, source/net/yacy/peers/operation/yacySeedUploadFtp.java, source/net/yacy/peers/operation/yacySeedUploadScp.java, source/net/yacy/peers/operation/yacySeedUploader.java, source/net/yacy/repository/LoaderDispatcher.java, source/net/yacy/search/IndexingQueueEntry.java, source/net/yacy/search/ResourceObserver.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/SwitchboardConstants.java, source/net/yacy/search/index/Segment.java, source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/query/SearchEvent.java, source/net/yacy/search/query/SearchEventCache.java, source/net/yacy/search/query/SnippetProcess.java, source/net/yacy/search/snippet/MediaSnippet.java, source/net/yacy/search/snippet/TextSnippet.java, source/net/yacy/server/http/AlternativeDomainNames.java, source/net/yacy/server/http/AugmentedHtmlStream.java, source/net/yacy/server/http/ChunkedInputStream.java, source/net/yacy/server/http/ChunkedOutputStream.java, source/net/yacy/server/http/ContentLengthInputStream.java, source/net/yacy/server/http/HTTPDFileHandler.java, source/net/yacy/server/http/HTTPDProxyHandler.java, source/net/yacy/server/http/HTTPDemon.java, source/net/yacy/server/http/MultiOutputStream.java, source/net/yacy/server/http/ProxyLogFormatter.java, source/net/yacy/server/http/RobotsTxtConfig.java, source/net/yacy/server/http/ServerSideIncludes.java, source/net/yacy/server/http/TemplateEngine.java, source/net/yacy/server/serverAccessTracker.java, source/net/yacy/server/serverClassLoader.java, source/net/yacy/server/serverCore.java, source/net/yacy/server/serverCoreSocket.java, source/net/yacy/server/serverHandler.java, source/net/yacy/server/serverObjects.java, source/net/yacy/server/serverSwitch.java, source/net/yacy/server/serverSwitchAbstractAction.java, source/net/yacy/server/servletProperties.java, source/net/yacy/utils/CryptoLib.java, source/net/yacy/utils/ListDirs.java, source/net/yacy/utils/PKCS12Tool.java, source/net/yacy/utils/SignatureOutputStream.java, source/net/yacy/utils/UPnP.java, source/net/yacy/utils/bitfield.java, source/net/yacy/utils/crypt.java, source/net/yacy/utils/cryptbig.java, source/net/yacy/utils/disorderHeap.java, source/net/yacy/utils/disorderSet.java, source/net/yacy/utils/enumerateFiles.java, source/net/yacy/utils/gzip.java, source/net/yacy/utils/loaderCore.java, source/net/yacy/utils/loaderProcess.java, source/net/yacy/utils/loaderThreads.java, source/net/yacy/utils/nxTools.java, source/net/yacy/utils/tarTools.java, source/net/yacy/utils/whois.java, source/net/yacy/yacy.java |
Fri Sep 21 11:02:36 CEST 2012 by orbiter | removed more dependencies in cora from kelondro Changed Files: htroot/CrawlResults.java, htroot/CrawlStartExpert_p.java, htroot/IndexFederated_p.java, htroot/api/schema_p.java, htroot/gsa/searchresult.java, htroot/solr/select.java, source/de/anomic/server/serverObjects.java, source/net/yacy/cora/lod/JenaTripleStore.java, source/net/yacy/cora/services/federated/solr/AbstractSolrConnector.java, source/net/yacy/cora/services/federated/solr/GSAResponseWriter.java, source/net/yacy/cora/services/federated/solr/JsonResponseWriter.java, source/net/yacy/cora/services/federated/solr/MirrorSolrConnector.java, source/net/yacy/cora/services/federated/solr/OpensearchResponseWriter.java, source/net/yacy/cora/services/federated/solr/ShardSelection.java, source/net/yacy/cora/services/federated/yacy/ConfigurationSet.java, source/net/yacy/cora/services/federated/yacy/YaCySchema.java, source/net/yacy/cora/util/LookAheadIterator.java, source/net/yacy/cora/util/Memory.java, source/net/yacy/cora/util/SpaceExceededException.java, source/net/yacy/kelondro/blob/ArrayStack.java, source/net/yacy/kelondro/blob/HeapReader.java, source/net/yacy/kelondro/blob/MapHeap.java, source/net/yacy/kelondro/blob/Tables.java, source/net/yacy/kelondro/data/meta/URIMetadataNode.java, source/net/yacy/kelondro/rwi/ReferenceIterator.java, source/net/yacy/kelondro/table/ChunkIterator.java, source/net/yacy/kelondro/util/StandardMemoryStrategy.java, source/net/yacy/peers/graphics/WebStructureGraph.java, source/net/yacy/search/index/Fulltext.java, source/net/yacy/search/index/Segment.java, source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/query/QueryParams.java |
Thu Sep 20 19:38:22 CEST 2012 by orbiter | removed kelondro dependencies from cora Changed Files: htroot/DictionaryLoader_p.java, htroot/api/yacydoc.java, htroot/yacysearch.java, htroot/yacysearch_location.java, source/de/anomic/data/DidYouMean.java, source/net/yacy/cora/document/RSSMessage.java, source/net/yacy/cora/document/WordCache.java, source/net/yacy/cora/geo/GeoLocation.java, source/net/yacy/cora/geo/GeoPoint.java, source/net/yacy/cora/geo/GeonamesLocation.java, source/net/yacy/cora/geo/IntegerGeoPoint.java, source/net/yacy/cora/geo/Locations.java, source/net/yacy/cora/geo/OpenGeoDBLocation.java, source/net/yacy/cora/geo/OverarchingLocation.java, source/net/yacy/cora/lod/JenaTripleStore.java, source/net/yacy/cora/lod/vocabulary/Tagging.java, source/net/yacy/cora/protocol/Domains.java, source/net/yacy/cora/sorting/OrderedScoreMap.java, source/net/yacy/cora/storage/KeyList.java, source/net/yacy/cora/util/StringBuilderComparator.java, source/net/yacy/document/Autotagging.java, source/net/yacy/document/Condenser.java, source/net/yacy/document/LibraryProvider.java, source/net/yacy/document/WordTokenizer.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/query/AccessTracker.java, source/net/yacy/search/query/QueryParams.java |
Fri Sep 14 12:25:46 CEST 2012 by Michael Peter Christen | - added the possibility to have not one but a list of crawl start urls - the list of urls is entered in the expert crawl start in a textfield; the one-line input field was replaced with a text box - start urls can also be given in one single line where the urls are separated by a '|'-character - as an effect, the crawl profile cannot carry a single start url for identificaton because it is possible to have more. Therefore the url was removed from the crawl profile - this affect all servlets which display a crawl profile: removed the url field from all there servlets - to work consistently with several start urls and the other crawl starts which computed crawl start url lists from sitelists or sitemaps, the crawl start servlet was restructured completely - new rules for must-match patterns were created to make it possible that site crawl starts also work with several crawl starts at once Changed Files: htroot/CrawlProfileEditor_p.html, htroot/CrawlProfileEditor_p.java, htroot/CrawlStartExpert_p.html, htroot/CrawlStartExpert_p.java, htroot/Crawler_p.html, htroot/Crawler_p.java, htroot/QuickCrawlLink_p.java, source/de/anomic/crawler/CrawlProfile.java, source/de/anomic/crawler/CrawlQueues.java, source/de/anomic/crawler/CrawlSwitchboard.java, source/de/anomic/data/ymark/YMarkCrawlStart.java |
Mon Sep 10 10:12:38 CEST 2012 by Michael Peter Christen | - updated lucene libraries to 3.6.1 - added lucene-grouping which enables faceted search; try this: http://localhost:8090/solr/select?q=*:*&start=0&rows=3&facet=true&facet.field=host_s Changed Files: .classpath, addon/YaCy.app/Contents/Info.plist, build.xml, lib/lucene-analyzers-3.6.1.jar, lib/lucene-core-3.6.1.jar, lib/lucene-grouping-3.6.1.jar, lib/lucene-highlighter-3.6.1.jar, lib/lucene-phonetic-3.6.1.jar, lib/lucene-spatial-3.6.1.jar, lib/lucene-spellchecker-3.6.1.jar, source/net/yacy/kelondro/blob/Tables.java |
Mon Sep 10 07:05:20 CEST 2012 by Michael Peter Christen | Merge remote-tracking branch 'origin/master' Conflicts: htroot/api/ymarks/import_ymark.java source/de/anomic/data/ymark/YMarkEntry.java source/de/anomic/data/ymark/YMarkTables.java Changed Files: htroot/Table_YMark_p.java, htroot/YMarks.html, htroot/YMarks.java, htroot/YMarks.rdf, htroot/api/ymarks/get_tags.java, htroot/api/ymarks/get_treeview.java, htroot/api/ymarks/get_xbel.java, htroot/api/ymarks/get_ymark.java, htroot/api/ymarks/import_ymark.java, htroot/api/ymarks/manage_tags.java, htroot/js/yacy-ymarks-bookmark-actions.js, htroot/js/yacy-ymarks.js, source/de/anomic/data/ymark/MonitoredReader.java, source/de/anomic/data/ymark/YMarkAutoTagger.java, source/de/anomic/data/ymark/YMarkCrawlStart.java, source/de/anomic/data/ymark/YMarkDMOZImporter.java, source/de/anomic/data/ymark/YMarkEntry.java, source/de/anomic/data/ymark/YMarkHTMLImporter.java, source/de/anomic/data/ymark/YMarkImporter.java, source/de/anomic/data/ymark/YMarkRDF.java, source/de/anomic/data/ymark/YMarkTables.java, source/de/anomic/data/ymark/YMarkUtil.java, source/de/anomic/data/ymark/YMarkXBELImporter.java, source/net/yacy/cora/lod/vocabulary/AnnoteaA.java, source/net/yacy/cora/lod/vocabulary/AnnoteaB.java, source/net/yacy/cora/lod/vocabulary/DCElements.java, source/net/yacy/cora/lod/vocabulary/DMOZ.java, source/net/yacy/cora/lod/vocabulary/Rdf.java, source/net/yacy/interaction/contentcontrol/ContentControlFilterUpdateThread.java, source/net/yacy/kelondro/blob/Tables.java, source/net/yacy/kelondro/blob/TablesColumnBLOBIndex.java, source/net/yacy/kelondro/blob/TablesColumnIndex.java, source/net/yacy/kelondro/blob/TablesColumnRAMIndex.java |
Sun Sep 09 09:53:58 CEST 2012 by apfelmaennchen | - added dmoz RDF dump importer - added indexing to Tables columns to support larger bookmark collections - added RDF output (HTTP) for public bookmarks at /YMarks.rdf - YMarkRDF also provides a Jena RDF Model as "internal" API - various other changes/fixes for YMarks (mainly backend) Changed Files: htroot/Table_YMark_p.java, htroot/YMarks.html, htroot/YMarks.java, htroot/YMarks.rdf, htroot/api/ymarks/get_tags.java, htroot/api/ymarks/get_treeview.java, htroot/api/ymarks/get_xbel.java, htroot/api/ymarks/get_ymark.java, htroot/api/ymarks/import_ymark.java, htroot/api/ymarks/manage_tags.java, htroot/js/yacy-ymarks-bookmark-actions.js, htroot/js/yacy-ymarks.js, source/de/anomic/data/ymark/MonitoredReader.java, source/de/anomic/data/ymark/YMarkAutoTagger.java, source/de/anomic/data/ymark/YMarkCrawlStart.java, source/de/anomic/data/ymark/YMarkDMOZImporter.java, source/de/anomic/data/ymark/YMarkEntry.java, source/de/anomic/data/ymark/YMarkHTMLImporter.java, source/de/anomic/data/ymark/YMarkImporter.java, source/de/anomic/data/ymark/YMarkRDF.java, source/de/anomic/data/ymark/YMarkTables.java, source/de/anomic/data/ymark/YMarkUtil.java, source/de/anomic/data/ymark/YMarkXBELImporter.java, source/net/yacy/cora/lod/vocabulary/AnnoteaA.java, source/net/yacy/cora/lod/vocabulary/AnnoteaB.java, source/net/yacy/cora/lod/vocabulary/DCElements.java, source/net/yacy/cora/lod/vocabulary/DMOZ.java, source/net/yacy/cora/lod/vocabulary/Rdf.java, source/net/yacy/kelondro/blob/Tables.java, source/net/yacy/kelondro/blob/TablesColumnBLOBIndex.java, source/net/yacy/kelondro/blob/TablesColumnIndex.java, source/net/yacy/kelondro/blob/TablesColumnRAMIndex.java |
Tue Sep 04 11:23:41 CEST 2012 by Michael Peter Christen | small style changes Changed Files: htroot/ContentIntegrationPHPBB3_p.html, htroot/CookieMonitorIncoming_p.html, htroot/CookieMonitorOutgoing_p.html, htroot/CookieTest_p.html, htroot/IndexControlRWIs_p.html, htroot/IndexControlURLs_p.html, htroot/IndexCreateDomainCrawl_p.html, htroot/env/grafics/good.png, htroot/env/grafics/ok.png, skins/pdblue.css |
Mon Sep 03 16:04:57 CEST 2012 by Michael Peter Christen | added new button design to more buttons Changed Files: htroot/ConfigAccounts_p.html, htroot/ConfigAppearance_p.html, htroot/ConfigBasic.html, htroot/ConfigHTCache_p.html, htroot/ConfigLanguage_p.html, htroot/ConfigNetwork_p.html, htroot/ConfigParser.html, htroot/ConfigPortal.html, htroot/ConfigProfile_p.html, htroot/ConfigProperties_p.html, htroot/ConfigRobotsTxt_p.html, htroot/ConfigUpdate_p.html, htroot/ContentControl_p.html, htroot/ContentIntegrationPHPBB3_p.html, htroot/CookieMonitorIncoming_p.html, htroot/CookieMonitorOutgoing_p.html, htroot/CookieTest_p.html, htroot/CrawlProfileEditor_p.html, htroot/CrawlStartScanner_p.html, htroot/IndexControlRWIs_p.html, htroot/IndexControlURLs_p.html, htroot/IndexCreateDomainCrawl_p.html, htroot/api/table_p.html, htroot/api/ymarks/test_import.html, htroot/www/welcome.html, skins/pdblue.css |
Mon Sep 03 15:26:08 CEST 2012 by Michael Peter Christen | added a collection attribute to crawls and searches: - a solr field collection_sxt can be used to store a set of crawl tags - when this field is activated, a crawl tag can be assigned when crawls are started - the content of the collection field can be comma-separated, all of them are assigned to the documents when they are indexed as result of such a crawl start - a search result can be drilled down to a specific collection; this is currently only available in the solr interface and also in the gsa interface using the 'site' option - this adds a mandatory field for gsa queries (the google api demands that field all the time) Changed Files: defaults/solr.keys.list, defaults/yacy.init, htroot/CrawlStartExpert_p.html, htroot/CrawlStartExpert_p.java, htroot/CrawlStartSite_p.html, htroot/Crawler_p.java, htroot/QuickCrawlLink_p.java, htroot/api/ymarks/import_ymark.java, htroot/gsa/searchresult.java, source/de/anomic/crawler/CrawlProfile.java, source/de/anomic/crawler/CrawlSwitchboard.java, source/net/yacy/cora/services/federated/solr/GSAResponseWriter.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/index/DocumentIndex.java, source/net/yacy/search/index/Segment.java, source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/index/YaCySchema.java |
Fri Aug 31 13:03:00 CEST 2012 by Michael Peter Christen | - extended the solr interface by a references-by-word-count method - reduced danger that a non-existing RWI database causes NPEs - added Solr queries to did-you-mean: this makes it possible that our did-you-mean algorithm works together with only Solr and without RWIs Changed Files: htroot/IndexControlRWIs_p.java, htroot/IndexControlURLs_p.java, htroot/IndexShare_p.java, htroot/PerformanceQueues_p.java, htroot/api/status_p.java, htroot/suggest.java, htroot/yacy/query.java, htroot/yacy/transferRWI.java, htroot/yacysearch.java, source/de/anomic/data/DidYouMean.java, source/net/yacy/cora/services/federated/solr/EmbeddedSolrConnector.java, source/net/yacy/cora/services/federated/solr/MirrorSolrConnector.java, source/net/yacy/cora/services/federated/solr/MultipleSolrConnector.java, source/net/yacy/cora/services/federated/solr/RetrySolrConnector.java, source/net/yacy/cora/services/federated/solr/ShardSolrConnector.java, source/net/yacy/cora/services/federated/solr/SolrConnector.java, source/net/yacy/cora/services/federated/solr/SolrServerConnector.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/index/Segment.java, source/net/yacy/search/query/RWIProcess.java |
Fri Aug 31 10:30:43 CEST 2012 by Michael Peter Christen | - added new solr fields: title_count_i, title_chars_val, title_words_val description_count_i, description_chars_val, description_words_val - added many asserts to ensure data type correctness from YaCy to Solr and vice versa - made many fixes according to new findings from these asserts (!) Changed Files: defaults/solr.keys.list, defaults/yacy.logging, htroot/AccessTracker_p.html, htroot/api/getpageinfo.java, htroot/api/getpageinfo_p.java, source/de/anomic/crawler/retrieval/Response.java, source/de/anomic/http/server/HTTPDFileHandler.java, source/net/yacy/cora/document/MultiProtocolURI.java, source/net/yacy/cora/document/RSSFeed.java, source/net/yacy/cora/document/RSSMessage.java, source/net/yacy/document/AbstractParser.java, source/net/yacy/document/Document.java, source/net/yacy/document/content/DCEntry.java, source/net/yacy/document/parser/augment/AugmentParser.java, source/net/yacy/document/parser/csvParser.java, source/net/yacy/document/parser/docParser.java, source/net/yacy/document/parser/genericParser.java, source/net/yacy/document/parser/html/ContentScraper.java, source/net/yacy/document/parser/htmlParser.java, source/net/yacy/document/parser/images/genericImageParser.java, source/net/yacy/document/parser/mmParser.java, source/net/yacy/document/parser/odtParser.java, source/net/yacy/document/parser/ooxmlParser.java, source/net/yacy/document/parser/pdfParser.java, source/net/yacy/document/parser/pptParser.java, source/net/yacy/document/parser/rdfParser.java, source/net/yacy/document/parser/rdfa/impl/RDFaParser.java, source/net/yacy/document/parser/rssParser.java, source/net/yacy/document/parser/rtfParser.java, source/net/yacy/document/parser/sidAudioParser.java, source/net/yacy/document/parser/sitemapParser.java, source/net/yacy/document/parser/swfParser.java, source/net/yacy/document/parser/torrentParser.java, source/net/yacy/document/parser/vcfParser.java, source/net/yacy/document/parser/vsdParser.java, source/net/yacy/document/parser/xlsParser.java, source/net/yacy/kelondro/data/meta/URIMetadataNode.java, source/net/yacy/kelondro/data/meta/URIMetadataRow.java, source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/index/YaCySchema.java |
Wed Aug 29 09:04:28 CEST 2012 by cominch | add content control features for custom filter lists Changed Files: defaults/yacy.init, htroot/ContentControl_p.html, htroot/ContentControl_p.java, htroot/env/templates/submenuBlacklist.template, source/de/anomic/data/ymark/YMarkEntry.java, source/de/anomic/data/ymark/YMarkSMWJSONImporter.java, source/de/anomic/data/ymark/YMarkTables.java, source/net/yacy/interaction/contentcontrol/ContentControlFilterUpdateThread.java, source/net/yacy/interaction/contentcontrol/ContentControlImportThread.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/query/RWIProcess.java |
Tue Aug 28 16:58:06 CEST 2012 by Michael Peter Christen | - added a solr type definition verifier - fixed type definition found by the verifier - added multivalue-string fields for solr with extension 'sxt' - added multivalue-integer fields for solr with extension 'val' - renamed some solr attributes from txt to sxt - changed solr query line to an explicit AND/OR structure - added a country code second level domain list to Domains class; with parser - added a host string parser to get domain class name, country-code second-level domain and subdomain out of it - removed old coordinate attributes Changed Files: defaults/solr.keys.list, defaults/solr/schema.xml, htroot/api/schema_p.java, source/net/yacy/cora/protocol/Domains.java, source/net/yacy/cora/services/federated/solr/GSAResponseWriter.java, source/net/yacy/cora/services/federated/solr/SolrType.java, source/net/yacy/kelondro/data/meta/URIMetadataNode.java, source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/index/YaCySchema.java, source/net/yacy/search/query/QueryParams.java |
Mon Aug 27 14:41:33 CEST 2012 by Michael Peter Christen | - added faceted drill-down for host and geolocation to solr queries - added a new geolocation field to index schema, the old values are migrated if possible Changed Files: defaults/solr.keys.list, source/net/yacy/cora/services/federated/solr/SolrType.java, source/net/yacy/document/geolocation/GeoLocation.java, source/net/yacy/kelondro/data/meta/URIMetadataNode.java, source/net/yacy/peers/Protocol.java, source/net/yacy/peers/RemoteSearch.java, source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/index/YaCySchema.java, source/net/yacy/search/query/QueryParams.java, source/net/yacy/search/query/SearchEvent.java |
Commit | Description |
---|---|
Wed Nov 07 17:27:13 CET 2012 by Michael Peter Christen | fixed media search Changed Files: source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/query/SearchEvent.java |
Wed Nov 07 15:05:44 CET 2012 by Michael Peter Christen | fixed a problem with non-terminating crawls Changed Files: source/net/yacy/crawler/CrawlSwitchboard.java |
Wed Nov 07 14:58:28 CET 2012 by Michael Peter Christen | fix to ftp client Changed Files: source/net/yacy/cora/protocol/ftp/FTPClient.java |
Wed Nov 07 13:53:29 CET 2012 by Michael Peter Christen | fix for filetype naviagtor Changed Files: htroot/suggest.java, htroot/yacysearch.java, source/net/yacy/cora/sorting/AbstractScoreMap.java, source/net/yacy/kelondro/data/meta/DigestURI.java, source/net/yacy/search/query/QueryParams.java, source/net/yacy/search/query/RankingProcess.java, source/net/yacy/search/query/SearchEvent.java |
Wed Nov 07 12:52:19 CET 2012 by Michael Peter Christen | bugfixes for crawler Changed Files: htroot/Crawler_p.html, source/net/yacy/crawler/robots/RobotsTxt.java, source/net/yacy/search/Switchboard.java |
Wed Nov 07 02:46:51 CET 2012 by Michael Peter Christen | fixed npe for surrogate import Changed Files: source/net/yacy/search/index/Segment.java, source/net/yacy/search/index/SolrConfiguration.java |
Wed Nov 07 02:29:33 CET 2012 by Michael Peter Christen | fixed wrong order of result count values Changed Files: htroot/Crawler_p.java, htroot/yacysearchitem.java, htroot/yacysearchlatestinfo.java |
Mon Nov 05 22:14:52 CET 2012 by Michael Peter Christen | fix for some interface problems Changed Files: htroot/env/templates/header.template, source/net/yacy/server/http/HTTPDFileHandler.java |
Mon Nov 05 18:08:00 CET 2012 by Michael Peter Christen | fixed filetype modified for media types in text search Changed Files: source/net/yacy/cora/document/Classification.java, source/net/yacy/search/query/SearchEvent.java, source/net/yacy/search/query/SnippetWorker.java |
Mon Oct 29 11:56:07 CET 2012 by Michael Peter Christen | fixed getSize() which can use the cache size while the crawl is running Changed Files: source/net/yacy/cora/federate/solr/connector/MirrorSolrConnector.java |
Thu Oct 25 10:23:43 CEST 2012 by Michael Peter Christen | fix for host browser Changed Files: htroot/HostBrowser.java |
Tue Oct 23 18:11:49 CEST 2012 by Michael Peter Christen | fix for highlighting in gsa search Changed Files: source/net/yacy/cora/federate/solr/responsewriter/GSAResponseWriter.java |
Thu Oct 18 15:21:05 CEST 2012 by Michael Peter Christen | fixed more getSolrFieldName usages Changed Files: source/net/yacy/cora/federate/solr/responsewriter/GSAResponseWriter.java, source/net/yacy/cora/federate/solr/responsewriter/JsonResponseWriter.java, source/net/yacy/cora/federate/solr/responsewriter/OpensearchResponseWriter.java |
Wed Oct 17 18:06:44 CEST 2012 by Michael Peter Christen | fix for file parser problem Changed Files: source/net/yacy/kelondro/util/FileUtils.java |
Wed Oct 17 13:56:11 CEST 2012 by Michael Peter Christen | added an exception catch Changed Files: source/net/yacy/kelondro/util/FileUtils.java |
Thu Oct 11 10:16:37 CEST 2012 by Michael Peter Christen | fixed clear scripts Changed Files: bin/clearall.sh, bin/clearcache.sh, bin/clearindex.sh |
Wed Oct 10 10:40:32 CEST 2012 by Michael Peter Christen | fix for crawl start filter Changed Files: htroot/Crawler_p.java, source/net/yacy/crawler/data/CrawlProfile.java |
Tue Oct 09 23:11:31 CEST 2012 by orbiter | fixed interpretation of directDocByURL attribute during crawl start Changed Files: htroot/CrawlStartSite_p.html, htroot/Crawler_p.java, htroot/QuickCrawlLink_p.java |
Tue Oct 09 11:25:05 CEST 2012 by Michael Peter Christen | fix for ViewFile Changed Files: htroot/ViewFile.java |
Mon Oct 08 14:54:06 CEST 2012 by Michael Peter Christen | fix for portal mode Changed Files: source/net/yacy/cora/federate/yacy/Distribution.java |
Mon Oct 08 10:50:40 CEST 2012 by Michael Peter Christen | fixes to crawl profiles Changed Files: htroot/QuickCrawlLink_p.java, source/net/yacy/crawler/CrawlSwitchboard.java, source/net/yacy/crawler/data/CrawlProfile.java, source/net/yacy/search/Switchboard.java |
Thu Oct 04 22:44:44 CEST 2012 by orbiter | another fix to location search Changed Files: source/net/yacy/search/query/QueryParams.java, source/net/yacy/search/query/SearchEvent.java |
Thu Oct 04 14:46:40 CEST 2012 by orbiter | fix for location search query encoding Changed Files: source/net/yacy/search/query/QueryParams.java |
Fri Sep 28 10:24:57 CEST 2012 by Michael Peter Christen | some more after-refactoring fixes Changed Files: addon/yacy-svn-4.spec, build.xml, defaults/yacy.logging |
Thu Sep 27 17:23:10 CEST 2012 by Michael Peter Christen | fix for debian package installation (caused by refactoring) Changed Files: debian/postinst |
Wed Sep 26 09:56:16 CEST 2012 by apfelmaennchen | fix for java.lang.RuntimeException: TableColumnIndex not available... Changed Files: source/net/yacy/data/ymark/YMarkTables.java, source/net/yacy/kelondro/blob/TableColumnIndexException.java, source/net/yacy/kelondro/blob/Tables.java |
Tue Sep 25 23:20:09 CEST 2012 by Michael Peter Christen | fixed bad output in stopYACY.sh Changed Files: stopYACY.sh |
Fri Sep 21 16:05:17 CEST 2012 by Michael Peter Christen | fix for no depth limit default value Changed Files: source/net/yacy/crawler/data/CrawlProfile.java |
Wed Sep 19 06:36:07 CEST 2012 by Michael Peter Christen | fixed size parsing in RSS message parser (for YaCy size parameter) Changed Files: source/net/yacy/cora/document/RSSMessage.java, source/net/yacy/cora/services/federated/SearchHub.java |
Tue Sep 18 11:06:36 CEST 2012 by Michael Peter Christen | fix for success query counter Changed Files: source/net/yacy/cora/services/federated/solr/AbstractSolrConnector.java |
Tue Sep 11 20:24:27 CEST 2012 by Michael Peter Christen | fixed a bug with size_i field usage Changed Files: source/net/yacy/cora/services/federated/solr/JsonResponseWriter.java |
Mon Sep 10 20:22:26 CEST 2012 by Marc Nause | *) fix for http://www.yacy-forum.org/viewtopic.php?f=2&t=759 Changed Files: locales/fr.lng |
Mon Sep 10 12:30:03 CEST 2012 by Michael Peter Christen | fix for images_withalt Changed Files: source/net/yacy/search/index/SolrConfiguration.java |
Fri Aug 31 14:35:56 CEST 2012 by Michael Peter Christen | added more patches to work without RWI data structure Changed Files: htroot/yacy/transferRWI.java, source/de/anomic/data/WorkTables.java, source/net/yacy/peers/Protocol.java, source/net/yacy/peers/dht/Dispatcher.java, source/net/yacy/peers/dht/Transmission.java, source/net/yacy/search/index/Segment.java |
Sun Aug 26 04:36:52 CEST 2012 by reger | fix path to env/grafics to display api icon on meta data page Changed Files: htroot/api/yacydoc.html |
Commit | Description |
---|---|
Wed Nov 07 23:14:45 CET 2012 by Michael Peter Christen | release 1.2 Changed Files: build.properties |
Wed Nov 07 21:26:01 CET 2012 by Michael Peter Christen | added new submenu 'Target Analysis' with three servlets which are useful to analyse the target servers: robots.txt table, mass target analysis and a regex tester Changed Files: htroot/CrawlCheck_p.html, htroot/RegexTest.html, htroot/Tables_p.html, htroot/env/templates/header.template, htroot/env/templates/submenuIndexCreate.template, htroot/env/templates/submenuTargetAnalysis.template |
Wed Nov 07 17:27:50 CET 2012 by Michael Peter Christen | do the commit anyway before calling a search interface Changed Files: htroot/HostBrowser.java, htroot/index.java, htroot/yacyinteractive.java, htroot/yacysearch.java |
Wed Nov 07 16:39:49 CET 2012 by Michael Peter Christen | using a better file name Changed Files: htroot/js/yacyinteractive.js |
Wed Nov 07 15:37:14 CET 2012 by Michael Peter Christen | removed warnings, removed too-fast pausing of crawls Changed Files: source/net/yacy/peers/Protocol.java, source/net/yacy/search/index/Segment.java, source/net/yacy/search/query/SearchEvent.java, source/net/yacy/search/query/SnippetWorker.java |
Wed Nov 07 15:06:13 CET 2012 by Michael Peter Christen | added matching of path to query pattern Changed Files: source/net/yacy/search/query/QueryParams.java |
Wed Nov 07 14:15:27 CET 2012 by Michael Peter Christen | update to search result logging (this was a remaining issue from the solr 4.0.0 migration) Changed Files: htroot/gsa/searchresult.java, htroot/solr/select.java, source/net/yacy/cora/federate/solr/connector/EmbeddedSolrConnector.java, source/net/yacy/cora/federate/solr/connector/SolrServerConnector.java |
Wed Nov 07 12:23:21 CET 2012 by Michael Peter Christen | better colors for host browser and corrected document count Changed Files: htroot/HostBrowser.html, htroot/HostBrowser.java, htroot/env/base.css |
Wed Nov 07 02:17:24 CET 2012 by Michael Peter Christen | update to HostBrowser Changed Files: htroot/HostBrowser.java |
Wed Nov 07 02:04:41 CET 2012 by Michael Peter Christen | removed location search because it is only working in special cases Changed Files: htroot/env/templates/header.template |
Wed Nov 07 02:04:08 CET 2012 by Michael Peter Christen | more logging Changed Files: source/net/yacy/search/index/Fulltext.java |
Wed Nov 07 02:03:44 CET 2012 by Michael Peter Christen | automatically delete entries from the crawl profile list if crawl is terminated. Changed Files: source/net/yacy/crawler/CrawlSwitchboard.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/query/SearchEvent.java |
Tue Nov 06 15:21:56 CET 2012 by Michael Peter Christen | added information about the reason of pausing of crawls Changed Files: htroot/Crawler_p.java, htroot/Status.java, source/net/yacy/search/ResourceObserver.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/index/Segment.java |
Tue Nov 06 12:31:23 CET 2012 by Michael Peter Christen | added more thread-renaiming for search processes Changed Files: source/net/yacy/cora/federate/solr/connector/EmbeddedSolrConnector.java, source/net/yacy/cora/federate/solr/connector/RemoteSolrConnector.java |
Tue Nov 06 11:48:04 CET 2012 by Michael Peter Christen | set the thread name during solr queries to the solr query to get better debugging options Changed Files: source/net/yacy/cora/federate/solr/connector/EmbeddedSolrConnector.java |
Mon Nov 05 22:14:27 CET 2012 by Michael Peter Christen | when a new crawl is started, delete all entries about error-urls for crawl-start domains Changed Files: htroot/Crawler_p.java, source/net/yacy/crawler/data/ZURL.java, source/net/yacy/search/Switchboard.java |
Mon Nov 05 18:57:21 CET 2012 by Michael Peter Christen | added a hack which makes the HostBrowser more performant when the given host has a lot of urls. If the number of urls is > 1000, then the list of documents is restricted to such which have no subpath, if the root path is selected. However, this can cause a problem if no documents on the root path exist but only on paths below that root path. Changed Files: htroot/HostBrowser.html, htroot/HostBrowser.java |
Mon Nov 05 16:34:42 CET 2012 by Michael Peter Christen | automatically pause the crawler if there is a problem with solr Changed Files: source/net/yacy/search/index/Segment.java |
Mon Nov 05 15:36:42 CET 2012 by Michael Peter Christen | new submenu template Changed Files: htroot/env/templates/submenuSearchConfiguration.template |
Sun Nov 04 02:58:26 CET 2012 by orbiter | - added 'deleteold' option to crawler which causes that documents are deleted which are selected by a crawl filter (host or subpath) - site crawl used this option be default now - made option to deleteDomain() concurrency Changed Files: htroot/CrawlResults.java, htroot/CrawlStartExpert_p.html, htroot/CrawlStartSite_p.html, htroot/Crawler_p.java, htroot/IndexControlURLs_p.java, source/net/yacy/crawler/data/CrawlProfile.java, source/net/yacy/search/index/Fulltext.java |
Sun Nov 04 02:07:59 CET 2012 by reger | Fix Metadata handling - language default on missing lang property to "uk" (fix set to nothing) - language set to TLD (added call to existing language calculation from TLD) - coordinate number exception on possible lat/lon content of "NaN,NaN" adjust Netbeans IDE classpath (for Solr/Lucene 4.0.0 jars) Changed Files: nbproject/project.xml, source/net/yacy/kelondro/data/meta/URIMetadataRow.java |
Fri Nov 02 14:40:02 CET 2012 by Michael Peter Christen | host browser now shows also number of pending files per subdirectory + bugfixes Changed Files: htroot/HostBrowser.java |
Fri Nov 02 10:28:32 CET 2012 by Michael Peter Christen | code cleanup Changed Files: source/net/yacy/search/query/SearchEvent.java |
Fri Nov 02 10:27:44 CET 2012 by Michael Peter Christen | update to libraries required by solr 4.0.0 Changed Files: .classpath, addon/YaCy.app/Contents/Info.plist, build.xml, lib/commons-codec-1.7.License, lib/commons-codec-1.7.jar, lib/jcl-over-slf4j-1.6.4.jar, lib/log4j-over-slf4j-1.6.4.jar, lib/slf4j-api-1.6.4.jar, lib/slf4j-jdk14-1.6.4.jar |
Fri Nov 02 01:22:31 CET 2012 by Michael Peter Christen | - fixed the delete option in host browser - added a delete method which can be used to delete a full subpath in solr. Changed Files: htroot/HostBrowser.java, source/net/yacy/search/index/Fulltext.java |
Fri Nov 02 00:14:29 CET 2012 by Michael Peter Christen | added the MIME attribute for the R tag in GSA search result writer Changed Files: source/net/yacy/cora/federate/solr/responsewriter/GSAResponseWriter.java |
Thu Nov 01 21:38:05 CET 2012 by Michael Peter Christen | added the host browser as link to search results. that means you can select a browsing position after a search is done on the search results. Changed Files: htroot/env/grafics/minitree.png, htroot/yacysearchitem.html |
Thu Nov 01 17:40:06 CET 2012 by Michael Peter Christen | more refactoring - integrated the code of SnippetProcess into SearchEvent Changed Files: htroot/gsa/searchresult.java, htroot/solr/select.java, htroot/yacy/search.java, htroot/yacysearch.java, source/net/yacy/search/query/SearchEvent.java, source/net/yacy/search/query/SearchEventCache.java, source/net/yacy/search/query/SnippetWorker.java |
Wed Oct 31 23:47:08 CET 2012 by sixcooler | missing license-files (sorry I didn't commit theses files by mistake) Changed Files: lib/httpclient-4.2.2.License, lib/httpmime-4.2.2.License |
Wed Oct 31 23:29:47 CET 2012 by Michael Peter Christen | added missing libraries Changed Files: lib/httpclient-4.2.2.jar, lib/httpmime-4.2.2.jar |
Wed Oct 31 19:09:48 CET 2012 by sixcooler | bump to httpclient-4.2.2 Changed Files: .classpath, addon/YaCy.app/Contents/Info.plist, build.xml, lib/dependencies.txt, nbproject/project.xml |
Wed Oct 31 15:13:05 CET 2012 by Michael Peter Christen | added more / all new crawl profile fields into crawl profile editor Changed Files: htroot/CrawlProfileEditor_p.html, htroot/CrawlProfileEditor_p.java, source/net/yacy/crawler/data/CrawlProfile.java |
Wed Oct 31 14:08:33 CET 2012 by Michael Peter Christen | in case that a crawl profile has a collection assigned, use the collection to show a name in the web interface. This should prevent that much too long names make the interface unusable. Changed Files: htroot/CrawlProfileEditor_p.java, htroot/IndexCreateQueues_p.java, source/net/yacy/crawler/CrawlStacker.java, source/net/yacy/crawler/CrawlSwitchboard.java, source/net/yacy/crawler/data/CrawlProfile.java, source/net/yacy/crawler/retrieval/Response.java, source/net/yacy/search/Switchboard.java |
Tue Oct 30 17:30:24 CET 2012 by Michael Peter Christen | enhaced data structures for balancer and latency computation which should produce a bit better prognosis about forced waiting times. Changed Files: htroot/IndexCreateQueues_p.java, source/net/yacy/crawler/Balancer.java, source/net/yacy/crawler/data/Latency.java, source/net/yacy/crawler/data/NoticedURL.java, source/net/yacy/crawler/robots/RobotsTxt.java, source/net/yacy/data/ymark/YMarkCrawlStart.java, source/net/yacy/peers/RemoteSearch.java |
Tue Oct 30 12:36:36 CET 2012 by Michael Peter Christen | removed options for stopwords which are not used Changed Files: htroot/CrawlStartExpert_p.html, htroot/CrawlStartSite_p.html, htroot/Crawler_p.java, htroot/QuickCrawlLink_p.java, source/net/yacy/crawler/CrawlSwitchboard.java, source/net/yacy/crawler/data/CrawlProfile.java, source/net/yacy/data/ymark/YMarkCrawlStart.java |
Tue Oct 30 12:27:22 CET 2012 by Michael Peter Christen | added the Google Search Appliance (GSA) api interface to the main menu. See: https://developers.google.com/search-appliance/documentation/68/xml_reference#request_overview Changed Files: htroot/env/templates/header.template, source/net/yacy/cora/federate/solr/responsewriter/GSAResponseWriter.java |
Tue Oct 30 12:26:32 CET 2012 by Michael Peter Christen | less latency Changed Files: source/net/yacy/crawler/data/Latency.java |
Tue Oct 30 11:28:49 CET 2012 by Michael Peter Christen | better balancing and duetime-cumputation also for no-delay intranet hosts Changed Files: htroot/PerformanceQueues_p.java, source/net/yacy/crawler/Balancer.java, source/net/yacy/crawler/data/Latency.java, source/net/yacy/crawler/data/NoticedURL.java, source/net/yacy/crawler/robots/RobotsTxtEntry.java, source/net/yacy/search/Switchboard.java, source/net/yacy/server/serverObjects.java |
Mon Oct 29 22:26:52 CET 2012 by Michael Peter Christen | disabled writing new entries to crawl stacks to prevent that a domain with many documents block refreshing of the crawl queue Changed Files: source/net/yacy/crawler/Balancer.java, source/net/yacy/search/index/Segment.java |
Mon Oct 29 21:42:31 CET 2012 by Michael Peter Christen | - fix for number of words log message - adding meta:refresh also to crawler stack Changed Files: source/net/yacy/document/Document.java, source/net/yacy/search/index/Segment.java |
Mon Oct 29 21:08:45 CET 2012 by Michael Peter Christen | - added concurrency for robots.txt loading - changed data model for domain counter Changed Files: htroot/CrawlProfileEditor_p.java, source/net/yacy/crawler/Balancer.java, source/net/yacy/crawler/CrawlStacker.java, source/net/yacy/crawler/data/CrawlProfile.java, source/net/yacy/crawler/data/NoticedURL.java, source/net/yacy/crawler/robots/RobotsTxt.java, source/net/yacy/search/Switchboard.java |
Mon Oct 29 11:35:24 CET 2012 by Michael Peter Christen | enhancement to solr caching: consider that during a get() the document is not in solr but the cache points out that a commit is needed to get the document. Changed Files: source/net/yacy/cora/federate/solr/connector/MirrorSolrConnector.java |
Mon Oct 29 11:27:13 CET 2012 by Michael Peter Christen | more auto-commit calls when a search interface is opened, but not when a search is done there to prevent blocking during search-time. Changed Files: htroot/HostBrowser.java, htroot/index.java, htroot/yacyinteractive.java, htroot/yacysearch.java, source/net/yacy/crawler/CrawlSwitchboard.java |
Mon Oct 29 01:51:19 CET 2012 by Michael Peter Christen | if a network configuration is choosed which does not allow DHT and no P2P communication is in robinson mode) then some menu entries are disabled which have no use in this mode. Changed Files: htroot/env/templates/header.template, htroot/env/templates/submenuAccessTracker.template, htroot/env/templates/submenuComputation.template, htroot/env/templates/submenuCrawlMonitor.template, htroot/env/templates/submenuIndexControl.template, source/net/yacy/server/http/HTTPDFileHandler.java |
Sun Oct 28 20:31:29 CET 2012 by Michael Peter Christen | enhanced solr caching: - increased cache size which is needed for longer solr commit time - speed hacks on cache write code Changed Files: htroot/PerformanceMemory_p.java, source/net/yacy/cora/federate/solr/connector/MirrorSolrConnector.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/index/Fulltext.java |
Sun Oct 28 11:29:53 CET 2012 by orbiter | moved static method from ClusteredScoreMap to MapDataMining because it was not used in the ClusteredScoreMap class but only in MapDataMining Changed Files: source/net/yacy/cora/sorting/ClusteredScoreMap.java, source/net/yacy/kelondro/blob/MapDataMining.java |
Fri Oct 26 18:50:45 CEST 2012 by reger | - optimize code of augmented parsing to enhence document tags - commented out augmentedparser.analyse (not function implemented yet) - adjust init of document title list to always use same list type Changed Files: source/net/yacy/document/Document.java, source/net/yacy/document/parser/augment/AugmentParser.java |
Fri Oct 26 15:35:42 CEST 2012 by Michael Peter Christen | force a commit in advance of a search for the administrator to get most recent results even if commit time is high and an indexing is ongoing. Changed Files: htroot/index.java, htroot/yacysearch.java, source/net/yacy/cora/federate/solr/connector/MirrorSolrConnector.java, source/net/yacy/search/index/Fulltext.java |
Fri Oct 26 07:39:07 CEST 2012 by Michael Peter Christen | added an option to force a commit to solr. may be used by a search front-end in case that the commitWithinMs time is too short to get recently indexed documents. Changed Files: source/net/yacy/cora/federate/solr/connector/MirrorSolrConnector.java, source/net/yacy/cora/federate/solr/connector/MultipleSolrConnector.java, source/net/yacy/cora/federate/solr/connector/RetrySolrConnector.java, source/net/yacy/cora/federate/solr/connector/ShardSolrConnector.java, source/net/yacy/cora/federate/solr/connector/SolrConnector.java, source/net/yacy/cora/federate/solr/connector/SolrServerConnector.java, source/net/yacy/search/index/Fulltext.java |
Fri Oct 26 02:12:45 CEST 2012 by sixcooler | rise commitWithinMs to default-value from SwitchBoard (result in lower hd-io) no dots in memory-graph (there are to much of them) Changed Files: defaults/yacy.init, source/net/yacy/peers/graphics/ProfilingGraph.java |
Thu Oct 25 21:40:27 CEST 2012 by orbiter | another performance and memory hack to graphics: this makes it possible to produce a 100-Megapixel png network graphic image on my 6 year old laptop in standard configuration in 10 seconds. Changed Files: source/net/yacy/visualization/RasterPlotter.java |
Thu Oct 25 18:38:39 CEST 2012 by Michael Peter Christen | - show more lines in online log - reverse order is default now Changed Files: defaults/yacy.logging, htroot/ViewLog_p.html, htroot/ViewLog_p.java, htroot/env/base.css, source/net/yacy/kelondro/logging/GuiHandler.java |
Thu Oct 25 18:20:05 CEST 2012 by Michael Peter Christen | more image processing hacks Changed Files: source/net/yacy/visualization/CircleTool.java, source/net/yacy/visualization/GraphPlotter.java, source/net/yacy/visualization/RasterPlotter.java |
Thu Oct 25 17:59:20 CEST 2012 by Michael Peter Christen | because the new PngEncoder had a problem with the PixelGrabber which is caused by a JRE bug, the PixelGrabber had to be circumvented using an own frame buffer which can be read without a PixelGrabber. This resulted in ultra-fast and much less memory-consuming transformation. YaCy images are now generated really fast! Changed Files: htroot/NetworkPicture.java, source/net/yacy/kelondro/util/ByteBuffer.java, source/net/yacy/peers/graphics/EncodedImage.java, source/net/yacy/server/http/HTTPDFileHandler.java, source/net/yacy/visualization/ChartPlotter.java, source/net/yacy/visualization/RasterPlotter.java |
Thu Oct 25 10:20:55 CEST 2012 by Michael Peter Christen | when a new crawl is started, an equal crawl, if still running, is terminated and the corresponding crawl profile is deleted (this also clears the crawl queue entries for that crawl profile) Changed Files: htroot/Crawler_p.java, source/net/yacy/crawler/data/CrawlProfile.java |
Thu Oct 25 10:18:28 CEST 2012 by Michael Peter Christen | the web structure image shows the pivot dot in a different color Changed Files: htroot/Crawler_p.html, htroot/HostBrowser.html, htroot/WatchWebStructure_p.html, htroot/WatchWebStructure_p.java, htroot/WebStructurePicture_p.java, source/net/yacy/visualization/GraphPlotter.java, source/net/yacy/visualization/PngEncoder.java |
Wed Oct 24 02:08:51 CEST 2012 by Michael Peter Christen | - prepared PngEncoder for concurrency: PixelGrabber.grabPixels is the main time-consuming process. This shall be done in concurrency. - added concurrent processes to call the PixelGrabber and framework to do that (queues) It is now possible to create 4k-Images (3840x2160) i.e. with the Network Graphics servlet Changed Files: source/net/yacy/visualization/PngEncoder.java |
Wed Oct 24 00:41:09 CEST 2012 by Michael Peter Christen | - new order of data computation: first compute the size of compressed deflater output, then assign an exact-sized byte[] which makes resizing afterwards superfluous - after all enhancements all class objects were removed; result is just one short static method - made objects final where possible Changed Files: source/net/yacy/visualization/PngEncoder.java, source/net/yacy/visualization/RasterPlotter.java |
Tue Oct 23 23:27:41 CEST 2012 by orbiter | added a 9-year old png encoder from David Eisenberg which I rewrote quite a bit to remove all code that handles transparency. With this highly specialized png writer it is possible to write png images much faster that with the JRE built-in png writer. In a second step it can be possible to add concurrency to increase computation speed further. Changed Files: source/net/yacy/visualization/PngEncoder.java, source/net/yacy/visualization/RasterPlotter.java |
Tue Oct 23 19:02:55 CEST 2012 by orbiter | added option to view the complete directory structure in host browser Changed Files: htroot/HostBrowser.html, htroot/HostBrowser.java |
Tue Oct 23 18:11:19 CEST 2012 by Michael Peter Christen | enhanced web structure images Changed Files: htroot/Crawler_p.html, htroot/WatchWebStructure_p.html, htroot/WatchWebStructure_p.java, htroot/WebStructurePicture_p.java, source/net/yacy/visualization/GraphPlotter.java |
Tue Oct 23 18:03:12 CEST 2012 by Michael Peter Christen | gsa results shall have only one title in metadata and that should be the visible title in the <title>-tag Changed Files: source/net/yacy/cora/federate/solr/responsewriter/GSAResponseWriter.java, source/net/yacy/document/parser/html/ContentScraper.java |
Tue Oct 23 03:49:27 CEST 2012 by sixcooler | whitelist yacyportalsearch aka search.yacy.net Changed Files: defaults/yacy.network.freeworld.unit |
Tue Oct 23 02:50:26 CEST 2012 by Michael Peter Christen | showing the web structure graph as animation in the crawl monitor Changed Files: htroot/Crawler_p.html, htroot/Crawler_p.java, htroot/QuickCrawlLink_p.java, htroot/WebStructurePicture_p.java, source/net/yacy/crawler/retrieval/HTTPLoader.java, source/net/yacy/peers/graphics/WebStructureGraph.java, source/net/yacy/visualization/GraphPlotter.java |
Mon Oct 22 22:48:35 CEST 2012 by reger | - fix: with augmented parsing = on; missing metadata in index (like title) due to overwriting metadata by adding multiple result docs from augmentparser with same url - fix Document.addsubdocuments: sections might be initialized as Arrays.toList which does not provide the used .addAll methode see e.g. http://kamleshkr.wordpress.com/2010/02/17/inside-java-arrays-aslistt-a/ Changed Files: source/net/yacy/document/Document.java, source/net/yacy/document/parser/augment/AugmentParser.java |
Mon Oct 22 16:23:39 CEST 2012 by Michael Peter Christen | enhanced webstructure image: introduced - multiple hosts can be listed (comma-separated) as host argument - new 'bf'-attribut (branch factor): the maximum number of edges per node - the bf-value is computed automatically - ordering of nodes when the graphic is drawed: mostly the drawing ends with an limitation eg. number of nodes. When this happens, it should be ensured that more 'interesting' nodes are painted in advance. This is now done by sorting all nodes by the number of links they have in de distant sub-graph. Changed Files: htroot/WebStructurePicture_p.java, source/net/yacy/peers/graphics/WebStructureGraph.java, source/net/yacy/visualization/GraphPlotter.java |
Sun Oct 21 20:05:28 CEST 2012 by sixcooler | smaller dhtDispatcher.cloudSize @Orbiter: we talked about this times ago - please revert if I'm wrong Changed Files: source/net/yacy/search/Switchboard.java |
Sun Oct 21 20:00:36 CEST 2012 by sixcooler | not hold a expensive cache of references for DHT-out,but but load them on demand see: http://forum.yacy-websuche.de/viewtopic.php?f=8&t=4530 Changed Files: htroot/IndexControlRWIs_p.java, source/net/yacy/peers/Protocol.java, source/net/yacy/peers/Transmission.java |
Sun Oct 21 03:00:05 CEST 2012 by reger | format crawler timeout output string in seconds (was days) Changed Files: htroot/SettingsAck_p.java |
Thu Oct 18 15:26:55 CEST 2012 by Michael Peter Christen | more custom field usage in gsa search result Changed Files: htroot/gsa/searchresult.java |
Thu Oct 18 15:09:04 CEST 2012 by Michael Peter Christen | - more refactoring / private methods - fix for usage of custom solr field names Changed Files: htroot/HostBrowser.java, source/net/yacy/cora/federate/solr/connector/AbstractSolrConnector.java, source/net/yacy/cora/federate/solr/connector/MirrorSolrConnector.java, source/net/yacy/kelondro/data/meta/URIMetadataNode.java, source/net/yacy/search/index/Fulltext.java, source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/query/SearchEvent.java, source/net/yacy/search/query/SnippetProcess.java |
Thu Oct 18 11:42:13 CEST 2012 by Michael Peter Christen | added a HostBrowser.xml api file and changed a bit of attribute naming Changed Files: htroot/HostBrowser.html, htroot/HostBrowser.java, htroot/HostBrowser.xml |
Wed Oct 17 00:44:16 CEST 2012 by Michael Peter Christen | added a shell script which can be used to delete the api action steering table. This may be necessary if the api is called by remote command and the recordings are not used. Then they can be deleted frequently by calling this clear command using a cron job Changed Files: bin/clearapi.sh |
Wed Oct 17 00:31:59 CEST 2012 by Michael Peter Christen | added a shell script which can be used to add a rss feed to the index. All pages linked in the rss feed are added. The process is not repeated automatically. If you want to repeat this, add the command to a cron job. Changed Files: bin/addrss.sh |
Tue Oct 16 18:26:21 CEST 2012 by Michael Peter Christen | specified more URIMetadata as URIMetadataNode Changed Files: htroot/IndexControlRWIs_p.java, source/net/yacy/search/index/DocumentIndex.java, source/net/yacy/search/query/RWIProcess.java, source/net/yacy/search/query/SnippetProcess.java, source/net/yacy/search/snippet/ResultEntry.java, source/net/yacy/search/snippet/TextSnippet.java |
Mon Oct 15 10:57:36 CEST 2012 by Michael Peter Christen | added date info in parser errors Changed Files: htroot/AccessTracker_p.java, htroot/IndexCreateParserErrors_p.html, htroot/IndexCreateParserErrors_p.java, source/net/yacy/cora/date/GenericFormatter.java |
Thu Oct 11 14:32:37 CEST 2012 by Michael Peter Christen | use less cache Changed Files: source/net/yacy/search/index/Fulltext.java |
Thu Oct 11 12:03:48 CEST 2012 by Michael Peter Christen | default cache size was much too high; decreased solr cache size Changed Files: source/net/yacy/cora/federate/solr/connector/AbstractSolrConnector.java |
Thu Oct 11 10:46:06 CEST 2012 by Michael Peter Christen | enhancement to post argument parsing - possible fix to zero-filled parameter values Changed Files: source/net/yacy/kelondro/util/FileUtils.java, source/net/yacy/server/http/HTTPDemon.java |
Thu Oct 11 10:17:05 CEST 2012 by Michael Peter Christen | less solr prefetch Changed Files: source/net/yacy/search/query/SearchEvent.java |
Wed Oct 10 02:02:17 CEST 2012 by Michael Peter Christen | added a crawl start checker which makes a simple analysis on the list of all given urls: shows if the url can be loaded and if there is a robots and/or a sitemap. Changed Files: htroot/CrawlCheck_p.html, htroot/CrawlCheck_p.java, htroot/Crawler_p.java, htroot/env/templates/submenuIndexCreate.template |
Wed Oct 10 00:09:27 CEST 2012 by Michael Peter Christen | moved the index deletion functions from IndexControlRWIs to IndexControlURLs where it appears more naturally. Because the RWI administration is less important in the presence of Solr, the IndexControlURL is now the default servlet when the Index Administration button on the main menu is selected. Changed Files: htroot/IndexControlRWIs_p.html, htroot/IndexControlRWIs_p.java, htroot/IndexControlURLs_p.html, htroot/IndexControlURLs_p.java, htroot/env/templates/header.template, htroot/env/templates/submenuIndexControl.template |
Tue Oct 09 20:02:58 CEST 2012 by reger | - add language detection from <html lang="xx"> tag - add jaudiotagger jar to Netbeans-IDE project classpath Changed Files: nbproject/project.xml, source/net/yacy/document/parser/html/ContentScraper.java |
Tue Oct 09 17:28:48 CEST 2012 by Michael Peter Christen | added Open Graph Metadata default fields, see http://ogp.me/ns# Changed Files: defaults/solr.keys.list, source/net/yacy/cora/federate/solr/YaCySchema.java, source/net/yacy/document/parser/html/ContentScraper.java, source/net/yacy/search/index/SolrConfiguration.java |
Tue Oct 09 13:02:43 CEST 2012 by Michael Peter Christen | added schema.org breadcrumb counter to parser and solr schema Changed Files: defaults/solr.keys.list, source/net/yacy/cora/federate/solr/YaCySchema.java, source/net/yacy/document/parser/html/ContentScraper.java, source/net/yacy/search/index/SolrConfiguration.java |
Tue Oct 09 12:14:28 CEST 2012 by Michael Peter Christen | replaced some more .getBytes() with UTF8/ASCII.getBytes() Changed Files: htroot/yacy/transferRWI.java, source/net/yacy/crawler/retrieval/FTPLoader.java, source/net/yacy/crawler/retrieval/FileLoader.java, source/net/yacy/crawler/retrieval/HTTPLoader.java, source/net/yacy/crawler/retrieval/Request.java, source/net/yacy/crawler/retrieval/Response.java, source/net/yacy/crawler/retrieval/SMBLoader.java, source/net/yacy/kelondro/util/ByteBuffer.java, source/net/yacy/search/snippet/MediaSnippet.java |
Tue Oct 09 11:24:48 CEST 2012 by Michael Peter Christen | added an url rewriter which can be used to remove session ids from urls Changed Files: source/net/yacy/crawler/retrieval/URLRewriterLibrary.java, source/net/yacy/document/LibraryProvider.java, source/net/yacy/search/Switchboard.java |
Mon Oct 08 19:47:14 CEST 2012 by orbiter | use links in AccessTracker Changed Files: htroot/AccessTracker_p.html |
Mon Oct 08 14:00:14 CEST 2012 by Michael Peter Christen | enhanced the host browser Changed Files: defaults/yacy.init, htroot/HostBrowser.java |
Thu Oct 04 21:12:09 CEST 2012 by reger | adjusted Netbeans-IDE classpath to current jars change solr jars to 3.6.1 (from 3.6.0) change lucene jars to 3.6.1 (from 3.6.0) added jsoup-1.6.3 Changed Files: nbproject/project.xml |
Thu Oct 04 20:57:29 CEST 2012 by reger | - add translation for ConfigHeuristics_p.html # section search-result - removed old/unused scroogle text Changed Files: locales/de.lng |
Wed Oct 03 02:15:02 CEST 2012 by sixcooler | bump to httpcore-4.2.2 (maintenance release) Changed Files: .classpath, addon/YaCy.app/Contents/Info.plist, build.xml, lib/dependencies.txt, lib/httpcore-4.2.2.License, lib/httpcore-4.2.2.jar, nbproject/project.xml |
Tue Oct 02 21:57:50 CEST 2012 by Michael Peter Christen | refactoring Changed Files: source/net/yacy/cora/language/synonyms/AutotaggingLibrary.java, source/net/yacy/document/LibraryProvider.java |
Tue Oct 02 21:18:27 CEST 2012 by Michael Peter Christen | added an option to start indexing right from the host browser Changed Files: htroot/HostBrowser.html, htroot/HostBrowser.java |
Tue Oct 02 14:29:45 CEST 2012 by Michael Peter Christen | added the usage of synonyms to the GSA search interface Changed Files: htroot/gsa/searchresult.java, source/net/yacy/cora/federate/solr/responsewriter/GSAResponseWriter.java, source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/query/QueryParams.java, source/net/yacy/server/serverObjects.java |
Tue Oct 02 11:13:06 CEST 2012 by Michael Peter Christen | turned the synonyms_t Text field into a multi-valued String field synonyms_sxt Changed Files: defaults/solr.keys.list, defaults/yacy.init, source/net/yacy/cora/federate/solr/YaCySchema.java, source/net/yacy/document/Condenser.java, source/net/yacy/search/index/Fulltext.java, source/net/yacy/search/index/SolrConfiguration.java |
Tue Oct 02 10:23:10 CEST 2012 by orbiter | ups, added missing class for last commit Changed Files: source/net/yacy/cora/language/synonyms/SynonymLibrary.java |
Mon Oct 01 14:16:49 CEST 2012 by Michael Peter Christen | added an underline text field to solr to record all underlined texts Changed Files: defaults/solr.keys.list, source/net/yacy/cora/federate/solr/YaCySchema.java, source/net/yacy/document/parser/html/ContentScraper.java, source/net/yacy/search/index/SolrConfiguration.java |
Sun Sep 30 13:23:06 CEST 2012 by orbiter | The HostBrowser now offers to index files that are discovered because they are linked in the web interface. Changed Files: htroot/HostBrowser.html, htroot/HostBrowser.java, htroot/env/base.css |
Sat Sep 29 02:13:11 CEST 2012 by Michael Peter Christen | fixed computation of links in host browser which are not indexed but knwon by the crawler. Such links are now displayed in grey color. Changed Files: htroot/HostBrowser.html, htroot/HostBrowser.java, htroot/env/base.css, source/net/yacy/search/index/SolrConfiguration.java |
Fri Sep 28 23:09:21 CEST 2012 by Michael Peter Christen | added nice links to the host browser: - click on the file icon to get the metadata of the file - click on the link icon behind the link to open the original file in the browser Changed Files: htroot/HostBrowser.html, htroot/env/grafics/link.gif |
Fri Sep 28 13:48:51 CEST 2012 by Michael Peter Christen | added lucene memory library which is now necessary as solr has to process more complex queries Changed Files: .classpath, addon/YaCy.app/Contents/Info.plist, build.xml, lib/lucene-memory-3.6.1.jar |
Fri Sep 28 09:00:40 CEST 2012 by Michael Peter Christen | another fix for the debian installer: the installer fails because some classes had unresolved dependencies. This fix removes the dependencies. Changed Files: source/net/yacy/cora/order/Base64Order.java, source/net/yacy/cora/order/Digest.java |
Thu Sep 27 12:02:24 CEST 2012 by Michael Peter Christen | allow Cross-Origin Resource Sharing for all stream servlets, that is the solr and the gsa search interface. That means that all JavaScript in browsers now can Cross-Origin access all YaCy search interfaces, which opens the option of 'YaCy Client in Browser' and 'End-Point Fail-over' concepts. Changed Files: htroot/Network.java, source/net/yacy/server/http/HTTPDFileHandler.java |
Thu Sep 27 00:31:59 CEST 2012 by Michael Peter Christen | fixed url search in IndexControlURLs_p.html / using now the solr interface Changed Files: htroot/IndexControlURLs_p.html |
Wed Sep 26 23:32:13 CEST 2012 by Michael Peter Christen | increased strength of crawling waves in network image Changed Files: source/net/yacy/peers/graphics/NetworkGraph.java |
Wed Sep 26 18:48:59 CEST 2012 by Michael Peter Christen | force usage of default faceting mechanisms for search Changed Files: source/net/yacy/search/Switchboard.java |
Wed Sep 26 18:36:32 CEST 2012 by Michael Peter Christen | - better date ranking - more protection against NPE and time travel effects Changed Files: htroot/yacysearch.java, source/net/yacy/document/content/DCEntry.java, source/net/yacy/kelondro/data/meta/URIMetadataNode.java, source/net/yacy/peers/Protocol.java, source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/ranking/RankingProfile.java |
Wed Sep 26 16:56:33 CEST 2012 by Michael Peter Christen | - if a "/date" modifier is used, the solr remote query applies an ordering by date (ascending) - added also some 'anti-timetravel' protection (check if date is in the future within any metadata date field) Changed Files: source/net/yacy/cora/protocol/ResponseHeader.java, source/net/yacy/kelondro/data/meta/URIMetadataNode.java, source/net/yacy/kelondro/data/meta/URIMetadataRow.java, source/net/yacy/search/index/Fulltext.java, source/net/yacy/search/index/Segment.java, source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/query/QueryParams.java |
Wed Sep 26 16:05:11 CEST 2012 by Michael Peter Christen | We assert that no other metadata storage than solr is used now. Therefore a property like solrConnected() must be true all the time. Removal of this method causes removal of all write operations to the old metadata index. Changed Files: htroot/IndexControlRWIs_p.java, source/net/yacy/search/Switchboard.java, source/net/yacy/search/index/Fulltext.java, source/net/yacy/search/index/Segment.java |
Wed Sep 26 15:44:50 CEST 2012 by Michael Peter Christen | made the index schema retrieval public and allow cross-domain retrieval Changed Files: htroot/IndexFederated_p.html, htroot/api/schema.java, htroot/api/schema.xml |
Wed Sep 26 15:33:37 CEST 2012 by Michael Peter Christen | enhanced snippet extractor to find snippets also inside of tokens of an url Changed Files: source/net/yacy/search/snippet/TextSnippet.java |
Wed Sep 26 14:05:33 CEST 2012 by sixcooler | added filename for missing crawlname when crawling from file Changed Files: htroot/Crawler_p.java |
Wed Sep 26 14:03:51 CEST 2012 by sixcooler | pdf- and zipParser should not use forced Memory-Limits Changed Files: source/net/yacy/document/parser/pdfParser.java, source/net/yacy/document/parser/zipParser.java |
Wed Sep 26 10:36:09 CEST 2012 by apfelmaennchen | adding CORS access header for Network.xml to overcome cross domain restriction (e.g. necessary to build a JavaScript YaCy client). Changed Files: htroot/Network.java |
Tue Sep 25 23:59:30 CEST 2012 by Michael Peter Christen | better abstraction for solr query params Changed Files: source/net/yacy/search/query/QueryParams.java |
Tue Sep 25 23:59:09 CEST 2012 by Michael Peter Christen | - fix for NPEs during remote solr configuration - fixed remote solr setting switch - added more logging Changed Files: htroot/IndexFederated_p.java, source/net/yacy/cora/federate/solr/connector/RemoteSolrConnector.java, source/net/yacy/cora/federate/solr/connector/SolrServerConnector.java, source/net/yacy/peers/Protocol.java |
Tue Sep 25 23:09:32 CEST 2012 by Michael Peter Christen | added dummy update servlet Changed Files: htroot/solr/update.java |
Tue Sep 25 21:09:06 CEST 2012 by Michael Peter Christen | removed tenant query attribute since it is not used any more and is replaced by the site-operator in the GSA interface. This operator can also be simulated in the Solr interface using the collections_sxt field. Changed Files: htroot/yacy/search.java, htroot/yacysearch.java, source/net/yacy/cora/services/federated/solr/connector/ShardSolrConnector.java, source/net/yacy/search/query/QueryParams.java |
Tue Sep 25 17:52:33 CEST 2012 by Michael Peter Christen | using the search filter to drill down search to file types. A search like "mp3 filetype:mp3" will now maybe surprise you. Changed Files: source/net/yacy/search/index/Segment.java, source/net/yacy/search/query/QueryParams.java |
Tue Sep 25 12:19:24 CEST 2012 by Michael Peter Christen | more cleaning (yacy-cora) Changed Files: build.xml |
Tue Sep 25 00:28:20 CEST 2012 by Michael Peter Christen | added the indexrestore.sh script which must be called with the path of the index dump. This is the reverse of indexdump.sh which takes the output of indexdump.sh as input to restore an index. Now it should be possible to transfer a complete YaCy Solr index from one peer yacy1 to another peer yacy2 with the following command: yacy2/bin/indexrestore.sh ´yacy1/bin/indexdump.sh´ Changed Files: bin/indexdump.sh, bin/indexrestore.sh |
Tue Sep 25 00:19:52 CEST 2012 by Michael Peter Christen | - added xml output in IndexControlURLs to get the storage page of index dump commands - adjusted the apicall.sh script to get the downloaded text as output to stdout which is necessary to parse the content out of it - added indexdump.sh script which creates a solr dump and prints out the storage path for the index dump - added synchronization to the Fulltext class to prevent that data is stored to a non-existing solr index while this index is disabled during the storage of the dump Changed Files: bin/apicall.sh, bin/indexdump.sh, htroot/IndexControlURLs_p.xml, source/net/yacy/search/index/Fulltext.java |
Mon Sep 24 17:05:28 CEST 2012 by Michael Peter Christen | used the new zip writer/reader to add a solr dump process: the whole solr index can be written to a zip dump and also restored during runtime Changed Files: htroot/IndexControlURLs_p.html, htroot/IndexControlURLs_p.java, htroot/IndexFederated_p.html, source/net/yacy/cora/services/federated/solr/EmbeddedSolrConnector.java, source/net/yacy/search/index/Fulltext.java |
Mon Sep 24 17:04:37 CEST 2012 by Michael Peter Christen | added a directory-to-zip writer and zip-to-directory reader Changed Files: source/net/yacy/cora/storage/ZIPReader.java, source/net/yacy/cora/storage/ZIPWriter.java |
Mon Sep 24 15:01:44 CEST 2012 by Michael Peter Christen | a bit more logging Changed Files: source/net/yacy/server/http/HTTPDemon.java |
Mon Sep 24 12:01:09 CEST 2012 by Michael Peter Christen | simplifications in DHT Distribution class and more documentation Changed Files: source/net/yacy/cora/document/ASCII.java, source/net/yacy/cora/services/federated/yacy/Distribution.java, source/net/yacy/peers/DHTSelection.java, source/net/yacy/peers/Dispatcher.java, source/net/yacy/peers/graphics/NetworkGraph.java, source/net/yacy/server/http/TemplateEngine.java |
Mon Sep 24 01:04:39 CEST 2012 by Michael Peter Christen | simplified DHT classes Changed Files: htroot/yacy/transferRWI.java, source/net/yacy/cora/services/federated/yacy/Distribution.java, source/net/yacy/peers/DHTSelection.java, source/net/yacy/peers/Dispatcher.java, source/net/yacy/peers/Seed.java, source/net/yacy/peers/SeedDB.java, source/net/yacy/peers/graphics/NetworkGraph.java, source/net/yacy/search/query/SearchEvent.java |
Sat Sep 22 11:10:11 CEST 2012 by orbiter | added new classes to renovate the YaCy protocol based on simple data structures in cora: - added the Peer object, which is a fresh version of Seed - added the Peers object, which is a fresh version of Network - added the Network api access class to retrieve a list of peers based on the Network.xml servlet in all YaCy peers. Changed Files: source/net/yacy/cora/services/federated/yacy/Peer.java, source/net/yacy/cora/services/federated/yacy/Peers.java, source/net/yacy/cora/services/federated/yacy/api/Network.java |
Fri Sep 21 21:38:50 CEST 2012 by orbiter | fixed mistake in wt-option which caused that the yacy json format overlapped the solr built-in json format Changed Files: htroot/solr/select.java |
Fri Sep 21 15:48:40 CEST 2012 by Michael Peter Christen | added HostBrowser servlet (stub) Changed Files: htroot/HostBrowser.html, htroot/HostBrowser.java |
Thu Sep 20 18:45:51 CEST 2012 by orbiter | more ignore Changed Files: .gitignore |
Thu Sep 20 18:29:04 CEST 2012 by orbiter | added ftp to getName Changed Files: source/net/yacy/cora/document/MultiProtocolURI.java |
Thu Sep 20 15:02:57 CEST 2012 by cominch | change parameter to support the smw extension for list import Changed Files: source/net/yacy/interaction/contentcontrol/ContentControlImportThread.java |
Tue Sep 18 22:31:01 CEST 2012 by orbiter | full memory usage for debian and when changing the size: debian seems to dislike the big difference between xmx and xms (I have crashes here which stop if both values are same) Changed Files: debian/postinst, htroot/PerformanceQueues_p.java |
Sun Sep 16 21:27:55 CEST 2012 by orbiter | added new crawl options: - indexUrlMustMatch and indexUrlMustNotMatch which can be used to select loaded pages for indexing. Default patterns are in such a way that all loaded pages are also indexed (as before) but when doing an expert crawl start, then the user may select only specific urls to be indexed. - crawlerNoDepthLimitMatch is a new pattern that can be used to remove the crawl depth limitation. This filter a never-match by default (which causes that the depth is used) but the user can select paths which will be loaded completely even if a crawl depth is reached. Changed Files: htroot/CrawlStartExpert_p.html, htroot/CrawlStartExpert_p.java, htroot/Crawler_p.java, source/net/yacy/search/Switchboard.java |
Sun Sep 16 21:22:56 CEST 2012 by orbiter | fixed the size() method which counted also failed pages (which are also inside the solr index) Changed Files: source/net/yacy/cora/services/federated/solr/AbstractSolrConnector.java, source/net/yacy/cora/services/federated/solr/SolrServerConnector.java |
Fri Sep 14 16:49:29 CEST 2012 by Michael Peter Christen | added new crawl attributes in crawl profile (not active yet) Changed Files: htroot/CrawlProfileEditor_p.java, htroot/Crawler_p.java, htroot/QuickCrawlLink_p.java, source/de/anomic/crawler/CrawlProfile.java, source/de/anomic/crawler/CrawlSwitchboard.java, source/de/anomic/data/ymark/YMarkCrawlStart.java, source/net/yacy/search/Switchboard.java |
Fri Sep 14 12:09:20 CEST 2012 by Michael Peter Christen | added default facet fields for json response format (stub) Changed Files: htroot/gsa/searchresult.java, htroot/solr/select.java, source/de/anomic/server/serverObjects.java, source/net/yacy/cora/services/federated/solr/EnhancedXMLResponseWriter.java, source/net/yacy/cora/services/federated/solr/JsonResponseWriter.java, source/net/yacy/cora/services/federated/solr/OpensearchResponseWriter.java |
Fri Sep 14 12:06:06 CEST 2012 by Michael Peter Christen | added missing license headers Changed Files: htroot/Trails.java, htroot/Triple_p.java, htroot/Triplestore_p.java |
Fri Sep 14 12:04:54 CEST 2012 by Michael Peter Christen | added a regular expression test servlet which is linked within the parser/crawler error page whenever a problem with regular expression occurs. This makes it easy to correct and enhance the must-match and must-not-match patterns just by trying out which pattern could be correct. Changed Files: htroot/IndexCreateParserErrors_p.java, htroot/RegexTest.html, htroot/RegexTest.java, htroot/env/grafics/nok.png, source/de/anomic/crawler/CrawlStacker.java |
Thu Sep 13 23:53:53 CEST 2012 by orbiter | added twitter search heuristic Changed Files: defaults/yacy.init, htroot/ConfigHeuristics_p.html, htroot/ConfigHeuristics_p.java, htroot/ConfigNetwork_p.java, htroot/yacysearch.java, source/net/yacy/search/Switchboard.java |
Tue Sep 11 23:28:21 CEST 2012 by Michael Peter Christen | - some corrections in usage of getFile() and getFileName() - added more attributes in json response writer according to yacy servlet Changed Files: htroot/yacysearchitem.java, source/de/anomic/crawler/ResultImages.java, source/net/yacy/cora/document/MultiProtocolURI.java, source/net/yacy/cora/services/federated/solr/JsonResponseWriter.java, source/net/yacy/document/parser/html/ContentScraper.java, source/net/yacy/search/snippet/ResultEntry.java |
Tue Sep 11 22:46:39 CEST 2012 by Michael Peter Christen | added the protocol and the file name extension to the solr fields since these fields are probably facets in file search Changed Files: defaults/solr.keys.list, source/net/yacy/cora/services/federated/solr/JsonResponseWriter.java, source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/index/YaCySchema.java |
Tue Sep 11 22:28:10 CEST 2012 by Michael Peter Christen | no complaints about memory if the database is empty Changed Files: source/net/yacy/kelondro/table/Table.java |
Tue Sep 11 20:15:54 CEST 2012 by Michael Peter Christen | activate two solr fields which will be used by administration interface (later) Changed Files: defaults/solr.keys.list, source/net/yacy/cora/services/federated/solr/GSAResponseWriter.java, source/net/yacy/search/index/SolrConfiguration.java |
Tue Sep 11 09:15:47 CEST 2012 by orbiter | added facet stub in JsonResponseWriter Changed Files: source/net/yacy/cora/services/federated/solr/JsonResponseWriter.java |
Tue Sep 11 03:02:02 CEST 2012 by Michael Peter Christen | enhanced solr writers Changed Files: source/net/yacy/cora/services/federated/solr/EnhancedXMLResponseWriter.java, source/net/yacy/cora/services/federated/solr/JsonResponseWriter.java, source/net/yacy/cora/services/federated/solr/OpensearchResponseWriter.java |
Tue Sep 11 02:03:14 CEST 2012 by Michael Peter Christen | added search functionality to ViewFile.html servlet Changed Files: htroot/Crawler_p.java, htroot/ViewFile.html, source/de/anomic/http/server/HTTPDFileHandler.java |
Mon Sep 10 15:20:55 CEST 2012 by Michael Peter Christen | - added collections to yacydoc - changed yacydoc.htm to yacydoc.json - added query logging in solr and gsa search result Changed Files: htroot/api/yacydoc.html, htroot/api/yacydoc.java, htroot/api/yacydoc.xml, htroot/gsa/searchresult.java, htroot/interaction_elements/Tag_part.html, htroot/solr/select.java |
Mon Sep 10 14:30:44 CEST 2012 by Michael Peter Christen | - added a json writer for solr (yes there was one using xslt but this one writes the same way as yacysearch.json) - using the new json solr result to change the ajax search in IndexControlURLs to the new solr search Changed Files: htroot/IndexControlURLs_p.html, htroot/solr/select.java, source/de/anomic/server/serverObjects.java, source/net/yacy/cora/services/federated/solr/JsonResponseWriter.java, source/net/yacy/cora/services/federated/solr/OpensearchResponseWriter.java |
Mon Sep 10 08:10:53 CEST 2012 by Michael Peter Christen | Merge remote-tracking branch 'reger/master' Changed Files: source/de/anomic/data/Translator.java |
Mon Sep 10 07:15:52 CEST 2012 by Michael Peter Christen | removed warnings Changed Files: htroot/api/ymarks/get_ymark.java, source/de/anomic/data/ymark/YMarkCrawlStart.java, source/net/yacy/dbtest.java, source/net/yacy/kelondro/blob/TablesColumnBLOBIndex.java, source/net/yacy/kelondro/util/MemoryStrategy.java, source/net/yacy/search/query/SearchEvent.java, source/net/yacy/yacy.java |
Sun Sep 09 22:56:24 CEST 2012 by apfelmaennchen | Added more sophisticated RDF output for YMarks, including the folder structure (b:Topic) and support for multiple tags (dc:subject) and folders (b:hasTopic) via rdf:Bag container. Changed Files: htroot/YMarks.java, source/de/anomic/data/ymark/YMarkRDF.java |
Sun Sep 09 06:15:25 CEST 2012 by reger | keep input order of translation entries within one file section. Allowing on translation conflicts (translaton of words contained in other sentence) to put shorter key at the end of the translation list. Changed Files: source/de/anomic/data/Translator.java |
Fri Sep 07 22:06:51 CEST 2012 by Michael Peter Christen | added Solr fields: inboundlinks_text_chars_val inboundlinks_text_words_val inboundlinks_alttag_txt outboundlinks_text_chars_val outboundlinks_text_words_val outboundlinks_alttag_txt Changed Files: defaults/solr.keys.list, source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/index/YaCySchema.java |
Fri Sep 07 21:33:45 CEST 2012 by orbiter | added solr field images_withalt_i Changed Files: defaults/solr.keys.list, source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/index/YaCySchema.java |
Thu Sep 06 22:35:55 CEST 2012 by orbiter | added disjunction '|' option to site parameter in GSA API Changed Files: htroot/gsa/searchresult.java |
Thu Sep 06 22:10:03 CEST 2012 by sixcooler | clear fulltext-cache and stop crawling if running out of memory Changed Files: source/de/anomic/crawler/ResourceObserver.java |
Thu Sep 06 22:07:07 CEST 2012 by sixcooler | also do a clearcache on the solr-connector-caches Changed Files: source/net/yacy/search/index/Fulltext.java |
Thu Sep 06 22:02:29 CEST 2012 by sixcooler | statistics for solr-cache Changed Files: htroot/PerformanceMemory_p.html, htroot/PerformanceMemory_p.java, source/net/yacy/cora/services/federated/solr/MirrorSolrConnector.java |
Tue Sep 04 14:47:53 CEST 2012 by Michael Peter Christen | added collections to crawl monitor Changed Files: htroot/CrawlResults.html, htroot/CrawlResults.java, htroot/ViewFile.html, htroot/ViewFile.java, source/de/anomic/crawler/ResultURLs.java, source/net/yacy/kelondro/data/meta/URIMetadata.java, source/net/yacy/kelondro/data/meta/URIMetadataNode.java, source/net/yacy/kelondro/data/meta/URIMetadataRow.java, source/net/yacy/search/index/Segment.java |
Tue Sep 04 14:11:11 CEST 2012 by Michael Peter Christen | added h1..h6 counter fields Changed Files: defaults/solr.keys.list, source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/index/YaCySchema.java |
Mon Sep 03 15:27:47 CEST 2012 by Michael Peter Christen | add nice submit buttons to pdblue skin Changed Files: skins/pdblue.css |
Mon Sep 03 15:26:47 CEST 2012 by Michael Peter Christen | dependency is java6 only Changed Files: debian/control |
Sat Sep 01 14:17:20 CEST 2012 by apfelmaennchen | removed jquery.slider as it is already included as part of jquery-ui package Changed Files: htroot/env/templates/jqueryheader.template |
Sat Sep 01 10:25:22 CEST 2012 by apfelmaennchen | removed unused jquery plugin slider as it is part of jquery-ui package Changed Files: htroot/env/templates/jqueryheader.template |
Fri Aug 31 15:16:33 CEST 2012 by Michael Peter Christen | added synchronization to solr server requests since lucene is not thread-safe. We experienced problems as described in http://stackoverflow.com/questions/5327978/lockobtainfailedexception-updating-lucene-search-index-using-solr Changed Files: source/net/yacy/cora/services/federated/solr/SolrServerConnector.java |
Fri Aug 31 14:07:33 CEST 2012 by Michael Peter Christen | removed many warnings Changed Files: htroot/Table_YMark_p.java, htroot/api/ymarks/add_ymark.java, htroot/api/ymarks/import_ymark.java, source/de/anomic/data/ymark/YMarkAutoTagger.java, source/de/anomic/data/ymark/YMarkSMWJSONImporter.java, source/de/anomic/data/ymark/YMarkTables.java, source/net/yacy/interaction/contentcontrol/ContentControlFilterUpdateThread.java, source/net/yacy/interaction/contentcontrol/ContentControlImportThread.java |
Fri Aug 31 14:00:53 CEST 2012 by Michael Peter Christen | - moved the gsa search interface from /gsa/searchresult? to /gsa/search? - fixed the NB field data Changed Files: htroot/gsa/searchresult.java, source/de/anomic/http/server/HTTPDFileHandler.java, source/net/yacy/cora/services/federated/solr/GSAResponseWriter.java |
Wed Aug 29 16:48:53 CEST 2012 by Michael Peter Christen | fixed problems with GSA api: - better FS attribute - highlightning of searched words in title Changed Files: htroot/gsa/searchresult.java, source/net/yacy/cora/services/federated/solr/GSAResponseWriter.java |
Wed Aug 29 16:28:32 CEST 2012 by Michael Peter Christen | - fixed num parameter in GSA api - changed FS attribute in GSA api Changed Files: htroot/gsa/searchresult.java, source/net/yacy/cora/protocol/HeaderFramework.java, source/net/yacy/cora/services/federated/solr/GSAResponseWriter.java |
Wed Aug 29 16:11:23 CEST 2012 by Michael Peter Christen | added new field for solr: url_paths_sxt url_parameter_i url_parameter_key_sxt url_parameter_value_sxt url_chars_i Changed Files: defaults/solr.keys.list, source/net/yacy/cora/document/MultiProtocolURI.java, source/net/yacy/search/index/SolrConfiguration.java, source/net/yacy/search/index/YaCySchema.java |
Wed Aug 29 09:52:14 CEST 2012 by cominch | content control: apply filter if enabled to crawls Changed Files: source/de/anomic/crawler/CrawlStacker.java, source/de/anomic/data/ymark/YMarkTables.java |
Mon Aug 27 16:56:22 CEST 2012 by orbiter | Merge commit '65d49df865f60511d22d86fb15c33a082176e7ab' Changed Files: htroot/api/yacydoc.html, source/net/yacy/search/Switchboard.java |
Mon Aug 27 15:25:25 CEST 2012 by Michael Peter Christen | added boosts to solr search queries Changed Files: source/net/yacy/search/query/QueryParams.java |
Mon Aug 27 14:41:47 CEST 2012 by Michael Peter Christen | switched off some solr logging Changed Files: defaults/yacy.logging |
Mon Aug 27 12:15:42 CEST 2012 by Michael Peter Christen | added gsa result attribute 'has' Changed Files: source/net/yacy/cora/services/federated/solr/GSAResponseWriter.java |
Sun Aug 26 22:28:14 CEST 2012 by reger | security fix: clear automtic password only if adminAccountForLocalhost=false to prevent remote access to protected pages after restart. if adminAccountForLocalhost=true leave automatic password unchanged so access from local host is granted but remote access is preventet from the 1st second. Changed Files: source/net/yacy/search/Switchboard.java |
Sun Aug 26 17:46:40 CEST 2012 by orbiter | - correct length computation for BStringObject (bugfix suggested by apfelmaennchen) - using ASCII for string conversion for Strings generated from Integer Changed Files: source/de/anomic/crawler/RobotsTxtEntry.java, source/de/anomic/data/WorkTables.java, source/net/yacy/kelondro/blob/Tables.java, source/net/yacy/kelondro/util/BDecoder.java |
Sat Aug 25 19:08:42 CEST 2012 by orbiter | - added hack to prevent that stream servlet paths are not parsed wrongly if the path contains a dot. - added also warnings if documents are requests which do not exist. Changed Files: source/de/anomic/http/server/HTTPDFileHandler.java |