Saturday, 26 September 2015

tf–idf - best known weighting scheme in information retrieval

tf–idf, short for term frequency–inverse document frequency.
Best known weighting scheme in information retrieval
Good explanation:

Friday, 8 May 2015

I have just released Allegro OpenSource: Elasticsearch reindex tool

Friday, 6 March 2015

Logstash roadmap

Logstash is a tool for managing events and logs. You can use it to collect logs, parse them, and store them for later use (like, for searching). Speaking of searching, logstash comes with a web interface for searching and drilling into all of your logs.
It is fully free and fully open source. The license is Apache 2.0, meaning you are pretty much free to use it however you want in whatever way.
Now we have also Logstash roadmap

Spring Cloud 1.0.0 Available Now

Spring Cloud provides tools for developers to quickly build some of the common patterns in distributed systems (e.g. configuration management, service discovery, circuit breakers, intelligent routing, micro-proxy, control bus, one-time tokens, global locks, leadership election, distributed sessions, cluster state). Coordination of distributed systems leads to boiler plate patterns, and using Spring Cloud developers can quickly stand up services and applications that implement those patterns. They will work well in any distributed environment, including the developer’s own laptop, bare metal data centres, and managed platforms such as Cloud Foundry.

More info here:

Good article about lucene internals

You can read about lucene implementation concerning posting lists and filters here: