Jump to content


FCP Euro

Major Search Changes


  • Please log in to reply
6 replies to this topic

#1 Che'_Moderator

Che'_Moderator

    Monkey Master

  • Admin
  • 24,209 posts
  • Gender:Male
  • Interests:Things I am interested in
  • Location:In Your Bushes
  • Crew:___

Posted 03 March 2011 - 04:03 PM

After some urging and a lot of research, Volvospeed pulled the trigger and converted their search to Sphinx. For those interested this is from the sphinx website:

Sphinx overview

Sphinx is an open source full text search server, designed from the ground up with performance, relevance (aka search quality), and integration simplicity in mind. It's written in C++ and works on Linux (RedHat, Ubuntu, etc), Windows, MacOS, Solaris, FreeBSD, and a few other systems.

Sphinx lets you either batch index and search data stored in an SQL database, NoSQL storage, or just files quickly and easily — or index and search data on the fly, working with Sphinx pretty much as with a database server.

A variety of text processing features enable fine-tuning Sphinx for your particular application requirements, and a number of relevance functions ensures you can tweak search quality as well.

Searching via SphinxAPI is as simple as 3 lines of code, and querying via SphinxQL is even simpler, with search queries expressed in good old SQL.

Sphinx clusters scale up to billions of documents and tens of millions search queries per day, powering top websites such as Craigslist, DailyMotion, NetLog, etc.

And last but not least, it's open-sourced under GPLv2, and the community edition is free to use.

Performance and scalability

* Indexing performance. Sphinx indexes up to 10-15 MB of text per second per single CPU core, that is 60+ MB/sec per server (on a dedicated indexing machine).
* Searching performance. Searching through 1,000,000-document, 1.2 GB text collection that we use for everyday development and testing runs at 500+ queries/sec on a 2-core desktop machine with 2 GB of RAM.
* Scalability. Biggest known Sphinx cluster indexes almost 5 billion documents, resulting in over 6 TB of data. Busiest known one is, unsurpisingly, Craigslist, top-10 website in the US that serves 50+ million search queries/day.

Key features

* Batch and Real-Time full-text indexes. Two index backends that support both efficient offline index construction andincremental on-the-fly index updates are available.
* Non-text attributes support. An arbitrary number of attributes (product IDs, company names, prices, etc) can be stored in the index and used either just for retrieveal (to avoid hitting the DB), or for efficient Sphinx-side search result set post-processing.
* SQL database indexing. Sphinx can directly access and index data stored in MySQL (all storage engines are supported), PostgreSQL, Oracle, Microsoft SQL Server, SQLite, Drizzle, and anything else that supports ODBC.
* Non-SQL storage indexing. Data can also be streamed to batch indexer in a simple XML format called XMLpipe, or inserted directly into an incremental RT index.
* Easy application integration. Sphinx comes with three different APIs, SphinxAPI, SphinxSE, and SphinxQL. SphinxAPI is a native library available for Java, PHP, Python, Perl, C, and other languages. SphinxSE, a pluggable storage engine for MySQL, enables huge result sets to be shipped directly to MySQL server for post-processing. SphinxQL lets the application query Sphinx using standard MySQL client libary and query syntax.
* Advanced full-text searching syntax. Our querying engine supports arbitrarily complex queries combining boolean operators, phrase, proximity, strict order, and quorum matching, field and position limits, exact keyword form matching, substring searches, etc.
* Rich database-like querying features. Sphinx does not limit you to just keyword searching. On top of full-text search result set, you can compute arbitrary arithmetic expressions, add WHERE conditions, do ORDER BY, GROUP BY, use MIN/MAX/AVG/SUM, aggregates etc. Essentially, full-blown SQL SELECT is supported.
* Better relevance ranking. Unlike many other engines, Sphinx does not solely rely on 30-year-old statistical ranking that only considers keyword frequencies, nor limits you to it. By default, Sphinx additionally analyzes keyword proximity, and ranks closer phrase matches higher, with perfect matches ranked on top. Also, ranking is flexible: you can choose from a number of built-in relevance functions, tweak their weights by using expressions, or develop new ones.
* Flexible text processing. Sphinx indexing features include full support for SBCS and UTF-8 encodings (meaning that effectively all world's languages are supported); stopword removal and optional hit position removal (hitless indexing); morphology and synonym processing through word forms dictionaries and stemmers; exceptions and blended characters; and many more.
* Distributed searching. Searches can be distributed across multiple machines, enabling horizontal scale-out and HA (High Availability).


That this means for you:

A lot of people over the years wanted 3 letter searching which I have turned off as Volvospeed felt full text searching was more value. The issue was not a limitation of the forum, but a short coming of php. You can now search for 3 letter terms.

In addidtion this update had a weird effect on the other sites on this server..... Sorry Tom, sorry Mike. I am not sure if they are related to Sphinx, they probably are, but I am looking into them now.



#2 theForgottenone

theForgottenone

    Level 4 Member

  • Gold Member
  • 3,977 posts
  • Gender:Male
  • Location:P-town
  • Crew:NWC

Posted 03 March 2011 - 05:14 PM

yeah, woo hoo for 3 letters searching. short term affect, our site is down :ph34r:
I wanted an EVO but ended up with EVO chassis only

This is where I hangout http://www.30forty50.com/forum/

#3 Owned

Owned

    Adam #1

  • OH Moderator
  • 5,896 posts
  • Gender:Male
  • Interests:$
  • Location:$
  • Crew:___

Posted 03 March 2011 - 05:52 PM

Sweet!

I had to register my basement as an NFA item.
Posted Image


#4 Stormtrooper1

Stormtrooper1

    Level 4 Member

  • Lifetime Supporter
  • 4,094 posts
  • Gender:Male
  • Interests:Photography, The Internet, My Volvos, Random Home Improvement, Should I keep Going?
  • Location:St Paul, Minnesota
  • Crew:MWC

Posted 03 March 2011 - 08:06 PM

Woot Woot!
Member:
MN CREW | T.A.P.


1998 SAFFRON V70 R | 1998 Black V70

Former Rides: 1999 Silver V70 XC | 1999 Bronzemsit Pontiac Grand Prix GT | 1999 Black Cadillac Sedan Deville |
1993 White Ford Mustang | 1989 Burgandy Ford Mustang | 1976 Green Cadillac Coupe DeVille | 1983 Tan Chevrolet Celebrity

Have I Mentioned Recently that I despise Best Buy?!?

#5 Fudge_Brownie

Fudge_Brownie

    Hippie Exterminator

  • Moderator
  • 8,429 posts
  • Gender:Male
  • Interests:Cars and Computers
  • Location:Bristol County, MA
  • Crew:NEC

Posted 04 March 2011 - 02:32 PM

Quitter!
Posted Image
I really need to make a new sig pic. Nahhhhhhhh
98 S70 T5 | 3Barracing MBC @ 12psi, BSR Cat-back, UR Pulley
86 Kawasaki Gpz900R Ninja

#6 Che'_Moderator

Che'_Moderator

    Monkey Master

  • Admin
  • 24,209 posts
  • Gender:Male
  • Interests:Things I am interested in
  • Location:In Your Bushes
  • Crew:___

Posted 04 March 2011 - 03:05 PM

View PostFudge_Brownie, on 04 March 2011 - 02:32 PM, said:

Quitter!

Agreed, but it cut server load by a ton. In addition I now get to hear about how the deltas cause some weird delays.

#7 Fudge_Brownie

Fudge_Brownie

    Hippie Exterminator

  • Moderator
  • 8,429 posts
  • Gender:Male
  • Interests:Cars and Computers
  • Location:Bristol County, MA
  • Crew:NEC

Posted 05 March 2011 - 01:53 AM

View PostChe, on 04 March 2011 - 03:05 PM, said:

Agreed, but it cut server load by a ton. In addition I now get to hear about how the deltas cause some weird delays.
and if I write some perl to issue 3-letter queries all day long? :lol:
Posted Image
I really need to make a new sig pic. Nahhhhhhhh
98 S70 T5 | 3Barracing MBC @ 12psi, BSR Cat-back, UR Pulley
86 Kawasaki Gpz900R Ninja




1 user(s) are reading this topic

0 members, 1 guests, 0 anonymous users

IPD Volvo Parts


Copyright 2012 Volvospeed