Laravel 4.2 package for full-text search over Eloquent models based on ZendSearch Lucene.
Require this package in your composer.json and run composer update:
{
"require": {
"nqxcode/laravel-lucene-search": "1.*"
}
}
After updating composer, add the ServiceProvider to the providers array in app/config/app.php
'providers' => [
'Nqxcode\LuceneSearch\ServiceProvider',
],
If you want to use the facade to search, add this to your facades in app/config/app.php
:
'aliases' => [
'Search' => 'Nqxcode\LuceneSearch\Facade',
],
Publish the config file into your project by running:
php artisan config:publish nqxcode/laravel-lucene-search
In published config file add descriptions for models which need to be indexed, for example:
'index' => [
// ...
'namespace\FirstModel' => [
'fields' => [
'name', 'full_description', // fields for indexing
]
],
'namespace\SecondModel' => [
'fields' => [
'name', 'short_description', // fields for indexing
]
],
namespace\ModelWithCustomPrimaryKey::class => [
// You can also define your primary key (if you use something else than "id")
'primary_key' => 'my_custom_field_name',
'fields' => [
'username', 'short_description', // fields for indexing
]
],
// ...
],
You can also index values of optional fields (dynamic fields). For enable indexing for optional fields:
- In config for each necessary model add following option:
'optional_attributes' => true
// or
'optional_attributes' => [
'accessor' => 'custom_name' // with specifying of accessor name
]
- In model add special accessor, that returns list of
field-name => field-value
. By defaultgetOptionalAttributesAttribute
accessor will be used. In case accessor name specified in configgetCustomNameAttribute
accessor will be used.
Example:
In config file:
'namespace\FirstModel' => [
'fields' => [
'name', 'full_description', // fixed fields for indexing
],
'optional_attributes' => true // enable indexing for dynamic fields
],
In model add following accessor:
public function getOptionalAttributesAttribute()
{
return [
'optional_attribute1' => 'value1',
'optional_attribute2' => 'value2',
];
}
See details on Apache Lucene - Scoring.
This is Document level boosting in terminology of Apache Lucene. By default all models have boost value equal to 1. For change of this behavior customize boost for necessary models as in the following examples.
- In config for each necessary model add following option:
'boost' => true
// or
'boost' => [
'accessor' => 'custom_name' // with specifying of accessor name
]
In model add following accessor:
public function getBoostAttribute()
{
return 0.5; // customize boost value for model
}
- In model add special accessor, that returns boost value.
By default
getBoostAttribute
accessor will be used. In case accessor name specified in configgetCustomNameAttribute
accessor will be used.
Example:
In config file:
'namespace\FirstModel' => [
'fields' => [
'name', 'full_description',
],
'boost' => true // enable boosting for model
],
In model add following accessor:
public function getBoostAttribute()
{
return 0.5; // customize boost value for model
}
This is Document's Field level boosting in terminology of Apache Lucene. By default boost is set in 1 for each field. For change of this behavior set boost for necessary fields as in the following examples.
In config file:
'namespace\FirstModel' => [
'fields' => [
'name', // field with default boost
'full_description' => ['boost' => 0.2], // customize boost value
],
],
Or/and in model accessor:
public function getOptionalAttributesAttribute()
{
return [
'optional_attribute1' => 'value1', // field with default boost
'optional_attribute2' => ['boost' => 0.5, 'value' => 'value2'], // customize boost value
];
}
By default the following filters are used in search:
- Stemming filter for english/russian words (for reducing words to their root form),
- Stopword filters for english/russian words (for exclude some words from search index).
This filters can be deleted or replaced with others.
'analyzer' => [
'filters' => [
// Default stemming filter.
'Nqxcode\Stemming\TokenFilterEnRu',
],
// List of paths to files with stopwords.
'stopwords' => Nqxcode\LuceneSearch\Analyzer\Stopwords\Files::get(),
],
'queue' => 'search', // Define queue that will be used for jobs of search index updating.
'chunk' => 50, // Chunk size for models, use in search index rebuilding.
For building of search index run:
php artisan search:rebuild --verbose
For clearing of search index run:
php artisan search:clear
For filtering of models in search results each model's class can implements SearchableInterface
.
For example:
use Illuminate\Database\Eloquent\Model;
use Nqxcode\LuceneSearch\Model\SearchableInterface;
class Dummy extends Model implements SearchableInterface
{
// ...
/**
* Get id list for all searchable models.
*/
public static function searchableIds()
{
return self::wherePublish(true)->lists('id');
}
// ...
}
For register of necessary events (save/update/delete) use Nqxcode\LuceneSearch\Model\SearchTrait
in target model:
use Illuminate\Database\Eloquent\Model;
use Nqxcode\LuceneSearch\Model\SearchableInterface;
use Nqxcode\LuceneSearch\Model\SearchTrait;
class Dummy extends Model implements SearchableInterface
{
use SearchTrait;
// ...
}
If you want to avoid triggering the indexing, wrap necessary operations in the withoutSyncingToSearch()
method on your model:
Product::withoutSyncingToSearch(function () {
// mass update position for product, e.g.
foreach (Product::all() as $i => $product) {
$product->update(['position' => $i)]);
}
});
Build query in several ways:
By default, queries which will execute search in the phrase entirely are created.
$query = Search::query('clock'); // search by all fields.
// or
$query = Search::where('name', 'clock'); // search by 'name' field.
// or
$query = Search::query('clock') // search by all fields with
->where('short_description', 'analog'); // filter by 'short_description' field.
// or
$query = Product::search('clock'); // search only in `Product` model by all fields in case when `Product` use `SearchableTrait`.
For query
and where
methods it is possible to set the following options:
- phrase - phrase match (boolean, true by default)
- proximity - value of distance between words (unsigned integer)
- fuzzy - value of fuzzy (float, 0 ... 1)
- required - should match (boolean, true by default)
- prohibited - should not match (boolean, false by default)
Find all models in which any field contains phrase like 'composite one two phrase':
$query = Search::query('composite phrase', '*', ['proximity' => 2]);
Search by each word in query:
$query = Search::query('composite phrase', '*', ['phrase' => false]);
$query = Search::rawQuery('short_description:"analog"');
// or
$rawQuery = QueryParser::parse('short_description:"analog"');
$query = Search::rawQuery($rawQuery);
For built query are available following actions:
$models = $query->get();
$count = $query->count();
$models = $query->limit(5, 10)->get(); // Limit = 5 and offset = 10
$paginator = $query->paginate(50);
Highlighting of matches is available for any html fragment encoded in utf-8 and is executed only for the last executed request.
Search::find('nearly all words must be highlighted')->get();
$highlighted = Search::highlight('all words');
// highlighted html:
// '<span class="highlight">all</span> <span class="highlight">words</span>'
Package licenced under the MIT license.