Streaming Data Access for BSON and JSON encoded documents
Boson is a library, written in Scala with both Scala and Java APIs, built to extract and inject data to and from various ‘wire encodings’. It currently implements two input codecs, JSON documents encoded in the form of UTF8 strings, and BSON documents encoded in the form of binary arrays.
Through the use of Shapeless, Boson allows the use of user created classes as data extraction/injection types.
In the following points we show how to use Boson in both Scala and Java in a QuickStart Guide.
Boson is available through the Central Maven Repository, divided into 3 parts, BosonScala, BosonJava and BosonCore. To include Boson in your project you can get the dependencies code for each part from Maven.
A "Boson" is an object created when constructing an extractor/injector that includes either a Bson encoded as an Array[Byte] or a Json encoded as a String in a Netty buffer and processes it according to a given expression, traversing the buffer only once.
Extraction requires a "BsonPath" expression (see Documentation for examples and syntax), an encoded BSON and an Higher-Order Function. The Extractor instance is built only once and can be reused multiple times to extract from different encoded BSON.
// Valid Bson Event
val arrayObj = new BsonArray().add(10).add(20).add(30)
val bsonEvent = new BsonObject().put("name","SomeName").put("array", arrayObj)
// Encode Bson:
val validBson : Array[Byte] = bsonEvent.encode.getBytes
// BsonPath expression:
val expression: String = ".array[1]"
// Put the result onto a Stream.
val valueStream : ValueStream = ValueStream()
// Simple Extractor:
val boson: Boson = Boson.extractor(expression, (in: Int) => {
valueStream.add(in)
})
// Trigger extraction with encoded Bson:
boson.go(validBson)
// Function will be called as a result of calling 'go'
Injection requires a "BsonPath" expression (see Documentation table for examples and syntax), an encoded BSON and an Higher-Order Function. The returned result is a Future[Array[Byte]]. The Injector instance is built only once and can be reused to inject different encoded BSON.
// Valid Bson Event
val arrayObj = new BsonArray().add(10).add(20).add(30)
val bsonEvent = new BsonObject().put("name","SomeName").put("array", arrayObj)
// Encode Bson:
val validBson: Array[Byte] = bsonEvent.encode.getBytes
// BsonPath expression:
val expression: String = ".name"
// Simple Injector:
val boson: Boson = Boson.injector(expression, (in: String) => {
in.toUpperCase
})
// Trigger injection with encoded Bson:
val result: Future[Array[Byte]] = boson.go(validBsonArray)
// Function will be called as a result of calling 'go'
Instead of passing a function to modify the value found throught the path, it is also possible to just inject a simple value by replacing the above injector with the following.
val boson: Boson = Boson.injector(expression, "SOMENAME")
Extraction requires a "BsonPath" expression (see Documentation table for examples and syntax), an encoded BSON and a lambda expression. The Extractor instance is built only once and can be reused multiple times to extract from different encoded BSON.
// Valid Bson Event
BsonArray arrayObj = new BsonArray().add(10).add(20).add(30);
BsonObject bsonEvent = new BsonObject().put("name","SomeName").put("array", arrayObj);
// Encode Bson:
byte[] validatedByteArray = bsonEvent.encode().getBytes();
// BsonPath expression:
String expression = ".array[1]";
// Want to put the result onto a Stream.
ArrayList<Integer> mutableBuffer = new ArrayList<>();
// Simple Extractor:
Boson boson = Boson.extractor(expression, (Integer obj)-> {
mutableBuffer.add(obj);
});
// Trigger extraction with encoded Bson:
boson.go(validatedByteArray);
// Function will be called as a result of calling 'go'
Injection requires a "BsonPath" expression (see Documentation table for examplesand syntax), an encoded BSON and a lambda expression. The returned result is a CompletableFuture<byte[]>. The Injector instance is built only once and can be reused to inject different encoded BSON.
//Valid Bson Event
BsonArray arrayObj = new BsonArray().add(10).add(20).add(30);
BsonObject bsonEvent = new BsonObject().put("name","SomeName").put("array", arrayObj);
// Encode Bson:
byte[] validatedByteArray = bsonEvent.encode().getBytes();
// BsonPath expression:
String expression = ".name";
// Simple Injector:
Boson boson = Boson.injector(expression, (String in) -> {
in.toUpperCase();
return in;
});
// Trigger injection with encoded Bson:
byte[] result = boson.go(validatedByteArray).join();
// Function will be called as a result of calling 'go'
Instead of passing a function to modify the value found throught the path, it is also possible to just inject a simple value by replacing the above injector with the following.
val boson: Boson = Boson.injector(expression, "SOMENAME");
BosonPath expressions target a Bson structure with the same logic JsonPath expressions target a JSON structure and XPath target an XML document. Unlike in JsonPath, there is no reference of a "root member object", instead if you want to specify a path starting from the root the expression must begin with a dot (example: .key
).
BsonPath expressions use the dot-notation: key1.key2[0].key3
.
Expressions whose path doesn't necessarily start from the root can be expressed in two ways:
- No dot:
key
- Two dots:
..key
Operator | Description |
---|---|
. |
Child. |
.. |
Deep scan. Available anywhere a name is required. |
@ |
Current node. |
[<number> ((to,until) <number>)] |
Array index or indexes. |
[@<key>] |
Filter expression. |
[first | end | all] |
Array index through condition. |
* |
Wildcard. Available anywhere a name is required. |
Given the json
{
"Store":{
"Book":[
{
"Title":"Java",
"Price":15.5,
"SpecialEditions":[
{
"Title":"JavaMachine",
"Price":39
}
]
},
{
"Title":"Scala",
"Pri":21.5,
"SpecialEditions":[
{
"Title":"ScalaMachine",
"Price":40
}
]
},
{
"Title":"C++",
"Price":12.6,
"SpecialEditions":[
{
"Title":"C++Machine",
"Price":38
}
]
}
],
"Hat":[
{
"Price":48,
"Color":"Red"
},
{
"Price":35,
"Color":"White"
},
{
"Price":38,
"Color":"Blue"
}
]
}
}
BsonPath | JsonPath | Result |
---|---|---|
.Store |
$.Store |
All Stores and what they contain |
.Store.Book[@Price] |
$.Store.Book[?(@.Price)] |
All Books that contain the tag "Price" |
Book[@Price]..Title |
$..Book[?(@.Price)]..Title |
All the "Titles" of the Books, available anywhere in the json, that contain the tag "Price" |
Book[1] |
$..Book[1] |
The second Book of all the Book arrays available in the json |
Book[0 to end]..Price |
$..Book[:]..Price |
All the values of the tag "Price" from all the Books, available in the json, and the objects they contain |
Book[all].*..Title |
$..Book[:].*..Title |
All the values of the tag "Title" if the objects contained in Book, available anywhere in the json |
Book[0 until end]..Price |
$..Book[:-1]..Price |
All the values of the tag "Price" from the Books, available in the json, and the objects they contain, excluding the last Book |
Book[first until end].*..Title |
$..Book[:-1].*..Title |
All the values of the tag "Title" if the objects contained in each Book, available anywhere in the json, excluding the last Book |
.* |
$.* |
All the objects contained in Store |
Book.*.[first to end] |
$..Book.*.[:] |
An Array contained in all the objects in Book, available anywhere in the json(considering the case above, nothing) |
.Store..Book[1 until end]..SpecialEditions[@Price] |
$.Store..Book[1:-1]..SpecialEditions[?(@.Price)] |
All the Special Editions, available anywhere in the Book, of Book that contain the tag "Price" from the second Book until the end, excluding |
Bo*k , *ok or Bo* |
Non existent. |
Halfkeys of Book that return all Books |
*ok[@Pri*]..SpecialEd*.Price |
Non existent. |
Prices of Halfkey of SpecialEditions,available anywhere in Book, of Halfkey of Book that contain the Halfkey of Price |
Note: JsonPath doesn't support the halfkey (B*ok
).
Boson is a library that relies on high performance BSON/JSON data manipulation, and so performance monitoring is of paramount importance. The chosen java profiler is YourKit for being a supporter of open source projects and one of the most innovative and intelligent tools for profiling Java & .NET applications as well .