AspectJ + Spring for method click records
In one of the projects we recently did, we had a Spring and Jersey based JSON/XML REST API server. Clients would connect to this API server and access various resources. Now, for one of the reporting scenario, we needed to log who accessed which API method and when. This appeared to be very simple initially. I decided to intercept every http call, and log that.
However, when I dug deep, I found that one method might call another method internally, depending upon the situation. For example, say User-A is trying to login from Device-1, and he is already logged on to some other device, say Device-2. So, LoginResource would make a call to LogoutResouce to invalidate User-A’s session with Device-2. We couldn’t track this just by intercepting http calls. We needed to adopt some other strategy. Initially, I thought of logging each API method with brute force… but that was too inelegant to actually code. Then I looked at AspectJ, and I was enlightened.
First, Added AspectJ dependencies to pom.
... <dependencies> <!-- aspectj --> <dependency> <groupId>org.aspectj</groupId> <artifactId>aspectjtools</artifactId> <version>1.6.8</version> </dependency> <dependency> <groupId>org.aspectj</groupId> <artifactId>aspectjrt</artifactId> <version>1.6.8</version> </dependency> ... <dependencies> ...
And updated spring config file.
<beans ... xsi:schemaLocation="... http://www.springframework.org/schema/aop http://www.springframework.org/schema/aop/spring-aop.xsd ..." xmlns:aop="http://www.springframework.org/schema/aop" ... > ... <aop:aspectj-autoproxy proxy-target-class="true" /> ...
Then created a Java Method Annotation class.
import java.lang.annotation.ElementType;
import java.lang.annotation.Retention;
import java.lang.annotation.RetentionPolicy;
import java.lang.annotation.Target;
@Retention(RetentionPolicy.RUNTIME)
@Target({ ElementType.METHOD })
public @interface RecordClick {
String action(); // Name of the action
String comment() default ""; // Comment, if any?
}
Every method we want to track, I just need to annotate it with @RecordClick. Like this:
// API to signup
@RecordClick(action = "signup", comment = "User initiated signup")
public User explicitSignup(String firstName, String lastName, String email, String phone, String password, Role role) throws ValidationException {
}
// API to signup
@RecordClick(action = "signup", comment = "System initiated signup")
public User implicitSignup(String email, String phone) throws ValidationException {
}
// API to signin
@RecordClick(action = "signin", comment = "User initiated signin")
public Session explicitSignin(String username, String password) {
}
Okay, so now I have everything in place. I just need to create an Aspect to advice pertinent methods. So, I will create an Aspect class. This will intercept all public methods that are annotated with @RecordClick annotation, and create a click record for this call.
@Aspect
@Component
public class RecordClickMethodInterceptor {
private static final Logger log = LoggerFactory.getLogger(RecordClickMethodInterceptor.class);
// This will create and save an individual click
@Autowired
private ClickResource clickResource;
/**
* Intercepts all public methods annotated with @RecordClick
*
* @param pjp
* @param recordClick
* @return
* @throws Throwable
*/
@Around("execution(public * *(..)) && @annotation(recordClick)")
public Object doClickRecord(ProceedingJoinPoint pjp, RecordClick recordClick) throws Throwable {
log.info("Intercepted method={} with action={}", pjp.getSignature().getName(), recordClick.action());
List<String> paramNames = new ArrayList<String>();
try {
MethodSignature signature = (MethodSignature) pjp.getSignature();
// Fetch method argument names.
// Note: to enable this, you must compile the code in debug mode.
paramNames = Arrays.asList(signature.getParameterNames());
} catch (Exception e) {
log.error("Can't record clicks. You must compile the code with debug=true");
// Proceed with method call
return pjp.proceed();
}
log.info("paramNames={}", paramNames);
String user = "anonymous";
Map<String, Object> methodArguments = new HashMap<String, Object>();
int index = 0;
// Iterate through method arguments
for (Object arg : pjp.getArgs()) {
// name of this argument
String name = paramNames.get(index++);
log.info("{} ==> {}", name, arg);
// if arg is null, skip
if (null == arg) {
continue;
}
// Fetch user from argument '@Context SecurityContext'
if (arg instanceof SecurityContext) {
Principal principal = ((SecurityContext) arg).getUserPrincipal();
if (null != principal) {
log.info("Found user={}", principal.getName());
user = principal.getName();
}
}
// Fetch user from argument 'Principal principal;
else if (arg instanceof Principal) {
Principal principal = (Principal) arg;
log.info("Found user={}", principal.getName());
user = principal.getName();
}
// save to methodArguments
else {
methodArguments.put(name, arg.toString());
}
}
// execute the method
Object returned = pjp.proceed();
// Process the returned value
// Record this click
String action = recordClick.action();
String comment = recordClick.comment();
clickResource.createClick(user, action, methodArguments, comment);
return returned;
}
}
I also need to compile the code in debug mode, unless I do this, I will not be able to fetch method argument names which I desperately need in order to make any sense of these records. So, I needed to update maven-compiler-plugin to enable debug mode.
<plugin>
<artifactId>maven-compiler-plugin</artifactId>
<configuration>
<source>${java-version}</source>
<target>${java-version}</target>
<!-- compile with debug so as to get param names in aspects -->
<debug>true</debug>
<debuglevel>lines,vars,source</debuglevel>
</configuration>
</plugin>
That’s it! have fun.
In Studio: Javascript Primitives
Javascript – and for that matter most other languages – has two kinds of stuffs: primitives and objects. Primitives are immutable and stored in a variable entirely. That is, when you do assignments among primitives, the assigned receives a copy from the assigner.
y = x; // y receives a copy of x.
Objects are not stored in the variable. They stay somewhere else, and the variable only holds a reference to that location. So, in case of object assignments only references get copied.
obj2 = obj1; // obj2 receives a copy of reference to the object referred by obj1.
That is, both obj1 and obj2 now points to the same object, and if you change this object – the change will be observed by both obj1 and obj2.
In Javascript, there are 5 kinds of primitives: undefined, null, boolean, string and number. So, string “abc”, number 1, 1.34, boolean true, false – these all are primitives. Rest everything are Objects.
I know what you are thinking? Primitives are not Objects, and so they must not be able to execute any method on themselves – because they don’t have any – then how come this works?
console.log(“abc”.length); // prints 3
Well, the reason this works is that Javascript wraps these primitives with their Object counterparts.
“abc” ==> new String(“abc”)
true ==> new Boolean(true)
1 ==> new Number(1)
Recall Java’s wrapper objects? Well, this is almost the same and with more juicy auto-boxing in play. Javascript automatically coerces between primitives and Objects. In this case,
- string value is coerced into a String Object,
- property ‘length’ is accessed, and then
- the coerced Object is discarded for garbage collection!
Now, the hacker in you must have started thinking: If Javascript automatically coerces primitives to Objects; then certainly, you should be able to assign properties to a primitive too, like this:
var my_bank_account = “State Bank of India”; my_bank_account.city = “Indore”; console.log(my_bank_account.city); // prints undefined.
Oops. It failed. But why?
Well, what you guessed was right. The moment you tried to assign a property to the primitive, Javascript did indeed coerce primitive string into the wrapper Object – in this case, String. This new Object got itself assigned a brand new property – ‘city’ – too. But, since there wasn’t any placeholder to store the reference to this new Object, it was quickly discarded and sent for garbage collection. And when you went back to access the property ‘city’, Javascript again coerced your primitive to the wrapper Object – which incidentally is a new Object and doesn’t know anything about your ‘city’ property hence you see ‘undefined’.
Let me show you – roughly – what happens behind the scene:
var my_bank_account = “State Bank of India”; my_bank_account.city = “Indore”; ==> (new String(“State Bank of India”)).city = “Indore”; console.log(my_bank_account.city); ==> console.log( (new String(“State Bank of India”)).city );
This whole game seems messy? No. How do I know when am I dealing with primitives and when with Objects?
Well, you use ‘typeof’ operator.
typeof false; // ‘boolean’ typeof new Boolean(false); // ‘object’ typeof “Animesh”;// ‘string’ typeof new String(“Animesh”); // ‘object’ typeof 1.45; // ‘number’ typeof new Number(1.45); // ‘object’
Enlightened? Good.
One more thing: this entire coercion business is a double-way traffic. The way Javascript turns your primitives into Objects when needed, it can turn Objects into primitives too when required.
var s = new String(‘hello’); typeof s; // ‘object’ var p = s.valueOf(); // converts to primitive typeof p; // ‘string’
Javascript will automatically use ‘valueOf’ (and sometimes, ‘toString’) method whenever such a need arises. Watch this:
var x = new String(“abc”); typeof x; // ‘object’ var y = x + 1; // automatically calls valueOf() on Object x. typeof y; // ‘string’
Normally, you don’t need to worry about this thing. But knowing what goes on certainly helps at times. Don’t you think? Follow this:
var b = new Boolean(false);
typeof b; // ‘object’
if (b){
console.log(“b is true.”); // Ah! When did b become true?
}
What’s wrong with the above code? You must have thought that Javascript would coerce the Boolean object into primitive while evaluating ‘if’? Well, Javascript didn’t think so. It just went ahead and evaluated the Object directly for the condition… and since that Object was neither null nor undefined, it evaluated to true.
If you had done this:
if (b.valueOf(){
console.log(“b is true”);
} else {
console.log(“b is false”); // thank God!
}
It would have come out differently. Remember this: unless there is a dire need, Javascript doesn’t perform any coercion.
I hope this article was helpful. I’ll be writing more on Javascript internals.
Fun with Singleton (Python, Javascript, Java)
They say that Singletons, like global variables, are evil. They hide dependencies; are harder to test and even harder to extend. Singletons are lies, and it’s best to keep away from them. But, there are scenarios where you need them. For example, when you want a shared resource like printer spooler or file manager or log manager, you want a single object to handle requests from all the various parts of your application.
In this blog, I am going to explore various ways to make Singletons in languages like Python, Java and Javascript so as to keep it simple, elegant and usable. Let’s talk about Python first. I love it, and it’s a really really wonderful language, and in here, there are n different ways to solve a problem. Singletons are no exception. The most natural way to do it is to create a decorator.
class Singleton(object): def __init__(self, klass): self.klass = klass # class which is being decorated self.instance = None # instance of that class def __call__(self, *args, **kwargs): if self.instance is None: # new instance is created and stored for future use self.instance = self.klass(*args, **kwargs) return self.instance
Now, let’s say you have a Resource class. To make it singleton, you just need to decorate it with ‘@Singleton‘, and you are done.
@Singleton class Resource(object): def __init__(self, klass): self.name = None
Cool…eh? There are other – nerdy – ways too. Python uses an internal dictionary ‘__dict__’ variable to keep track of an Object’s properties and methods. So, if you can share ‘__dict__‘ across multiple instances of a Class, you can share the state between them. And isn’t that Singleton? Yes, that is. You might have many many instances, but all of them behave exactly the same.
class Singleton(object):
_shared = {}
def __init__(self):
self.__dict__ = Singleton._shared
class Resource(Singleton):
def __init__(self, klass):
self.name = None
Since ‘self.__dict__‘ now refers to ‘_shared‘ dictionary, all instances of Resource would use the same dictionary and hence they will all have the same behavior. Geeky? Let me show you an even geekier way to do it.
In Python, when you instantiate a class, the interpreter calls ‘__new__‘ method – a class method which returns an instance of the class – and then ‘__init__‘ method – constructor of the class – is called which initializes the object. So, you can hack into ‘__new__‘ and return the single instance whenever it is being called.
class Singleton(object): _instance = None def __new__(cls, *args, **kwargs): # cls is the Class and the rest are constructor arguments if cls._instance is None: # create an instance and store it cls._instance = Object.__new__(cls, *args, **kwargs) return cls._instance class Resource(Singleton): def __init__(self, klass): self.name = None
Aweomse! Isn’t it? There are other ways that deal with ‘__metaclass__‘ etc. but let’s save them for another day. Let’s use it now:
# get resource r1 r1 = Resource(); # get resource r2 (since Resource is singleton, r1 == r2) r2 = Resource(); # to verify, let's set 'name' onto r1 r1.name = "Animesh Kumar" print r1.name # and the same 'name' appears in r2 as well! print r2.name
Let’s now see how do we do this in Javascript. For the most simple form, just define an Object Literal, and you are done.
var resource = {
getName : function() {
return this.name;
},
setName: function(name){
this.name = name;
}
}
Easy. You have an object which you can now share across your application modules and it just works. For more complex scenarios, like private variables and all, you might have to resort to something like this:
// self-executable wrapper function
var Resource = (function(){
// Resouce class which is to made 'singleton'
function _Resource() {
var name; // private variable
this.getName = function() { // getter
return this.name;
};
this.setName= function(name){ // setter
this.name = name;
};
// do more stuffs
}
// instance holder
var instance = new _Resource();</p>
// return an object with 'getInstance' method
return = {
getInstance: function(){
return instance;
}
};
})();
_Resource (line-04) is your function of interest, and you want to make it singleton. So, you create another function ‘Resource‘ which wraps over _Resource and returns an object with method ‘getInstance‘ which would return the same instance of _Resource every time it will be called.
Let’s try to use it now:
// get resource r1
r1 = Resource.getInstance();
// get resource r2 (since Resource is singleton, r1 == r2)
r2 = Resource.getInstance();
// to verify, let's set 'name' onto r1
r1.setName("Animesh Kumar");
console.log(r1.getName());
// and the same 'name' appears in r2 as well!
console.log(r2.getName());
So it was easy. No? Great.
Now, Java. I know every one of you would already know it. I would write about it anyway, just for the sake of completeness. In Java, you create a private static instance of the class, and use that instance wherever necessary.
public class Resource {
// static instance (Note: Resource instantiation is done here, not in getInstance)
private static Resource instance = new Resource();
// property
private String name;
// private constructor to disable 'new'
private Resource() {
}
// public staic method to get an instance of this class
public static Resource getInstance() {
return instance;
}
// getter
public String getName() {
return name;
}
// setter
public void setName(String name) {
this.name = name;
}
}
Now, let’s use it.
public static void main(String[] args) {
// get resource r1
Resource r1 = Resource.getInstance();
// get resource r2 (since Resource is singleton, r1 == r2)
Resource r2 = Resource.getInstance();
// to verify, let's set 'name' onto r1
r1.setName("Animesh Kumar");
System.out.println(r1.getName());
// and the same 'name' appears in r2 as well!
System.out.println(r2.getName());
}
Loud and clear. And this also stops you from instantiating ‘Resource‘ with ‘new‘ operator. Try this:
Resource r1 = new Resource(); // java.lang.Error: Unresolved compilation problem:
Your code will not compile. I know you know why. I will write anyways: because the constructor is private! So, there is no way to get an instance of Resource class but through ‘getInstance’ method which ensures single instance of the class. Also, If you noticed: I have instantiated ‘instance’ during declaration itself, not in ‘getInstance’ method. This way, the object gets created at the time of class loading, and you save yourself from a lot of issues that creeps in because of Java Just in Time.
IBM has no idea what Node.js is
People, don’t read this article on IBM’s developer works Just what is Node.js?. And you have read it already, don’t believe it. IBM has no idea what Node is, really.
Read my rebuttal http://wkp.me/wkk6g
For more gory details, read Marak Squires’ blog here.
IBM must pull the article or ask the author to rewrite it. The article is providing a dis-service to any new developers who might stumble along it as their first introduction to node.js.
Node and Its Many Incarnations (Node Version Management)
Node.js is under active development. And every other day, a new build is released. It’s awesome to see how fast Node is growing and how vibrant the community is… but on the down side, it’s becoming increasingly difficult to keep track of its many versions, and API changes.
Very often, while developing an app, you find yourself married into a particular Node version, because a newer one might have some API changes (mind you, Node is witnessing heavy transformations, especially at the API level) which might break you app… and then, you would be forced to revert back to the older version. That means, uninstall the current node and re-install the older one. Ouch! So much work for a mere upgrade.
Well, there is a nicer way to do it. Check out this project by Tim Caswell: Node Version Manager. It does exactly what it says. It manages various Node versions on your machine, development, stage, production whatever. How?
It creates a virtual Node environment for each version you want to keep. Let’s say, you want to stay with the last stable release v0.2.6 (from the time you started your app) but also want to experiment with v0.4.7 to keep an eye on new additions. NVM will install two separate Node(s) for you, and each will run in its own sandbox like environment, that is, you will have to install all your third party Modules/Libraries separately for each Node installation. That might seem to be a lot of work, but trust me, it’s the safest way to avoid conflicts. Okay. Let’s get to work.
Installation
Note: I am assuming that you have basic knowhow of GIT (the most awesome source control management system).
- Clone NVM repository to your local machine:
$ git clone git://github.com/creationix/nvm.git ~/.nvm
Above command would close the NVM repository to a folder ‘.nvm’ in your home directory. (I am using Ubuntu 10.0.4)
- Switch to folder ‘.nvm’ and make file ‘nvm.sh’ executable:
$ chmod 755 ~/.nvm/nvm.sh
- ‘nvm.sh’ is just a shell script, so in order to run it, you must source it to every terminal you open. To do this automatically, simply edit either ‘.bashrc’ or ‘.profile’ file to have this line in the very end:
. ~/.nvm/nvm.sh
- That’s it. Open a new terminal and run,
$ nvm
- You will see a set of useful commands you can use.
Easy huh?
Getting dirty
Before you get any further, just make sure that you have ‘wget’ installed in your machine. I know, I know… you might already have it. I just want you to make sure.
Check which versions of Node are available.
$ nvm sync // update the local machine with available versions from server $ nvm ls // displays all available and installed versions
Now install Node v0.4.7.
$ nvm install v0.4.7 // will install Node v.0.4.7
Note: You might get this error, “Could not autodetect OpenSSL support. Make sure OpenSSL development packages are installed. Use configure –without-ssl to disable this message” which says, that you need to install SSl library:
$ sudo apt-get install libssl-dev
NVM creates a folder ‘src’ either in your home directory or in ‘.nvm’ directory where it downloads the bundled release, extracts and installs it. NVM also installs NPM (node package manager) for each installation of Node.
Select a particular version
$ nvm use v0.4.7 // start using Node-v0.4.7
That’s it. You have set up a system which will enable you to quickly and cleanly switch between various Node versions. You can test your app’s compatibility with any of them, and if need be, easily switch to the one your app was most comfortable with.
Now, since you have set up a congenial Node development machine, in the next blog, I will talk about how to go live with your Node app.
Note: for CentOs-5.x, please make sure that you have following packages installed:
$sudo yum install gcc-c++ screen git-core openssl openssl-devel
shutdown hooks in java
You know what? You can add your custom hooks to JVM to be called when JVN shuts down. Interesting… no?
What happens is very simple: you create a Thread and write you logic in there, and then register this Thread to the Runtime instance. Here is a sample:
public class JVMShutdownTest {
public static void main(String[] args) {
// Add a shutdown hook
Runtime.getRuntime().addShutdownHook(new Thread(){
public void run(){
System.out.println ("closing down the shop...");
}
});
// Exit now
System.exit(0);
}
}
output will be: closing down the shop…
This attached Thread is initialized but not yet started. When JVM starts to shut down, it starts all registered hooks in an uncontrollable (that is, there is no way to enforce your will) order, and all of them run concurrently.
One thing you must take in consideration is that these hooks get executed at a very delicate time and so you must keep them light, thread-safe and independent of heavy dependencies. Thump rule: hooks must finish quick.
WebSocket support in Android’s Phonegap apps
We are developing a small game which can be played from multiple users using variety of clients, web-browser, Android, iPhone, iPad etc. It’s like, there is a server and all clients connect to this server, and send and receive messages. We decided to use WebSocket for underlying connection between clients and server, and Phonegap to build clients. Our idea is to write the client once and then run it on variety of platforms. Since, Phonegap enables app development using HTML, CSS and JavaScripts, it generously fits into our requirement.
But Phonegap doesn’t support WebSocket yet, it’s in their Plan-of-Action for 1.x release though. So, it was needed to be addressed. I found Mathias Desloge’s PhoneGap-Android-HTML5-WebSocket project. It was good but it used old java.io.* packages. I would have preferred to use java.nio.* for better and efficient non-blocking behavior. So, I decided to write my own small library.
Library can be found here: websocket-android-phonegap.
How to use?
- Copy Java source into your source folder.
- Copy websocket.js in your assets/www/js folder
- Attach com.strumsoft.websocket.phonegap.WebSocketFactory to WebView, like
@Override public void onCreate(Bundle savedInstanceState) { super.onCreate(savedInstanceState); super.loadUrl("file:///android_asset/www/index.html"); // attach websocket factory appView.addJavascriptInterface(new WebSocketFactory(appView), "WebSocketFactory"); } - In your page, create a new WebSocket, and overload its method ‘onmessage’, ‘onopen’, ‘onclose’, like
// new socket var socket = new WebSocket('ws://192.168.1.153:8081'); // push a message after the connection is established. socket.onopen = function() { alert('connected'); }; // alerts message pushed from server socket.onmessage = function(msg) { alert(JSON.stringify(msg)); }; // alert close event socket.onclose = function() { alert('closed'); };
How it works?
When you create a new WebSocket object in your page, behind the scene, websocket.js delegates the responsibility to com.strumsoft.websocket.phonegap.WebSocketFactory to instantiate new com.strumsoft.websocket.phonegap.WebSocket object.
// websocket.js // get a new websocket object from factory (check com.strumsoft.websocket.WebSocketFactory.java) this.socket = WebSocketFactory.getWebSocket(url);
WebSocketFactory simply instantiates a new WebSocket object, connects it to the designated server and returns the instance.
// com.strumsoft.websocket.phonegap.WebSocketFactory
public WebSocket getWebSocket(String url) throws URISyntaxException {
WebSocket socket = new WebSocket(appView, new URI(url));
socket.connect(); // connects to server
return socket;
}
Now, whenever an event occurs, say, ‘onmessage’, WebSocket class delegates that event to Javascript.
// com.strumsoft.websocket.phonegap.WebSocket
public void onMessage(String message) {
appView.loadUrl(buildLoadData("message", message));
}
private String buildLoadData(String _event, String _data) {
String _d = "javascript:WebSocket.on" + _event + "(" +
"{"
+ "\"_target\":\"" + webSocketId + "\"," +
"\"_data\":'" + data + "'" +
"}" +
")";
Logger.log(_d);
return _d;
}
Finally, ‘WebSocket.onmessage’ from websocket.js is called. It parses the payload, finds out the target WebSocket object, and calls the corresponding event on the target object with event data.
// websocket.js
// static event methods to call event methods on target websocket objects
WebSocket.onmessage = function (evt) {
WebSocket.registry[evt._target]['onmessage'].call(global, evt._data);
}
That’s it!
Amendment
(Date: Thu Aug 25 12:40:52 IST 2011)
There was a serious bug! The Websocket connection runs in a separate thread to manage persistent state with the server. And the front end Javascript (websocket.js) stays within UI/Main thread. And Android doesn’t want other threads to communicate with UI thread directly. These threads must employ an additional thread to bridge the communication. So, here is the fix!
// a message is sent to server!
public void send(final String text) {
// new thread
new Thread(new Runnable() {
@Override
public void run() {
if (instance.readyState == WEBSOCKET_STATE_OPEN) {
try {
instance._send(text);
} catch (IOException e) {
instance.onError(e);
}
} else {
instance.onError(new NotYetConnectedException());
}
}
}).start();
}
// when a message is received
public void onMessage(final String msg) {
// post a new thread to View
appView.post(new Runnable() {
@Override
public void run() {
appView.loadUrl(buildJavaScriptData(EVENT_ON_MESSAGE, msg));
}
});
}
Commit link:
https://github.com/anismiles/websocket-android-phonegap/commit/a7ccb815cce3a446c3ec92058187cdb20e5a41e8
https://github.com/anismiles/websocket-android-phonegap/commit/087f7a93d46f92cb037d2b451a4d253a65f5f015
WebSocket and node.js: why shud’ya care?
Traditional HTTP messages are heavy. Every message is sent with HTTP headers. Now, let’s say you have an application that has a real-time component, like chat or some twitter client or may be some traffic analysis stuff. And let’s say you have around 100,000 users connected to your app. To make your app real-time, you need to have a mechanism which will enable server to push data almost as soon as this data becomes available. You could do it in two ways: Write a script which will connect to server every few seconds to check if there is any data. With each attempt, full set of HTTP headers moves back and forth between client and server. That’s not very efficient. To save yourself with all these bandwidth hassles, you could use a popular trick known as long-polling, where your browser connects to server and server holds the connection open until there is some data available to be pushed.
Now, let’s assume that there are 100,000 users connected to your app and every 10 seconds some data is sent from server to clients. Following HTTP specs, every time some data is sent, full set of headers are shared between client and server. This is how they look,
Request header
GET / HTTP/1.1 User-Agent: ...some long user agent string... Host: animesh.org Accept: */*
Response header
HTTP/1.1 200 OK Date: Tue, 25 Jan 2011 17:32:19 GMT Server: Apache X-Powered-By: PHP/5.2.3 X-Pingback: http://animesh.org/endpoint/ Connection: close Transfer-Encoding: chunked Content-Type: text/html; charset=UTF-8
That’s approximately 350 bytes of data, per user every 10 seconds. That’s roughly 28,400,000 bits per second of network throughput for 100,000 users. Roughly 26.7 Mbps for only HTTP headers. Gosh!
WebSocket
WebSocket comes to resue. With web sockets, once a handshake is done between client and server, messages can be sent back and forth with a minimal overhead. That’s awesome. You do a handshake while establishing the connection, and of course handshaking needs all those HTTP headers, but after that, you only need to send the data… no headers. This greatly reduces the bandwidth usage and thus improves the performance. Let’s see how. This is how handshake headers look like,
Handshake Request header
GET /demo HTTP/1.1 Upgrade: WebSocket Connection: Upgrade Host: animesh.org Origin: http://animesh.org WebSocket-Protocol: sample
Handshake Response header
HTTP/1.1 101 Web Socket Protocol Handshake Upgrade: WebSocket Connection: Upgrade WebSocket-Origin: http://animesh.org WebSocket-Location: ws://animesh.org/ WebSocket-Protocol: sample
And now, the connection has been established and data can freely flow between server and client without having to exchange any HTTP headers until this connection is closed or broken and you do another handshake. Imagine how much bandwidth you are saving! Whoa!
Example
Let’s write a simple application to see and learn how this thing actually works. This application will have a server all the clients will connect to, and whenever one client writes something to the server, all clients will be notified.
Here is our server, written in Node.js. Let’s name it server.js
Note: Though you can very well write a web socket server using Node’s native APIs, however I chose to use Micheil Smith‘s node-websocket-server library. This library is simple, elegant and very easy to work with. It works by wrapping and extending Node’s server object.
var sys = require("sys");
// Library https://github.com/miksago/node-websocket-server
var websocket = require('./lib/node-websocket-server/lib/ws/server');
// create web socket server
var server = websocket.createServer();
// listen on port 8078
server.listen(8078);
// when the server is ready
server.addListener("listening", function() {
sys.log("Listening for connections on localhost:8078");
});
// when a traditional HTTP request comes
server.addListener("request", function(req, res) {
res.writeHead(200, {
"Content-Type" : "text/plain"
});
res.write("This is an example WebSocket server.");
res.end();
});
// when a client websocket connects
server.addListener("connection", function(conn) {
// when client writes something
conn.addListener("message", function(message) {
// iterate thorough all connected clients, and push this message
server.manager.forEach(function(connected_client) {
connected_client.write(JSON.stringify(conn.id + ": " + message));
});
});
});
Now, let’s write a simple client. We will create one HTML file and run it in Google Chrome. Let’s name is client.html
<!DOCTYPE html>
<html>
<head>
<title>WebSocket - Simple Client</title>
<script type="text/javascript" src="http://ajax.googleapis.com/ajax/libs/jquery/1.2.6/jquery.js"></script>
<script type="text/javascript">
$(function() {
// bind form
$('#payload-form').submit(function() {
var payload = ($("input#payload").val());
socket.send(payload); // write to server
return false;
});
// open websocket
var socket = new WebSocket('ws://localhost:8078');
socket.onopen = function() {
// Web Socket is connected. send an initial random message.
socket.send(Math.floor(Math.random()*11) + ' >> Hi, I am Mr. so-and-so!');
};
// append to '#log' whatever server pushes.
socket.onmessage = function(ev){
msg = JSON.parse(ev.data);
$('#log').append(JSON.stringify(msg) + '</br>');
}
})
</script>
</head>
<body>
<div id='payload-container'>
<form id='payload-form'>
<input type='text' name='payload' id='payload' value='Hello World' style="width:500px;"/>
<input type='submit' value='push'/>
</form>
</div>
<div id='log' style='display:block; border:1px solid lightgray;'></div>
</body>
</html>
Now, run your server, and open your client in multiple Chrome windows/tabs.
// run server $ node server.js
That’s it! Was is fun? I will write more on how to establish WebSocket connections from a Java client in the next blog.
i’m a joke
I’m a joke
may be, a laugh.
Don’t trust me
not worth it
I come cheap
and easy
like wild scrubs
at every nook
and cranny.
Never taken home
but, used
and cast away
and laughed at
worst, ridiculed sometimes
I’m not worth it.
Maven, SQL and ordered execution
I have been re-architecting this new project. This is huge. And broken. And I was called upon to clean, refactor and re-architect it. Phew!
I thought a good strategy would be to deal with it in outside-in style, which is, fixing the build part first then moving down to various parts. One of the interesting things that they had done was using Maven to populate PostgreSQL DB Schemas and then load data into the DB.
Steps of execution
- Drop everything
- Create users
- Create schemas
- Load data
- Load upgrade data
They used sql-maven-plugin underwhich they had multiple execution tasks. The whole thing was very untidy and I didn’t like it.
Summary
- They used <fileSets> to import sql files. FileSets are an elegant choice if you want to import files with ANT like wildcard patterns. But the files are alphabetically sorted. So, unless you don’t care about the order in which the files will be executed, you should avoid using FileSets. In their case, they wanted to run ‘drop_users_databases.sql’ before ‘create_users_databases.sql’. With FileSets, it was impossible to do, they would be re-ordered just the opposite. Solution? They should have used <srcFiles>, but instead they created 2 separate <execution> tasks to run in order. Too much verbosity. Eh?
- Another issue was with loading upgrade data. Now, they follow agile methodologies and keep updating their DB schemas, and so after few iterations, before they merge the changes in the main script, they end up with a number of files like upgrade-schema-2.0.4.sql, upgrade-schema-2.0.5.sql, upgrade-schema-2.0.5.1.sql and so on. Pain is, they don’t have any automated mechanism to execute these files in order, so the developers would be forced to do it manually. If the number of these files were low, that wouldn’t have been a problem… but how about thirty or forty such files? Pissed off.
Strategy
- Kick <fileSets> out of the window
- Use <srcFiles> instead (because I cared more about order)
- Logically partition the operations
- Create a new profile to help run these DB executions at will
Code
<profiles> <profile> <!-- profile id --> <id>init-db</id> <build> <plugins> <plugin> <groupId>org.codehaus.mojo</groupId> <artifactId>sql-maven-plugin</artifactId> <!-- postgresql dependencies --> <dependencies> <dependency> <groupId>postgresql</groupId> <artifactId>postgresql</artifactId> <version>8.3-603.jdbc4</version> <scope>clean</scope> </dependency> </dependencies> <!-- DB Configuration --> <configuration> <url>jdbc:postgresql:my_db</url> <driver>org.postgresql.Driver</driver> <username>user</username> <password>pass</password> <autocommit>true</autocommit> </configuration> <executions> <!-- 1. drop everything, create users and load data --> <execution> <id>drop-and-create-database</id> <phase>clean</phase> <goals><goal>execute</goal></goals> <configuration> <srcFiles> <srcFile>src/main/sql/drop_users_databases.sql</srcFile> <srcFile>src/main/sql/create_users_databases.sql</srcFile> <srcFile>src/main/sql/create_schema.sql</srcFile> <srcFile>src/main/sql/load_data.sql</srcFile> <srcFile>src/main/sql/test_data.sql</srcFile> </srcFiles> </configuration> </execution> <!-- 2. run all upgrade scripts --> <execution> <id>upgrade-schema</id> <phase>clean</phase> <goals><goal>execute</goal></goals> <configuration> <srcFiles> <srcFile>src/main/sql/upgrade-schema-2.0.6.sql</srcFile> <srcFile>src/main/sql/upgrade-schema-2.1.5.sql</srcFile> <srcFile>src/main/sql/upgrade-schema-2.1.7.sql</srcFile> <srcFile>src/main/sql/upgrade-schema-2.1.8.sql</srcFile> <srcFile>src/main/sql/upgrade-schema-2.2.1.sql</srcFile> <srcFile>src/main/sql/upgrade-schema-3.0.0.sql</srcFile> <srcFile>src/main/sql/upgrade-schema-3.0.1.sql</srcFile> <srcFile>src/main/sql/upgrade-schema-3.0.2.sql</srcFile> <srcFile>src/main/sql/upgrade-schema-3.0.3.sql</srcFile> <srcFile>src/main/sql/upgrade-schema-3.0.4.sql</srcFile> <srcFile>src/main/sql/upgrade-schema-3.0.5.sql</srcFile> <srcFile>src/main/sql/upgrade-schema-3.0.6.sql</srcFile> <srcFile>src/main/sql/upgrade-schema-3.0.6.1.sql</srcFile> <srcFile>src/main/sql/upgrade-schema-3.0.8.sql</srcFile> <srcFile>src/main/sql/upgrade-schema-3.0.9.sql</srcFile> <srcFile>src/main/sql/upgrade-schema-3.0.10.sql</srcFile> <srcFile>src/main/sql/upgrade-schema-3.0.11.sql</srcFile> </srcFiles> </configuration> </execution> </executions> </plugin> </plugins> </build> </profile> </profiles>
How to run?
Note that I have bound the executions with phase ‘clean’ within a profile with id ‘init-db’. So, I just need to let maven know about the phase and profile, like this:
mvn -Pinit-db clean
I hope this will help someone in need. I will post more adventurous code cleaning stuffs as I will encounter them. Cheers!




