Thursday, March 22, 2012

Twitter Infographic


create infographics with visual.ly This info graphic shows the numbers behind the twitter accounts of Kevin Durant and Dwayne Wade of the NBA. The beauty of the infographic is that it shows the information in an eye catching view and is easy to interpret and understand. 

Looking at the graph it is easy to see that Wade has more followers with 2.89 million compared to 1.64 million followers for Durant. Kevin Durant does have 1.2K : 1 follower to following ration unlike Wade who has a 8.5K. This shows that Durant is just a rising young star that stays in touch with fans and is more likely to follow fans than Dwayne Wade.

Looking at the Followers by region it is easy to see that Wade has a bigger followed base in the US compared to Durant. Both players have the potential to increase their follower base tremendously if either of their teams can win a Championship this season. Durant has the biggest twitter upside because of his engagement with followers and his basketball potential.

Friday, March 9, 2012

Twitter Sentiment

Wordle: Lebron5600
Twitter Sentiment
Wordle: Lebron5600A1
TweetTone
This is from Twitter Sentiment This is from TweetTone Comparing the two sentiment tools it is clear that Lebron James is not liked by the majority of the NBA Community. There are also alot of post regarding Kobe Bryant when users tweet about Lebron James. The ratings from the sentiment analysis do not carry over into the word cloud.

Monday, March 5, 2012

Splunk

1. What does Splunk do and offer ?
Splunk is a company founded around making machine data accessible and easy to use by everyone. Machine data is the part of big data that is collected by any and all interactions that a person has with a company. It includes websites, applications, servers and other computer devices used on a daily basis. Splunk provides analytics for clients to turn all that machne data, transaction, call records and so on into operational inteligence to allow clients to reach their target market.

2. I am unsure on what Case to use for the example application.

3. Evaluation of Splunk
Splunk has over 3300 customers world wide through more than 75 countries. Splunks service seems to be great and their personnel seem to passionate and well trained for them to be used by different enterprises big and small. Operational intelligence is helpful tool for any business big or small. The company seems well positioned for expansion in the growing machine data market.

4.
index any data
Indexing Data from any Source
splunk distributed search across datacenters
Scable with Data Centers  around the world
                       

Cloudera

1.What is Hadoop and why is it a big deal ?
Hadoop is a software framework that allows users to change the application code to customize it to their big data analytic needs. Hadoop is an Apache open source being developed by a world wide community of developers. It is based around earlier work done by Google on their MapReduce application. MapReduce allows distributed computing of large data sets on a cluster of computers. Hadoop is a big deal because it is a open source project that allows custom code that allows business to analyze complex data sets that otherwise would have been hard to make sense of using standard data tables. Hadoop is used by a lot of different big companies but most business are not ready to use it just yet because of the high level analytic expertise and training it requires.

2. Who are Cloudera?
It is a company that specializes in Apache Hadoop software and support services around it on an enterprise level they also contribute to Apache projects related to Hadoop. Cloudera offers two products; the first which is Cloudera Enterprise and the other being  Cloudera's Distribution including Apache Hadoop.


3.What is PIG?
It is a high level data flow language used in conjunction with Hadoop. The language is called Pig Latin and it is a form of Java that allows for fast ad-hoc analysis of data sets. Users can create their own functions for special purpose data processing.


4.What is HIVE ?
HIVE functions as a data warehouse that allows for query based analysis of larger data sets. It uses a SQL like languange for its queries.It functions along side Hadoop files systems and just like Hadoop it is open-source and apache developed.


5. What is Cassandra?
Cassandra is an Apache open source database management system. It can handle large volumes of data that is spread out around many different servers. It started out as a way for Facebook to power their inbox search function. It uses NoSQL because traditional SQL based databases can be slow when dealing with big data sets.


6. What is Mahout ?
Mahout it is a suite of machine learning libraries that is designed to be  scalable and robust. it is another Apache open source project that is degined to work with Hadoop. Hadoop is associated with big data and Mahout is the word for a person driving an elephant. The elephant is Hadoop and Mahout wants to be the driving force behind it, but not lead the development of Hadoop.

Movie Review

Ip Man

Ip Man is a overly dramatized biograpiical movie about Yip “ Ip” Man. He is the first master of Wing Chun fighting style. The movie follows Ip Man from his days just before Japan invaded China in the 1930’s and his struggle to provide for his family. The movie also is about rising up to oppression and equality between nations and races.The movie is also packed with greatly executed martial arts sequences by Donnie Yen as Master Ip

Reasons I like the movie
1.Fast paced execution of Wing Chun kung fu
2.It’s based on the life of the master and mentor of the great Bruce Lee
3.It has a great message about equality and pride, even though it might come across as pro-Chinese to some people

IMDB Information: Ip Man