Article list
Streaming algorithm
- Blog category:
- Storm
http://en.wikipedia.org/wiki/Streaming_algorithm
Count-Min sketch
- Blog category:
- Storm
“Sketching” data structures store a summary of a data set in situations where storing the whole data set would be prohibitively costly (at least in a fast-access place such as memory, as opposed to the hard disk). Variants of trees, hash tables, etc. are not sketching structures; they just facilita ...
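To make the idea of a summary structure concrete, here is a minimal Count-Min sketch in Java. It is only an illustrative sketch under stated assumptions: the class name `CountMinSketch`, the `depth`/`width` parameters, and the seeded integer mixing used as a hash family are all hypothetical choices, not taken from the linked post or from any library. Items whose `String.hashCode()` values coincide collide in every row, which a production implementation would avoid with stronger hashing.

```java
import java.util.Random;

/** Minimal Count-Min sketch: a depth x width grid of counters, one hash per row. */
final class CountMinSketch {
    private final int depth;        // number of rows (independent-ish hash functions)
    private final int width;        // counters per row
    private final long[][] counts;  // the whole summary: depth * width longs
    private final int[] seeds;      // hypothetical per-row seeds for the hash mixing

    CountMinSketch(int depth, int width, long seed) {
        if (depth <= 0 || width <= 0) {
            throw new IllegalArgumentException("depth and width must be positive");
        }
        this.depth = depth;
        this.width = width;
        this.counts = new long[depth][width];
        this.seeds = new int[depth];
        Random rnd = new Random(seed);
        for (int row = 0; row < depth; row++) {
            seeds[row] = rnd.nextInt();
        }
    }

    /** Maps an item to a counter index in the given row (assumed mixing, not a proven hash family). */
    private int bucket(String item, int row) {
        int h = item.hashCode() ^ seeds[row];
        h ^= h >>> 16;
        h *= 0x85ebca6b;
        h ^= h >>> 13;
        h *= 0xc2b2ae35;
        h ^= h >>> 16;
        return Math.floorMod(h, width);
    }

    /** Adds a non-negative count for the item to one counter in every row. */
    void add(String item, long count) {
        if (count < 0) {
            throw new IllegalArgumentException("count must be non-negative");
        }
        for (int row = 0; row < depth; row++) {
            counts[row][bucket(item, row)] += count;
        }
    }

    /** Returns the minimum over the rows; collisions can only inflate it, never shrink it. */
    long estimate(String item) {
        long min = Long.MAX_VALUE;
        for (int row = 0; row < depth; row++) {
            min = Math.min(min, counts[row][bucket(item, row)]);
        }
        return min;
    }

    public static void main(String[] args) {
        CountMinSketch sketch = new CountMinSketch(4, 1024, 42L);
        for (String word : "storm kafka storm trident storm".split(" ")) {
            sketch.add(word, 1);
        }
        System.out.println("storm ~ " + sketch.estimate("storm"));
    }
}
```

Memory use is fixed at `depth * width` counters regardless of how many distinct items are seen. Because counts are non-negative, an estimate is never below the true count and never above the total of all counts added.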
https://storm.apache.org/documentation/Setting-up-a-Storm-cluster.html
This page outlines the steps for getting a Storm cluster up and running. If you're on AWS, you should check out the storm-deploy project. storm-deploy completely automates the provisioning, configuration, and installation of Sto ...
References
https://github.com/pranab/sifarish
sudo apt-get install libgtkmm-2.4-dev
sudo apt-get -f install
wget http://cdn.mysql.com/Downloads/MySQLGUITools/mysql-workbench-community-6.2.3-1ubu1404-amd64.deb
sudo dpkg -i mysql-workbench-community-6.2.3-1ubu1404-amd64.deb
If you want to patch and build mysql-workbench yoursel ...
Storm guarantees that each message coming off a spout will be fully processed. This page describes how Storm accomplishes this guarantee and what you have to do as a user to benefit from Storm's reliability capabilities.
What does it mean for a message to be "fully processed"?
A tuple co ...
Trident Tutorial
- Blog category:
- Storm
Trident is a high-level abstraction for doing realtime computing on top of Storm. It allows you to seamlessly intermix high throughput (millions of messages per second), stateful stream processing with low latency distributed querying. If you're familiar with high level batch processing tools like P ...
https://storm.apache.org/documentation/Transactional-topologies.html
NOTE: Transactional topologies have been deprecated -- use the Trident framework instead.
Storm guarantees data processing by providing an at-least-once processing guarantee. The most common question asked about Storm is " ...
References
http://www.wentrue.net/blog/?p=1181
http://blog.csdn.net/huilixiang/article/details/38441203
[1] Davidson, J. and Liebald, B. and Liu, J. The YouTube video recommendation system. Proceedings of the fourth ACM conference on Recommender systems. 2010 ...
#tar xzf kafka-0.8.1.1-src.tgz
#cd kafka-0.8.1.1-src
#./gradlew build
error:Cannot perform signing task ':clients:signArchives' because it has no configured signat
#./gradlew releaseTarGzAll -x signArchives
package inok.storm.kafka.sample;
import java.io.FileInputStream;
import java.io.IOException;
import java.util.Arrays;
import java.util.HashMap;
import java.util.Iterator;
import java.util.Map;
import java.util.Map.Entry;
import java.util.Properties;
import java.util.concurrent.a ...
We are using Sqoop to export data from Hive to SQL Server. The new data is always appended to the existing data in SQL Server.
Is it possible to truncate the SQL Server table via Sqoop before starting the export?
sqoop eval --connect 'jdbc:sqlserver://1.1.1.1;database=SomeDatabase;user ...
Build tool: Gradle
- Blog category:
- Java
http://gradle.org/docs/current/userguide/userguide_single.html
https://spring.io/guides/gs/gradle/
http://www.drdobbs.com/go-parallel/article/print?articleId=240143874&siteSectionName=
http://www.michael-noll.com/blog/2013/01/18/implementing-real-time-trending-topics-in-storm/
A common pattern in real-time data workflows is performing rolling counts of incoming data points, also known as sliding window analysis. A typical use case for rolling counts is identifying trending to ...
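The rolling-count pattern described above can be sketched with a ring of slots per key. The following is a simplified, hypothetical illustration and not the code from the linked article: the class name `RollingCounter`, the method `countsThenAdvance`, and the choice of `String` keys are assumptions made for this example. Each call to `countsThenAdvance` is assumed to be triggered by some external timer, once per slot interval.

```java
import java.util.HashMap;
import java.util.Map;

/** Rolling (sliding window) counter: each key owns a ring of numSlots counters. */
final class RollingCounter {
    private final int numSlots;                              // window length, in slot intervals
    private final Map<String, long[]> slotsByKey = new HashMap<>();
    private int head = 0;                                    // slot receiving current increments

    RollingCounter(int numSlots) {
        if (numSlots < 1) {
            throw new IllegalArgumentException("numSlots must be at least 1");
        }
        this.numSlots = numSlots;
    }

    /** Records one occurrence of the key in the current slot. */
    void increment(String key) {
        slotsByKey.computeIfAbsent(key, k -> new long[numSlots])[head]++;
    }

    /** Returns per-key totals over the whole window, then slides the window by one slot. */
    Map<String, Long> countsThenAdvance() {
        Map<String, Long> totals = new HashMap<>();
        for (Map.Entry<String, long[]> entry : slotsByKey.entrySet()) {
            long sum = 0;
            for (long c : entry.getValue()) {
                sum += c;
            }
            totals.put(entry.getKey(), sum);
        }
        // Move to the oldest slot and wipe it so it can hold the next interval.
        head = (head + 1) % numSlots;
        slotsByKey.values().forEach(slots -> slots[head] = 0);
        // Drop keys that no longer have any counts inside the window.
        slotsByKey.values().removeIf(slots -> {
            for (long c : slots) {
                if (c != 0) {
                    return false;
                }
            }
            return true;
        });
        return totals;
    }

    public static void main(String[] args) {
        RollingCounter counter = new RollingCounter(3);
        counter.increment("storm");
        counter.increment("storm");
        counter.increment("kafka");
        System.out.println(counter.countsThenAdvance());
    }
}
```

With `numSlots` slots, each emitted total covers at most the last `numSlots` intervals, and a key disappears once all of its slots have aged out. Ranking the returned totals would give a simple trending list.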