A Simple and Efficient Banned-Word Filter Class

nid007

Blog category: java

Tags: banned-word filtering, java

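Usage:

public static void main(String[] args) {
	SimpleTreeFilter filter = new SimpleTreeFilter();
	filter.addKeyword("禁词1");    // "banned word 1"
	filter.addKeyword("禁词2");    // "banned word 2"
	filter.addKeyword("其它禁词"); // "other banned word"

	// contains() returns the first keyword found, or null if the text is clean.
	System.out.println(filter.contains("我是合法的"));      // "I am legal"
	System.out.println(filter.contains("我包含禁词1"));     // "I contain banned word 1"
	System.out.println(filter.contains("我包含禁词2"));     // "I contain banned word 2"
	System.out.println(filter.contains("我包含其它禁词1")); // "I contain other banned word 1"
	System.out.println(filter.contains("来个别的吧"));      // "let's try something else"
	System.out.println(filter.contains("再见"));            // "goodbye"
}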

Output:
null
禁词1
禁词2
其它禁词
null
null

import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

public class SimpleTreeFilter {

	// One trie node: children are keyed by the next character of a keyword.
	private static class TreeNode {
		final Map<Character, TreeNode> next = new HashMap<Character, TreeNode>();
		boolean isEnd = false; // true if a keyword ends at this node
	}

	// Root level of the trie, keyed by the first character of each keyword.
	private final Map<Character, TreeNode> head = new HashMap<Character, TreeNode>();

	// Adds a keyword to the trie. Keywords are stored lower-cased,
	// so matching is case-insensitive. Empty keywords are ignored.
	public void addKeyword(String word) {
		if (word == null || word.isEmpty()) {
			return;
		}
		word = word.toLowerCase(Locale.ROOT);
		Map<Character, TreeNode> level = head;
		TreeNode node = null;
		for (int i = 0; i < word.length(); i++) {
			char c = word.charAt(i);
			node = level.get(c);
			if (node == null) {
				node = new TreeNode();
				level.put(c, node);
			}
			level = node.next;
		}
		node.isEnd = true;
	}

	// Returns the first keyword found in line (lower-cased),
	// or null if line contains no keyword.
	public String contains(String line) {
		if (line == null) {
			return null;
		}
		// Lower-case first, then measure: lower-casing can change the length.
		line = line.toLowerCase(Locale.ROOT);
		int len = line.length();
		for (int i = 0; i < len; i++) {
			// Walk the trie from the root for every start position i.
			Map<Character, TreeNode> level = head;
			for (int j = i; j < len; j++) {
				TreeNode node = level.get(line.charAt(j));
				if (node == null) {
					break; // no keyword continues with this character
				}
				if (node.isEnd) {
					return line.substring(i, j + 1);
				}
				level = node.next;
			}
		}
		return null;
	}
}
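
The class above only reports the first hit. A common follow-up is to mask every hit instead, so here is a minimal sketch of that idea using the same character trie. KeywordMasker and its mask method are hypothetical names that do not appear in the original post, and the choice to prefer the longest keyword at each position is an assumption (contains() above stops at the shortest one).

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper, not part of the original post:
// replaces every keyword occurrence with '*' characters.
public class KeywordMasker {

	private static class Node {
		final Map<Character, Node> next = new HashMap<Character, Node>();
		boolean isEnd; // true if a keyword ends at this node
	}

	private final Node root = new Node();

	// Stores the keyword lower-cased, one character per trie level.
	public void addKeyword(String word) {
		Node node = root;
		for (int i = 0; i < word.length(); i++) {
			char c = Character.toLowerCase(word.charAt(i));
			Node child = node.next.get(c);
			if (child == null) {
				child = new Node();
				node.next.put(c, child);
			}
			node = child;
		}
		if (node != root) {
			node.isEnd = true; // empty keywords are ignored
		}
	}

	// Length of the longest keyword starting at index i, or 0 if none starts there.
	private int longestMatch(String line, int i) {
		Node node = root;
		int best = 0;
		for (int j = i; j < line.length(); j++) {
			node = node.next.get(Character.toLowerCase(line.charAt(j)));
			if (node == null) {
				break;
			}
			if (node.isEnd) {
				best = j - i + 1;
			}
		}
		return best;
	}

	// Returns a copy of line with every keyword hit replaced by asterisks.
	public String mask(String line) {
		StringBuilder out = new StringBuilder(line.length());
		int i = 0;
		while (i < line.length()) {
			int n = longestMatch(line, i);
			if (n == 0) {
				out.append(line.charAt(i));
				i++;
			} else {
				for (int k = 0; k < n; k++) {
					out.append('*');
				}
				i += n; // skip past the masked keyword
			}
		}
		return out.toString();
	}
}
```

Lower-casing one character at a time with Character.toLowerCase keeps the indexes of the original line valid, so the untouched parts of the text keep their original case. Like contains(), the scan restarts from the trie root at every position, so the worst case grows with the line length times the longest keyword length.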

2012-05-30 09:47