In this system, two or more rules hidden in the database and respective rules' importance can be acquired by cooperation of agents. ![]() By using this method, we had developed the rule extraction system from database (Hara, 2004, 2005, 2008). The ADG is a new method that united Genetic Programming (GP) with cooperative problem solving by multiple agents. The SOM is an algorithm used to visualize and interpret large high-dimensional data sets. A topological map is a simple mapping that preserves neighborhood relations. The SOM is developed by Kohonen and is a topology-preserving map because there is a topological structure imposed on the nodes in the network. First, the method divides E-mails into some categories by Self-Organizing Map (SOM) (Kohonen, 1995) and extracts the adequate judgment rules by Automatically Defined Groups(ADG) (Hara, 1999), even if the judgment results by SpamAssassin are wrong. This method can learn patterns of Spam E-mails and Ham ones and correctly recognizes them. In this paper, we propose a classification method for Spam E-mail based on the results of SpamAssassin, which is the open source software to identify spam signatures. Moreover, even if the message is judged as Spam, the different rules for each person are required according to the personal environment such as work style, because a content of received message will be different. Although each score in the rule of SpamAssassin is low, there is a few cases in which total score becomes high. The agreement degree for each rule is scored and if the total score is larger than the threshold value, the given E-mail is judged as Spam. There are the predefined rules for each filtering method to detect a Spam E-mail. The SpamAssassin(SpamAssassin) is the open source software and is a mail filter which attempts to identify a Spam E-mail using various pattern match methods including text analysis, Bayesian filtering, DNS block lists, and collaborative filtering databases. In order to avoid Spam E-mails, we must build the filtering system which can judge whether the received E-mail is a Spam E-mail or not. Therefore, undesired E-mails to us have been increased everyday, so that, it is not easy to read an important E-mail. The sender cannot be specified, because the sender of Spamming has only temporary E-mail address and the reply of them is not reached to the original sender. Spamming is economically viable because advertisers have no operating costs beyond the management of their mailing lists. Spamming is the abuse of electronic messaging systems to send unsolicited bulk messages or to promote products or services, which are almost universally undesired. ![]() But, their E-mails are called “Spamming“ and the Spamming becomes a one of the social issues. ![]() ![]() Their accumulated E-mail addresses collected in such a way will be valuable for advertising agents. However, E-mail addresses on the web page are gained and the virus is appended to E-mails, and the attacks for acquiring information on user's personal computer (called BOT) have been spreaded. One of major other tools gives E-mail communication which propagates not only letters from person to person, but a means of advertisement. are digitalized at a low price and are shared for our current culture. Web provides a virtual huge space of information, where an emotional experience, a political idea, a cultural custom, and the advice of manners of music, the business, the arts, photographs, and literatures, etc. Recently, the Internet has been a basis of our cultural life.
0 Comments
Leave a Reply. |