Real-time Content Analysis
The Web consists of billions of websites, with millions of new websites being created every month. Web 2.0 has fundamentally changed web pages from static content to content that changes dynamically, and the rapid growth of social media has added user-generated content.
First- and second-generation legacy Web filters, which rely exclusively on URL databases or keyword scanning and scoring to control user access, now offer only limited protection and cannot cope with the Web's continual growth and dynamic nature.
URL databases are typically updated daily by vendors, using a combination of human categorizers and offline content analysis software. With over 7.5 million new or updated URLs created every day, it is simply impossible to maintain an up-to-date URL list. It may take several days or weeks before a new URL is discovered, categorized and made available to customers. What happens if that new URL is not in the URL database? What is the risk if the URL contains inappropriate content that users can freely access?
Keyword scoring offers some additional filtering and protection but is generally effective only across limited types of content, such as pornography. It is also prone to over-blocking. For example, pornography, sex education and health web pages may contain the same words; keyword scoring is not sophisticated enough to determine which category a page belongs to.
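The over-blocking problem can be illustrated with a minimal sketch of keyword scoring. The keyword weights, the threshold and the sample pages below are hypothetical values chosen for illustration and are not taken from any real filter.

```python
import re

# Hypothetical keyword weights and blocking threshold (illustrative only).
KEYWORD_WEIGHTS = {"sex": 5, "nude": 5, "explicit": 3}
BLOCK_THRESHOLD = 8


def keyword_score(text: str) -> int:
    """Sum the weights of every listed keyword that appears in the text."""
    words = re.findall(r"[a-z]+", text.lower())
    return sum(KEYWORD_WEIGHTS.get(word, 0) for word in words)


def is_blocked(text: str) -> bool:
    """Block the page once its keyword score reaches the threshold."""
    return keyword_score(text) >= BLOCK_THRESHOLD


# Two made-up pages that share vocabulary but belong to different categories.
health_page = "Sex education: talking to teenagers about safe sex and health."
adult_page = "Explicit nude photos and sex videos."

for name, page in [("health", health_page), ("adult", adult_page)]:
    print(name, keyword_score(page), is_blocked(page))
```

Because the score only counts words and ignores context, the health page reaches the threshold just as the adult page does, so both are blocked.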
Some vendors complement these techniques with a cloud-based approach, sending pages that are not in the URL database to a cloud-based offline categorization engine. However, this can introduce latency into browsing, and it is relatively simple for criminals to fool the web filter by presenting a different page when the categorizer retrieves it.
To overcome these limitations, Bloxx went back to the drawing board and developed its real-time content analysis and categorization engine, Tru-View Technology (TVT). Using patent-pending language analysis and intelligent identification techniques, TVT performs real-time analysis and categorization of requested Web pages, giving you more complete coverage of the Web and dramatically improving the accuracy of Web content filtering, increasing protection and improving security.
Tru-View Technology Web Content Filtering vs. first and second generation Web Content Filtering
The following table summarizes the key differences between TVT-based Web Content Filtering and legacy first- and second-generation Web filtering.
|1st & 2nd Gen Web Filters||Bloxx Tru-View Technology|
|URL databases, no matter how large they are, provide only limited coverage of the Web.||TVT automatically analyzes and categorizes requested URLs not included in the optimized Bloxx URL database, providing greatly improved coverage of the Web.|
|URL databases focus on top-level domains, so content on individual Web pages is not usually categorized individually. This means that inappropriate or non-business content can easily slip through.||TVT categorizes not just the top-level domain page but any page on a website that is requested. This improves the granularity of filtering and reduces risk.|
|New web pages which may not be listed in a URL database are easily accessible and often need to be manually added to a deny list to block access.||It doesn't matter what Bloxx category the page belongs to – shopping, violence, travel, anonymous proxies – TVT controls it with minimal effort from you.|
|New Websites and Web pages need to be discovered and categorized before being added to a URL database; this could be days, weeks or several months after a Website is launched or new pages are added.||TVT automatically categorizes pages, even if no one has seen the content before, giving you outstanding zero-second protection.|
|URLs in a URL database are seldom re-classified. If the site content changes then the site may be mis-categorized and available to your users.||When the content on a site changes, TVT will analyze and categorize the site into the appropriate Bloxx category.|
How Tru-View Technology Works
Tru-View Technology is patent-pending software that analyzes textual content and classifies it into one or more content categories. The software uses a range of language analysis and classification techniques to provide unsurpassed categorization speed and accuracy.
Each content category used in Bloxx content filters has been trained by Bloxx to accurately recognize the key characteristics and language used in web pages. When presented with a web page, Tru-View Technology analyzes the patterns, relevance and content of the text on the page and classifies the page accordingly.
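As a rough illustration of what training a category on page language can mean, the sketch below uses a small naive Bayes text classifier. This is a generic, well-known technique swapped in for illustration; it is an assumption and not a description of how Tru-View Technology itself works. The class name, category names and training sentences are all hypothetical.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lower-case the text and split it into alphabetic words."""
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayesCategorizer:
    """Toy multinomial naive Bayes classifier with add-one smoothing."""

    def __init__(self):
        self.word_counts = defaultdict(Counter)  # category -> word frequencies
        self.doc_counts = Counter()              # category -> training pages
        self.vocabulary = set()

    def train(self, category, text):
        tokens = tokenize(text)
        self.word_counts[category].update(tokens)
        self.doc_counts[category] += 1
        self.vocabulary.update(tokens)

    def classify(self, text):
        tokens = tokenize(text)
        total_docs = sum(self.doc_counts.values())
        vocab_size = len(self.vocabulary)
        best_category, best_score = None, -math.inf
        for category, counts in self.word_counts.items():
            # Log prior plus the smoothed log likelihood of each word.
            score = math.log(self.doc_counts[category] / total_docs)
            total_words = sum(counts.values())
            for token in tokens:
                score += math.log((counts[token] + 1) / (total_words + vocab_size))
            if score > best_score:
                best_category, best_score = category, score
        return best_category


# Hypothetical training text, one tiny sample per category.
categorizer = NaiveBayesCategorizer()
categorizer.train("shopping", "add to cart checkout free shipping discount price")
categorizer.train("health", "sex education health clinic advice contraception doctor")
categorizer.train("adult", "explicit adult sex videos nude xxx")

print(categorizer.classify("Free shipping on every order, add to cart now"))
print(categorizer.classify("Sex education advice from your doctor"))
```

Unlike keyword scoring, the classifier weighs every word on the page against each category, so a page that mentions "sex" alongside "education" and "doctor" is placed in the health category rather than the adult one.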
This in-line, real-time method of Web content filtering allows web pages that have never been seen before, and consequently may not yet be listed in a URL database, to be identified and categorized correctly with an extremely high level of categorization accuracy.
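The in-line flow can be sketched as a database lookup followed by a real-time fallback for pages that are not yet listed. The function `categorize_request`, the sample URLs and the stand-in fetcher and classifier are hypothetical; this is a simplified sketch under those assumptions, not the Bloxx implementation.

```python
from typing import Callable, Dict


def categorize_request(
    url: str,
    url_db: Dict[str, str],
    fetch_page: Callable[[str], str],
    classify: Callable[[str], str],
) -> str:
    """Return the category for a requested URL.

    Known URLs are answered from the URL database; anything else is
    fetched and classified at request time.
    """
    known_category = url_db.get(url)
    if known_category is not None:
        return known_category
    page_text = fetch_page(url)
    return classify(page_text)


# Stand-in data and helpers (assumptions for illustration only).
url_db = {"http://shop.example/": "shopping"}
pages = {
    "http://new.example/deals": "discount checkout cart",
    "http://new.example/news": "weather and traffic update",
}


def fetch_page(url: str) -> str:
    """Pretend to download the page by reading it from a local dict."""
    return pages.get(url, "")


def classify(text: str) -> str:
    """Trivial placeholder classifier standing in for real content analysis."""
    return "shopping" if "cart" in text else "uncategorized"


print(categorize_request("http://shop.example/", url_db, fetch_page, classify))
print(categorize_request("http://new.example/deals", url_db, fetch_page, classify))
```

The second request is for a page that is absent from the database, yet it still receives a category because the decision is made from the page content when it is requested.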
The method is extremely effective at categorizing web pages across a wide range of different categories, not just inappropriate content such as pornography. For example, the software is excellent at categorizing content such as shopping and social networking that may not be inappropriate but could have a dramatic impact on staff productivity.
The software is also extremely fast and introduces no measurable delay into the overall process of requesting and retrieving web pages.