How NogaLogic Works
Data Classification & Business Context
Every company suffers from the same problem: vast amounts of unidentified unstructured data in a multitude of formats — including email, Word documents, PDF files, spreadsheets and web pages — dispersed across the enterprise. Without full visibility and control of this data, you cannot exploit its full business value, and you are exposed to the significant risks associated with inappropriate data access and distribution, regulatory non-compliance and inadequate legal discovery. You also probably spend too much time and money on storage and the management of unnecessary files. The only way to fully understand information assets and avoid the risks associated with poor data protection and management is to first identify your data assets by their business value to your organization. Only then can you develop the right policies and processes to manage and protect your company.
What is Business Context?
Every piece of unstructured business data is “about something.” It may be about the design of a product, a situation with a customer, the financial performance of a business unit, a patient's medical history, or some combination of such things. This is its “business context”. Obviously, this business context is important — because the business context is what differentiates a proposal for a million-dollar deal from an invitation to the company picnic. The business context of a document affects everything about how this document is managed, from its access and distribution permissions, to its storage location and retention requirements. The business context of a document is particularly important for identifying sensitive data. A document is sensitive because of what it’s about, rather than some arbitrary tag or classification (such as top secret, public). Furthermore, the sensitivity of a document can/will change over the documents’ lifecycle depending on changing business circumstances. Therefore it will need to be identified and managed on an ongoing basis, according to its changing value to the organization.
How does NogaLogic create business context?
NogaLogic codifies the business context using Business Entities (BEs). NogaLogic’s BEs are the logical objects that define what the business is and what it does, as discussed and referenced within an organization’s documents. These are typically, the people and companies that are related to the business in one capacity or another, products that the company produces, and the locations related to the business. The actual nature of the BE types will vary from industry to industry. For example, in the pharmaceutical industry, the relevant product BEs will include the names of drugs, diseases, vitamins, proteins etc. while in the communications industry relevant types of product BEs will likely include cell phones models, coverage plans etc. BEs are multi dimensional. They can be identified not only by their actual names, but also by other properties — such as email addresses, phone numbers, or other unique or semi-unique identifiers.
Through its data classification process, NogaLogic transforms your fragmented unstructured data scattered across your company into a single, simple, well structured virtual repository. Now, your unstructured data can now be identified, understood, managed organized and protected based on its actual business meaning and value to your specific organization.
NogaLogic’s data classification is automatic, powerful yet extremely simple and easy-to-use:
- Accurate, objective and consistent results through utilization of business context data classification, SNLP, keyword search and various text analysis techniques.
- Automatically recognizes synonyms and nicknames.
- Understands the hierarchy of business terms to one and another (such as a product belonging to a certain department).
- Classification includes metadata properties as well as the document's content.
- Supports multiple European languages, including English, French, German, Italian, Spanish, Portuguese, Dutch and Hebrew. Includes automatic language identification and morphological analysis for each specific language.
- Recognizes and reclassifies new or updated documents as they are created.
- Extremely easy to use.
- Fast and easy installation, classification process is automatic and doesn’t not require manual tagging