Recent Posts

Pages: 1 ... 7 8 [9] 10
81
"Plagiarism Detector" - Bugs\Errors\Crashes / Report list error in PD v848
« Last post by Alexei B. on June 04, 2014, 10:30:41 PM »
In version 848 there is a problem where only one report is shown in the report list, and that same report is opened each time any document is checked.

It happens when there are more than 200 reports in the report folder. An easy workaround is to move older reports into a separate folder.

This will be fixed in the next released version.
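
For those who prefer to script the workaround, here is a minimal Python sketch. It is not part of Plagiarism Detector: the function name, both folder paths and the choice to sort by file modification time are assumptions made for illustration. It keeps the newest 200 files in the report folder and moves the rest to a separate folder.

```python
import shutil
from pathlib import Path

def archive_old_reports(report_dir, archive_dir, keep=200):
    """Move the oldest files out of report_dir so that at most `keep` remain."""
    report_dir, archive_dir = Path(report_dir), Path(archive_dir)
    # Newest first, judged by modification time (an assumption about report age).
    reports = sorted(
        (p for p in report_dir.iterdir() if p.is_file()),
        key=lambda p: p.stat().st_mtime,
        reverse=True,
    )
    archive_dir.mkdir(parents=True, exist_ok=True)
    moved = []
    for old in reports[keep:]:
        shutil.move(str(old), str(archive_dir / old.name))
        moved.append(old.name)
    return moved
```

Call it with your actual report folder and a separate archive folder outside it, for example `archive_old_reports(r"C:\path\to\reports", r"C:\path\to\reports_archive")` (both paths are placeholders).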
82
"Plagiarism Detector" - Bugs\Errors\Crashes / Footnotes not extracted problem.
« Last post by Alexei B. on May 16, 2014, 01:55:13 PM »
We have received several requests regarding references in the footnotes of DOCX documents not being detected.

The problem is that text from footnotes is not extracted. This is easy to check: none of the links present in the footnotes appear in the report. We cannot do anything about this, since even Microsoft iFilters do not do this.

We recommend adding all the links from the footnotes at the end of the document as text, for example as a list of used resources. In this case it will work just fine (remember to have URL references enabled in the program).
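
If a document has many footnotes, collecting the links by hand can be tedious. The Python sketch below is not a feature of Plagiarism Detector. It relies only on the standard DOCX package layout (a ZIP archive with a `word/footnotes.xml` part and its relationships file) and lists the URLs it finds there, so that you can paste them at the end of the document. The function name and the simple URL pattern are assumptions, and a URL split across several text runs may be missed.

```python
import re
import zipfile
import xml.etree.ElementTree as ET

W_NS = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"
REL_NS = "{http://schemas.openxmlformats.org/package/2006/relationships}"
URL_RE = re.compile(r"https?://[^\s<>\"]+")  # deliberately simple pattern

def footnote_links(docx_path):
    """Return the URLs found in the footnotes part of a DOCX file, sorted."""
    links = set()
    with zipfile.ZipFile(docx_path) as z:
        names = set(z.namelist())
        # URLs typed as plain text inside the footnotes.
        if "word/footnotes.xml" in names:
            root = ET.fromstring(z.read("word/footnotes.xml"))
            text = " ".join(t.text or "" for t in root.iter(W_NS + "t"))
            links.update(URL_RE.findall(text))
        # Clickable hyperlinks are stored as relationships of the footnotes part.
        rels = "word/_rels/footnotes.xml.rels"
        if rels in names:
            root = ET.fromstring(z.read(rels))
            for rel in root.iter(REL_NS + "Relationship"):
                target = rel.get("Target")
                if target and rel.get("Type", "").endswith("/hyperlink"):
                    links.add(target)
    return sorted(links)
```

Printing the result with `print("\n".join(footnote_links("thesis.docx")))` (the file name is a placeholder) gives a list ready to paste into a "used resources" section.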

I hope it helps.
83
Versions - Release History / PDC core version 844 - new major update
« Last post by Mike Sanders on April 13, 2014, 10:44:43 PM »
Plagiarism Detector major update with core version 844:

Features:

01. Added failed resources number warning to reports.
02. Added a "close" instead of "go to tray" setting in application settings, allowing the program to shut down on close.
03. Folder Check is a bit faster now.
04. Added column-sorting feature in Report Selector per client's request.
05. Portable Plagiarism Detector now has its own Originality Report storage on the same USB Flash drive.
06. Revised Proxy connection testing.

Bugfixes:

01. PDAS subscription capacity problem
02. DOCX loading does not extract text if the file is locked by an open Word application.
03. Default Parser for Docx - changed to tackle Wiki-to-MS-word copy-paste problem.
04. Made Reports Selector load reports much faster.
05. Portable activation fixed.
06. Fixed "check stop" at PDAS if at least one document fails.
07. Tray icon disappears correctly at shutdown.
08. Multiple fixes for Report Loader.
09. Second instance launch fixed.
10. Corrected Report Graph at program resize\minimize.
11. Correct check time for each document.
12. Fixed the "Splash-screen On-Top" problem that prevented correct Proxy configuration.
--
Plagiarism Detector has moved to the Mantis bug-tracking system, which has boosted bug resolution and
general development.
--
We are very much looking forward to your feedback!
84
Dear Sir,

I have contacted our RnD and they promptly responded with a little update.

Please install and try out the following:

http://78.47.128.158/pdc_setup_generic_839.exe
(this will be soon released as an update)

Core version is 839.

This should work without a pre-start freeze.
--
"Local check" contained a bug (some leftover debug source code) that resulted in a freeze before the
actual check started. In this update the code has been removed completely, making this stage much faster.

The bug itself was cumulative: the more files, the longer the freeze. The slower the machine, the longer the freeze.
The less RAM, the longer the freeze.

We have tested it on a case of 11000 sources against a single file,
with a resulting check ETA of 14 minutes, on a relatively powerful desktop with an HDD.
--
Looking forward to your feedback!
85
Several points that you may find useful:

01. "I am trying to compare a document against a rather large directory of other word documents." - we strongly advise trying out our product PDAS (http://www.plagiarism-detector.com/plagiarism-detector-accumulator-server-demo-download.php). It was developed for exactly this task: comparison against a large number of documents in minimal time.

The difference between the two methods is the following:

- "Folder Check" in PDC always takes linear time, e.g.:

  • 1000 documents :: 10 minutes
  • 2000 documents :: 20 minutes
  • 3000 documents :: 30 minutes
etc.

- A PDAS (Plagiarism Detector Accumulator Server) check takes a minimal, nearly fixed time irrespective of the number of documents in the database, e.g.:

  • 1000 documents :: 10 seconds
  • 2000 documents :: 10.2 seconds
  • 3000 documents :: 10.4 seconds
--

02. "The system locks up every time" - could you specify the exact moment this happens?
I've already sent your complaint to our RnD department so that they will look into this.

03. "I need to know what the directory size limits are?"
At the moment there are no limits. PDC recursively locates all the files within the target folder and compares them one by one,
aggregating the results. In short, PDC will check ALL found files.

04. "How many documents can this system compare a document against?" As many as you need. The time will grow in a linear manner though. Using PDAS is strongly advised.
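
To make the difference concrete, the example figures from the first point can be written as two small Python functions. This is only a rough model built from those sample numbers (10 minutes per 1000 documents for Folder Check, about 10 seconds plus 0.2 seconds per extra 1000 documents for PDAS). The function names are made up here, real timings depend on the hardware and the documents, and extrapolating beyond 3000 documents is an assumption.

```python
def folder_check_minutes(n_docs, minutes_per_1000=10.0):
    """Linear model: time grows in proportion to the number of documents."""
    return n_docs / 1000 * minutes_per_1000

def pdas_check_seconds(n_docs, base_seconds=10.0, extra_per_1000=0.2):
    """Near-constant model: a fixed base plus a small increment per 1000 documents."""
    return base_seconds + max(n_docs - 1000, 0) / 1000 * extra_per_1000

if __name__ == "__main__":
    # Values above 3000 documents are extrapolated, not measured.
    for n in (1000, 2000, 3000, 50000):
        print(n, "documents:",
              folder_check_minutes(n), "min (Folder Check) vs",
              round(pdas_check_seconds(n), 1), "s (PDAS)")
```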
--
Feel free to ask any additional questions!
86
Comparison to a folder was always considered more of an extra feature than a primary function. In large part that is because such a check takes a lot of time.
To compare against a big document storage, we recommend using PDAS (Plagiarism Detector Accumulator Server), which was created specifically to check against a database containing thousands of documents, and does it much faster.

So it was a surprise to us that some customers started using the Folder comparison in this way. We never really tested this feature with such big sets of files, nor did we create it for this task.

I will definitely ask our RnD to have a look into it.

May I ask you for some info on your computer's specifications, such as the CPU and the amount of RAM? It can be helpful. It would also help to know how many documents you are trying to check against.
87
I just purchased a pro license of this tool, but am having issues. I am trying to compare a document against a rather large directory of other Word documents. The system locks up every time. I need to know what the directory size limits are, i.e. how many documents can this system compare a document against? 1000 files, 5000 files, 50,000 files???

Thanks.
88
Versions - Release History / PDC core version 838 - minor update
« Last post by Alexei B. on March 22, 2014, 02:34:39 PM »
- Fixed an error with a document loading failure in Demo version
- "Compare a document to a folder of documents" now compares against all the documents in the folder and all its sub-folders
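
As an illustration of what "the folder and all sub-folders" means, here is a minimal Python sketch of a recursive one-by-one comparison. It is not the code or the algorithm used by PDC: the function name, the restriction to plain-text files and the use of `difflib` as a similarity measure are all assumptions chosen to keep the example short.

```python
import difflib
from pathlib import Path

def compare_to_folder(document_text, folder, pattern="*.txt"):
    """Compare one text against every matching file in folder and its sub-folders."""
    folder = Path(folder)
    results = {}
    # rglob descends into all sub-folders, not just the top level.
    for path in sorted(folder.rglob(pattern)):
        if not path.is_file():
            continue
        other = path.read_text(encoding="utf-8", errors="ignore")
        # Stand-in similarity measure; not the algorithm used by the product.
        ratio = difflib.SequenceMatcher(None, document_text, other).ratio()
        results[path.relative_to(folder).as_posix()] = ratio
    return results
```

Each file is compared in turn, so the total time grows with the number of files found.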
89
Over the time of our work in the field of Plagiarism Detection we have answered a lot of questions like "Are our documents stored elsewhere?". The answer is always the same: Your documents don't leave your computer during Plagiarism Detector checks.

But not so long ago a user started a broader discussion about the safety of such products in general - the safety of your documents, that is.

With that person's kind permission, I am publishing our conversation here (lightly edited). It may be useful for those worried about the problem.

_____________________

User:

I have a question regarding this software.

If I test my documents for plagiarism, why must my documents and my scientific works remain stored on a server online?
This software must be 100% confidential. There is no need to upload my scientific works to a server and keep them there permanently.

PD Team:

No checked documents leave your computer when searching for plagiarism (except for fragments used for the search itself), nor do we have any database on our side with users' documents. Thus your documents are confidential.

You may have misunderstood the description of the PDAS software. PDAS is software that a client can use to store documents in his own database and check against them. It is not used for the Plagiarism Detector Internet check.

User:

That is why I asked: there is a lingering suspicion that this anti-plagiarism software takes the documents we scan with it, and that you receive them, publish them before we do and upload them to the Internet. That is not good for our work.
On many websites people complain about this. Many do not want to scan for plagiarism for this reason: their work would have been in vain if software developers receive all scanned documents and publish them or upload them to websites.
It is intellectual theft.

PD Team:

We didn't do any research on the way our competitors work, but we can assure you that no documents leave your computer during check with Plagiarism Detector.

The thing you are worried about would be rather counter-productive for any service that takes care of its clients. Thus the first risk factor I would point to is "free of charge", meaning that I wouldn't trust any plagiarism-check service that is free to all. The question arises: "what's their interest then?".

But for any serious and well-established service, the revenue lost from unsatisfied customers leaving (and generating bad PR) would matter more than their documents. At least I believe so.

Once again: with our software no documents leave your computer to be stored elsewhere.

As you have raised a serious problem, I kindly ask your permission to publish this conversation (text only, no names) on our forums, as it may be of interest for other people.

User:

I raised this issue because, as I said, there is a lingering suspicion that this kind of anti-plagiarism software steals the documents, scientific works, etc. from our PCs when we add a document to the software to check and scan it.
And I don't mean just this software, Plagiarism Detector. In general, with this kind of software that checks whether a document is plagiarized or not, people worry that their work will have been done in vain because of intellectual theft, i.e. theft by the software.
Yes, you can use these phrases and put them on the application's forum, but without my name or my e-mail. Thank you in advance!

PD Team:

Well, I totally understand your concerns. So let me analyze the risks from my experience.

In addition to the already mentioned "free cheese can be a mouse-trap":

1. Software installed on your computer is more secure than an Internet site providing the check service. Everything below applies to installed software, since no one knows what happens server-side.

2. With certain skills, you can check what data leaves your computer during the check. I can't go into the details of the algorithms we use, but to my knowledge one cannot reproduce the document from the search requests generated by our software. Besides, any traffic analysis will show that the requests go to different places (since several search engines are used), thus separating the fragments even more. If such an analysis shows a whole document uploaded somewhere in one piece, the software does not look secure.

3. An easier way: if the software is observed to load the CPU heavily, it is working on your side. If the CPU is not loaded, either the software is not very good, or the document is uploaded to some server and checked there. Evidently, that is less secure.

4. Additional data. You can always search the Internet for third-party sites mentioning the software you are interested in. The more "serious business" it looks, the more secure it is likely to be. As I have said before, one is unlikely to risk the profits of a well-established business by stealing clients' intellectual property. But if the site looks like the homework of a post-graduate student and no one has ever heard of the product before - well, use it at your own risk.

Besides, I have just consulted our RnD about this and they did provide some additions from their perspective:

1. It is rather unlikely that clients' works are stolen for their scientific value. More likely they are taken as part of the regular process of filling the database that documents are checked against. For a widely used service it looks impossible to analyze all the incoming documents for scientific value.

2. Indeed, some services do store all the documents that are checked with them (we don't think it right to name them). You can follow the above-mentioned list of criteria to reduce the chance of using such a service. Even if a service stores clients' documents, there are two options: the documents are stored for the service's internal use only (later documents are checked against older ones), or the documents are later indexed by search engines, which is indeed a serious threat, as your document becomes publicly available.

3. Someone interested in detailed research in the field can make a set of "trap documents" that are 100% original and check them with different services, then repeat the same in about 1.5 months. If a document remains 100% original with the same service, no documents are stored. If it is found plagiarized by the same service, but by none of the others, documents are stored in an internal database. But if different services start finding plagiarism in a clean document that was checked with one service only, those services are just mirrors of a single document-storing server, or the document has become publicly available.
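
The decision rule of the "trap document" experiment in point 3 can be written down as a short Python function. This is only a restatement of the reasoning above, not a tool we provide. The function name, the argument names and the service names in the usage example are made up for illustration.

```python
def interpret_trap_results(checked_with, flagged_by):
    """Interpret the re-check of a 100% original "trap document".

    checked_with: the one service the document was originally submitted to.
    flagged_by:   set of services that report plagiarism on the later re-check.
    """
    flagged_by = set(flagged_by)
    if not flagged_by:
        # Still reported as fully original everywhere.
        return "no sign that the document was stored"
    if flagged_by == {checked_with}:
        # Only the original service now finds a match.
        return "stored in that service's internal database"
    # Services that never received the document also find a match.
    return "services share one storage, or the document became publicly available"
```

For example, `interpret_trap_results("ServiceA", {"ServiceA", "ServiceB"})` falls into the last case, because a service that never received the document now reports a match.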

We hope you find this information useful.
90
FAQ - Frequently Asked Questions / Exclude/Include lists
« Last post by Alexei B. on January 13, 2014, 03:44:17 AM »
On the last page of the Step-By-Step Wizard you can open the second tab, Exclusion lists. It allows you to edit two different lists that give the search engine specific instructions.

Exclude list: if any exclusion mask in this list is present in the URL of a found page, the page is ignored during the check.
For example, adding the title of the document to the list will ignore all Internet sources that use the document name in the page address.
Example:
If the exclusion list contains "wikipedia" then ALL URLs to wikipedia.org will be ignored.
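
The exclusion rule is a simple "mask appears somewhere in the URL" test. A minimal Python sketch of that rule follows. It is an illustration, not the program's source code: the function names are made up, and treating the match as case-insensitive is an assumption.

```python
def is_excluded(url, exclude_masks):
    """True if any exclusion mask occurs anywhere in the URL."""
    url = url.lower()  # case-insensitive matching is an assumption
    return any(mask.lower() in url for mask in exclude_masks if mask)

def filter_urls(urls, exclude_masks):
    """Keep only the URLs that no exclusion mask matches."""
    return [u for u in urls if not is_excluded(u, exclude_masks)]
```

With the mask "wikipedia", `is_excluded("https://en.wikipedia.org/wiki/Plagiarism", ["wikipedia"])` is true, so that page would be skipped. Note that a short mask can match more URLs than intended, since it is matched anywhere in the address.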

Include list: any URL in this list is thoroughly checked during the search.
If you expect a page to be the source of part of the checked document, but the software does not find it, you can add the page to this list to make sure it is checked in detail. The page may go undetected without this because the fast SEP-stage check, which finds candidates for the detailed REP comparison, is less specific and detailed and relies instead on the number of sites checked.