
Releases: nazuke/SEOMacroscope

New v1.7.6.1 release of SEO Macroscope: Predestination

17 Feb 10:45

This is a simple release of SEO Macroscope that adds some shortcuts for preset crawl configurations, for example, crawling only the HTML pages on a site and ignoring linked assets. This can save a little time otherwise spent switching options on and off in the preferences panels.

New features in this release include:

  • Crawl configuration presets.

Bug fixes

  • Nothing significant for this release.

Please report issues at https://github.com/nazuke/SEOMacroscope/issues.


New v1.7.6 release of SEO Macroscope: Entitled

27 Jan 10:15

This release of SEO Macroscope enhances the keyword meta tag analysis for some legacy processing, and fixes some bugs.

New features in this release include:

  • Titles and descriptions are now included in the keywords meta tag analysis. Additionally, some malformed keywords meta tags will be reported as such.
  • Some refactoring has been done behind the scenes to try and make the crawled document collection a little more efficient.
  • New Recent URLs sub-menu in the File menu.
  • New Status Code columns in the Links and Hyperlinks overview lists.
  • Anchor Text columns added to broken links reports.
  • Experimental Save Session feature, to save and reload the current crawled session.

Bug fixes

  • Fixed error handling in AnalyzeKeywordPresence.
  • Added preferences option to re-fetch linked documents from external sites that initially return a 404 on a HEAD request. This helps to verify links to external sites that have mis-configured webservers, at the expense of network bandwidth.

Please report issues at https://github.com/nazuke/SEOMacroscope/issues.


New v1.7.5 release of SEO Macroscope: Bearer of The Word

23 Nov 07:54

This release of SEO Macroscope adds keyword meta tag analysis for some legacy processing, and fixes some bugs.

New features in this release include:

  • A recent request was made for processing the contents of the legacy "keywords" meta tag. This feature is enabled by default, with the results available in the new Keywords Presence display list. Very briefly, the contents of the keywords meta tag are examined, and then the presence or absence of each keyword in the page body text is reported. Currently, this only applies to the body text; other elements, such as the title tag, are not processed.
    • Normally, I would advise against using the keywords meta tag in any new websites; however, it appears that this meta tag is still used by some CMS platforms, and this analysis is a reasonable way to check whether keywords that should be present in the body text are actually there.
    • Please note that this analysis step is separate from the existing keywords analysis; that analysis ignores the keywords meta tag entirely, and operates on the body text alone.
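
The presence check described above can be sketched roughly as follows. This is an illustration only, not SEO Macroscope's implementation: the function name `keyword_presence`, the comma splitting, and the case-insensitive substring match are all assumptions.

```python
def keyword_presence(keywords_meta: str, body_text: str) -> dict:
    """Map each keyword from a keywords meta tag to whether it occurs in the body text."""
    body = body_text.lower()
    result = {}
    for raw in keywords_meta.split(","):
        keyword = raw.strip().lower()
        if keyword:  # skip empty entries, e.g. from a stray or trailing comma
            result[keyword] = keyword in body
    return result
```

A substring match is the simplest choice here; a stricter check might match whole words only.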

Bug fixes

  • Licence window was broken.
  • Preferences window resized for smaller screens.

Please report issues at https://github.com/nazuke/SEOMacroscope/issues.


New v1.7.4 release of SEO Macroscope: Divine Predecessors

04 Sep 13:08

This release of SEO Macroscope adds parent directory probing, and fixes bugs.

New features in this release include:

  • There is a new option to probe parent directories for each URL found on an internal site. This builds a new set of URLs to crawl, by taking the current URL, and progressively stripping off each rightmost element until it reaches the root. Each stripped URL is then added to the list of URLs to crawl.
  • The body text word counter has been improved, and unit tests written.
  • Regular expression data extraction now works on PDF documents.
  • PDF embedded link extraction and following has been improved.
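
As a rough illustration of the parent directory probing described above, the following Python sketch strips the rightmost path element repeatedly until it reaches the root. The function name `parent_urls` is hypothetical, and dropping the query string and fragment is an assumption; the release notes do not say how the application handles them.

```python
from urllib.parse import urlsplit, urlunsplit

def parent_urls(url: str) -> list:
    """Return the parent directory URLs of `url`, nearest parent first, ending at the root."""
    parts = urlsplit(url)
    segments = [s for s in parts.path.split("/") if s]
    parents = []
    while segments:
        segments.pop()  # strip the rightmost path element
        path = "/" + "/".join(segments)
        if segments:
            path += "/"  # directories keep a trailing slash
        # Assumption: query string and fragment are discarded for parent URLs.
        parents.append(urlunsplit((parts.scheme, parts.netloc, path, "", "")))
    return parents
```

Each returned URL would then be added to the crawl queue, subject to the usual de-duplication.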

Bug fixes

  • Not a bug as such, but the check-for-update (phone home) function now compares the current and updated version numbers more precisely, instead of doing a simple equality comparison.
  • Keyword analysis is now skipped when humans.txt, and some other page types, return a 404.
  • Absolute URL handling in robots.txt has been improved.
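
To show why an ordered version comparison is preferable to a plain equality test, here is a small Python sketch. It is not the application's code; the helper names `parse_version` and `is_newer`, and the assumption that versions are purely numeric and dot-separated, are mine.

```python
def parse_version(version: str) -> tuple:
    """Turn a dotted version string such as 'v1.7.6.1' into a tuple of integers."""
    return tuple(int(part) for part in version.lstrip("vV").split("."))

def is_newer(current: str, candidate: str) -> bool:
    """True when `candidate` is a later version than `current`."""
    a, b = parse_version(current), parse_version(candidate)
    width = max(len(a), len(b))
    # Pad with zeros so that 1.7 and 1.7.0 compare as equal.
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return b > a
```

Comparing integer tuples avoids the pitfalls of string comparison, where "1.7.10" would sort before "1.7.9", and of a not-equal test, which would treat an older published version as an update.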

Please report issues at https://github.com/nazuke/SEOMacroscope/issues.


New v1.7.3 release of SEO Macroscope: Chainlinks

22 May 15:07

This release of SEO Macroscope primarily fixes a number of minor bugs, and adds a few new features.

New features in this release include:

  • Where possible, Author fields are extracted from HTML and PDF documents.
  • The Page Metadata Excel report has a new worksheet that combines the crawled author, title, description, and keywords fields.
    • This can be useful when crawling a list of PDF documents, as it extracts that information into a single worksheet.
  • A simple check for update feature has been added. This will show an alert if a new version of SEO Macroscope appears to be available.
  • HTML page character set sniffing has been enhanced.

Bug fixes

  • Rewrote the redirect chain analysis code, so that the redirect chain analysis should now be more complete for each crawl. Previously, the redirect chain list was built from the crawled document collection, which meant that some redirects were missing if they had not been crawled yet. Now, an explicit HEAD request is executed for each document that redirects, until no more redirects are encountered.
  • There was a locking fault in the crawled document collection that caused some documents to never be fetched.
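
The approach of issuing HEAD requests until no more redirects are encountered can be sketched as below. This is a simplified illustration, not the application's code. The `head` parameter is a hypothetical caller-supplied function that performs the request and returns a `(status_code, location)` pair, and the `max_hops` limit and loop detection are assumptions added for safety.

```python
from typing import Callable, List
from urllib.parse import urljoin

REDIRECT_CODES = (301, 302, 303, 307, 308)

def redirect_chain(url: str, head: Callable[[str], tuple], max_hops: int = 10) -> List[str]:
    """Follow redirects from `url` and return every URL visited, in order.

    `head(url)` must return (status_code, location), where location is the
    Location header value or None.
    """
    chain = [url]
    seen = {url}
    while len(chain) <= max_hops:  # follow at most max_hops redirects
        status, location = head(chain[-1])
        if status not in REDIRECT_CODES or not location:
            break  # not a redirect: the chain ends here
        next_url = urljoin(chain[-1], location)  # Location may be relative
        if next_url in seen:
            break  # redirect loop
        chain.append(next_url)
        seen.add(next_url)
    return chain
```

Passing the request function in as a parameter keeps the chain logic separate from the HTTP client, so it can be exercised with canned responses.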

Please report issues at https://github.com/nazuke/SEOMacroscope/issues.


New v1.7.2 release of SEO Macroscope: Two to talk

16 Apr 13:18

This release of SEO Macroscope primarily fixes a number of minor bugs.

Bug fixes

  • Incorrect behaviour of progress dialogues.
  • Removed the memory guard on sitemap generators, which may have prevented sitemap generation under certain circumstances.

Please report issues at https://github.com/nazuke/SEOMacroscope/issues.


New v1.7.1 Release of SEO Macroscope: HTTP Too and a half

15 Apr 13:42

This release of SEO Macroscope primarily fixes bugs from v1.7.

New features in this release include:

  • There is a new hyperlink ratio feature found in the document details panel, and in the overview Excel and CSV reports. This calculates the percentage value for the number of hyperlinks in and out of a particular document, within the crawled collection. It does not include links from third-party sites not in the crawled collection.
  • The web proxy settings may now use the system's configured proxies.

Bug fixes

  • A malformed User-Agent HTTP Header caused some websites to not be crawled at all.

Please report issues at https://github.com/nazuke/SEOMacroscope/issues.


New v1.7 Release of SEO Macroscope: HTTP Too

04 Mar 13:43

This release of SEO Macroscope includes support for HTTP/2 on Windows 10, and accelerates near-duplicate content detection. HTTP/2 support enables the application to work with websites that do not also support HTTP/1.1 as a fallback option.

As well as numerous bugs being fixed, the Levenshtein analysis of near-duplicate content has also been accelerated, by generating a "Levenshtein Fingerprint" of the document text of each web page as it is crawled. There are two analysis level options; the second is slower, but may help to eliminate false positives.
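
For readers unfamiliar with the underlying measure, the following Python sketch computes the plain Levenshtein (edit) distance between two strings. It does not attempt to reproduce the "Levenshtein Fingerprint", whose construction is not described in these notes; the function name `levenshtein` is illustrative.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance between a and b (insertions, deletions, substitutions)."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]
```

The cost grows with the product of the two text lengths, which is presumably why comparing a shorter fingerprint of each page is faster than comparing full page texts.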

Where possible, the full text of PDF documents may be extracted, analyzed for readability, and URLs extracted. Currently, URLs found in the text will be added as additional outlinks from the PDF.

In some cases, QR Codes in linked image files on the site may be detected as containing a link. These can be further crawled.

Source code and an installer can be found on GitHub at:

I've also fixed many minor bugs and other issues.

New features in this release include:

  • HTTP/2 support on Windows 10 machines.
  • Improved crawl continuation when document type preferences have been changed.
  • Optional detection of hyperlinks in QR Codes found in linked image files.
  • Orphaned HTML pages may be reported. These are pages that are found via sitemaps, but may not be linked to from other HTML pages.
  • More sitemap error reporting.
  • More details in the Site Overview summary, such as number of document types found.
  • Where possible, the text of PDFs is now fully extracted, and may be searched.
  • Where possible, some URLs may be found in the text of PDFs, and further crawled.
  • For text analysis, certain HTML5 navigational elements may now be disregarded.
  • Some HTTP Status Codes, such as 410, may be ignored as errors.
  • Redirect chain analysis.
  • Additional Excel and CSV reporting.
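
The orphaned page report listed above amounts to a set difference: URLs that appear in sitemaps but are never the target of a link from another crawled HTML page. A minimal Python sketch, with a hypothetical function name and input shapes that are assumptions rather than the application's data model:

```python
def orphaned_pages(sitemap_urls, links_by_page):
    """Return sitemap URLs that no other crawled HTML page links to.

    `links_by_page` maps each crawled page URL to the URLs it links to.
    """
    linked = set()
    for source, targets in links_by_page.items():
        linked.update(t for t in targets if t != source)  # ignore self-links
    return sorted(set(sitemap_urls) - linked)
```

In this simple form the start URL is reported as orphaned unless some page links back to it, so a real report would probably exclude it.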

Bug fixes

There are too many fixes to list, but notable ones included:

  • The "Maximum pages to fetch" setting has been reworked, and now works properly.
  • Previously, outlinks for document types set as not crawlable were not being properly reported as outlinks in crawled documents; this has been corrected.


New v1.6 release of SEO Macroscope: The Flesch Prevails

15 Sep 14:43

This release of SEO Macroscope includes support for English language text readability scoring. Implementations of the Flesch-Kincaid or SMOG algorithms may be applied to the body text of web pages, giving a simple score as to the "readability" of the page text. This may then be used to further refine the page text to suit the target audience.
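
As an illustration of this kind of scoring, the Python sketch below computes the Flesch-Kincaid grade level using the standard published formula. The release notes do not say which Flesch-Kincaid variant the application uses, and the vowel-run syllable counter and the sentence splitting here are rough assumptions, so scores will differ from those of a careful implementation.

```python
import re

def count_syllables(word: str) -> int:
    """Rough syllable estimate: count runs of vowels (a heuristic, not a dictionary lookup)."""
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))

def flesch_kincaid_grade(text: str) -> float:
    """Flesch-Kincaid grade level of English text; 0.0 for empty input."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        return 0.0
    syllables = sum(count_syllables(w) for w in words)
    # 0.39 * (words per sentence) + 11.8 * (syllables per word) - 15.59
    return 0.39 * len(words) / len(sentences) + 11.8 * syllables / len(words) - 15.59
```

Longer sentences and longer words both raise the grade level, so the score gives a quick indication of whether page text suits its intended audience.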

Installation

It is strongly recommended that you uninstall any previous versions of SEO Macroscope before installing the new version.

New features in this release include:

  • New Flesch-Kincaid and SMOG algorithms, for scoring the readability of the body text in English language web pages.

  • Include and Exclude Patterns now take a list of regular expressions.

  • New overview panel selector menu.

  • Most display lists may be exported to CSV or Excel formats, including search results.

  • Links to external pages and files are now included in results lists.

  • Some simple chart displays.

Bug fixes

  • The in and out link calculation display in the Structure Overview panel has been fixed. Previously, these values were not being displayed correctly after the link counts for each document were updated. There is a new button that may be clicked to trigger recalculation; otherwise recalculation occurs periodically, and when a scan completes.

Please check the downloads page for more recent versions.

Please report issues at https://github.com/nazuke/SEOMacroscope/issues.


New v1.5 release of SEO Macroscope: Hard Vacuum

11 Jul 14:48

This release of SEO Macroscope is now 64-bit. This is mostly to help alleviate out-of-memory problems when performing large scans.

Installation

It is strongly recommended that you uninstall any previous versions of SEO Macroscope before installing the new version.

New features in this release include:

  • Custom filters.

  • Data extractors (web scrapers) using CSS selectors, regular expressions, and XPath queries.

  • Google XML and text format sitemap generators.

  • More Excel and CSV format report generators.

  • Include/Exclude patterns now use regular expressions.

Please check the downloads page for more recent versions.

Please report issues at https://github.com/nazuke/SEOMacroscope/issues.
