Skip to content

Releases: mhahsler/dbscan

dbscan_1.2.0

28 Jun 22:19
Compare
Choose a tag to compare

New Features

  • dbscan has now tidymodels tidiers (glance, tidy, augment).
  • kNNdistplot can now plot a range of k/minPts values.
  • added stats::nobs methods for the clusterings.
  • kNN and frNN now contains the used distance metric.

Changes

  • dbscan component dist was renamed to metric.
  • Removed redundant sort in kNNdistplot (reported by Natasza Szczypien).
  • Refactoring use more performant anyNA(x) instead of any(is.na(x))
    and many more (by m-muecke).
  • Reorganized the C++ source code.
  • README now uses bibtex.
  • Tests use now testthat edition 3 (m-muecke).

dbscan_1.1.10

17 Jan 16:54
Compare
Choose a tag to compare

New Features

  • is.corepoint() for DBSCAN.
  • coredist() and mrdist() for HDBSCAN.
  • find connected components with comps().

Changes

  • reachability plot now shows all undefined distances as a dashed line.

Bugfix

  • memory leak in mrd calculation fixed.

dbscan_1.1.9

11 Jan 00:20
Compare
Choose a tag to compare

dbscan 1.1-9 (2022-01-10)

Changes

  • We use now roxygen2.

New Features

  • Added predict for hdbscan (as suggested by moredatapls)

dbscan_1.1-8

27 Apr 13:56
Compare
Choose a tag to compare

dbscan 1.1-8 (2021-04-26)

Bugfixes

  • LOF: fixed numerical issues with k-nearest neighbor distance on Solaris.

dbscan 1.1-7 (2021-04-21)

Bugfixes

  • Fixed description of k in knndistplot and added minPts argument.
  • Fixed bug for tied distances in lof (reported by sverchkov).

Changes

  • lof: the density parameter was changes to minPts to be consistent with the original paper and dbscan. Note that minPts = k + 1.

dbscan_1.1-6

01 Mar 15:26
Compare
Choose a tag to compare

Improvements

  • Improved speed of LOF for large ks (following suggestions by eduardokapp).
  • kNN: results is now not sorted again for kd-tree queries which is much faster (by a factor of 10).
  • ANN library: annclose() is now only called once when the package is unloaded. This is in preparation to support persistent kd-trees using external pointers.
  • hdbscan lost parameter xdist.

Bugfixes

  • removed dependence on methods.
  • fixed problem in hullplot for singleton clusters (reported by Fernando Archuby).
  • GLOSH now also accepts data.frames.
  • GLOSH returns now 0 instead of NaN if we have k duplicate points in the data.

dbscan_1.1-5

23 Oct 14:13
Compare
Choose a tag to compare

New Features

  • kNN and frNN gained parameter query to query neighbors for points not in the data.
  • sNN gained parameter jp to decide if the shared NN should be counted using the definition by Jarvis and Patrick.

dbscan_1.1-4

05 Aug 19:53
Compare
Choose a tag to compare

Bugfixes

  • kNNdist now correctly returns the distances to the kth neighbor
    (reported by zschuster).
  • dbscan: check eps and minPts parameters to avoid undefined results (reported by ArthurPERE).

New Features

  • kNNdist gained parameter all to indicate if a matrix with the distance to all
    nearest neighbors up to k should be returned.

dbscan_1.1-3

13 Nov 23:38
Compare
Choose a tag to compare

Bugfix

  • pointdensity was double counting the query point (reported by Marius Hofert).

dbscan_1.1-2

19 May 12:30
Compare
Choose a tag to compare

New Features

  • OPTICS now calculates eps if it is omitted.

Bugfix

  • Example now only uses igraph conditionally since it is unavailable
    on Solaris (reported by B. Ripley).

dbscan_1.1-0

19 Mar 11:50
Compare
Choose a tag to compare

New Features

  • HDBSCAN was added.
  • extractFOSC (optimal selection of clusters for HDBSCAN) was added.
  • GLOSH outlier score was added.
  • hullplot uses now filled polygons as the default.
  • hullplot now used PCA if the data has more than 2 dimensions.
  • Added NN superclass for kNN and frNN with plot and with adjacencylist().
  • Added shared nearest neighbor clustering as sNNclust() and sNN to calculate
    the number of shared nearest neighbors.
  • Added pointdensity function.
  • Unsorted kNN and frNN can now be sorted using sort().
  • kNN and frNN now also accept kNN and frNN objects, respectively. This can
    be used to create a new kNN (frNN) with a reduced k or eps.
  • Datasets added: DS3 and moon.

Interface Changes

  • Improved interface for dbscan() and optics(): ... it now passed on to frNN.
  • OPTICS clustering extraction methods are now called extractDBSCAN and
    extractXi.
  • kNN and frNN are now objects with a print function.
  • dbscan now also accepts a frNN object as input.
  • jpclust and sNNclust now return a list instead of just the
    cluster assignments.