Prep 1.1 release #757

Merged: 79 commits, Mar 27, 2021
Changes from 71 commits

Commits
03b7939
Adding section for UDF serialization
Niharikadutta Apr 20, 2020
4ef693d
removing guides from master
Niharikadutta Apr 20, 2020
81145ca
Merge latest from master
Niharikadutta May 6, 2020
e4b81af
merging latest from master
Niharikadutta May 7, 2020
4c32173
Merge remote-tracking branch 'upstream/master'
Niharikadutta Jun 2, 2020
4987a09
Merge remote-tracking branch 'upstream/master'
Niharikadutta Jun 14, 2020
ca9612e
Merge remote-tracking branch 'upstream/master'
Niharikadutta Jun 16, 2020
f581c86
Merge remote-tracking branch 'upstream/master'
Niharikadutta Jun 20, 2020
086b325
Merge remote-tracking branch 'upstream/master'
Niharikadutta Jun 23, 2020
2f72907
Merge remote-tracking branch 'upstream/master'
Niharikadutta Jul 25, 2020
6bab996
CountVectorizer
Jul 27, 2020
e2a566b
moving private methods to bottom
Jul 27, 2020
5f682a6
changing wrap method
Jul 28, 2020
31371db
setting min version required
Jul 31, 2020
60eb82f
undoing csproj change
Jul 31, 2020
ed36375
member doesnt need to be internal
Jul 31, 2020
c7baf72
too many lines
Jul 31, 2020
d13303c
removing whitespace change
Jul 31, 2020
f5b477c
removing whitespace change
Jul 31, 2020
73db52b
ionide
Jul 31, 2020
98f5e4d
Merge remote-tracking branch 'upstream/master'
Niharikadutta Aug 7, 2020
4c5d502
Merge remote-tracking branch 'upstream/master'
Niharikadutta Aug 10, 2020
a766146
Merge branch 'master' into ml/countvectorizer
GoEddie Aug 12, 2020
ad6bced
Merge branch 'ml/countvectorizer' of https://github.com/GoEddie/spark
Niharikadutta Aug 13, 2020
8e1685c
Revert "Merge branch 'master' into ml/countvectorizer"
Niharikadutta Aug 13, 2020
255515e
Revert "Merge branch 'ml/countvectorizer' of https://github.com/GoEdd…
Niharikadutta Aug 13, 2020
a44c882
Merge remote-tracking branch 'upstream/master'
Niharikadutta Aug 14, 2020
3c2c936
fixing merge errors
Niharikadutta Aug 14, 2020
88e834d
removing ionid
Niharikadutta Aug 20, 2020
59e7299
Merge remote-tracking branch 'upstream/master'
Niharikadutta Aug 20, 2020
a13de2d
Merge branch 'master' of github.com:Niharikadutta/spark
Niharikadutta Aug 21, 2020
13d0e4a
Merge remote-tracking branch 'upstream/master'
Niharikadutta Aug 24, 2020
595b141
Merge remote-tracking branch 'upstream/master'
Niharikadutta Aug 29, 2020
decfa48
Merge remote-tracking branch 'upstream/master'
Niharikadutta Sep 2, 2020
ce694ff
Merge remote-tracking branch 'upstream/master'
Niharikadutta Sep 8, 2020
8128ba0
Merge remote-tracking branch 'upstream/master'
Niharikadutta Sep 12, 2020
52f0a74
Merge remote-tracking branch 'upstream/master'
Niharikadutta Sep 19, 2020
6a89f01
Merge remote-tracking branch 'upstream/master'
Niharikadutta Sep 24, 2020
4b1de41
Merge remote-tracking branch 'upstream/master'
Niharikadutta Sep 25, 2020
929d8e2
Merge remote-tracking branch 'upstream/master'
Niharikadutta Sep 26, 2020
ffa0a4d
Merge remote-tracking branch 'upstream/master'
Niharikadutta Oct 2, 2020
2579faa
Merge remote-tracking branch 'upstream/master'
Niharikadutta Oct 5, 2020
2297add
Merge remote-tracking branch 'upstream/master'
Niharikadutta Oct 6, 2020
daade7a
Merge remote-tracking branch 'upstream/master'
Niharikadutta Oct 8, 2020
cb6aa7a
Merge remote-tracking branch 'upstream/master'
Niharikadutta Oct 12, 2020
cbe6e50
Merge branch 'master' of github.com:Niharikadutta/spark
Niharikadutta Oct 12, 2020
3a04b19
Merge remote-tracking branch 'upstream/master'
Niharikadutta Oct 12, 2020
2c498dc
Merge remote-tracking branch 'upstream/master'
Niharikadutta Oct 13, 2020
d19cfb6
Merge remote-tracking branch 'upstream/master'
Niharikadutta Oct 16, 2020
d34188e
Merge branch 'master' of github.com:Niharikadutta/spark
Niharikadutta Oct 16, 2020
5457ffb
Merge remote-tracking branch 'upstream/master'
Niharikadutta Oct 26, 2020
f92820f
first commit
Niharikadutta Oct 28, 2020
1478b1c
formatting
Niharikadutta Oct 28, 2020
cfb8154
formatting
Niharikadutta Oct 28, 2020
a0e556a
update jar name
Niharikadutta Oct 28, 2020
0d5e89c
fix table
Niharikadutta Oct 28, 2020
11706d0
Merge branch 'master' into nidutta/release-1.0.1
suhsteve Nov 3, 2020
b9801c8
Updating release notes
Niharikadutta Nov 4, 2020
40f3a32
Merge branch 'nidutta/release-1.0.1' of github.com:Niharikadutta/spar…
Niharikadutta Nov 4, 2020
b66f23f
PR comments
Niharikadutta Nov 5, 2020
a58fa74
fixed formatting
Niharikadutta Nov 5, 2020
a64fd50
update
Niharikadutta Nov 6, 2020
c8e48f8
Merge branch 'master' into nidutta/release-1.0.1
Niharikadutta Nov 6, 2020
4fe60a9
updating release notes
Niharikadutta Nov 6, 2020
72ed2cc
Merge branch 'nidutta/release-1.0.1' of github.com:Niharikadutta/spar…
Niharikadutta Nov 6, 2020
005282f
Merge branch 'master' into nidutta/release-1.0.1
suhsteve Mar 17, 2021
1236113
prep for 1.1
Niharikadutta Mar 26, 2021
56265b7
Merge branch 'main' into nidutta/release-1.0.1
Niharikadutta Mar 26, 2021
907efc6
changes
Niharikadutta Mar 26, 2021
c52ce05
Merge branch 'nidutta/release-1.0.1' of github.com:Niharikadutta/spar…
Niharikadutta Mar 26, 2021
63d2648
fix formatting
Niharikadutta Mar 26, 2021
1fceb0f
PR review comment
Niharikadutta Mar 27, 2021
2219cdd
PR review comment
Niharikadutta Mar 27, 2021
f443933
PR review comment
Niharikadutta Mar 27, 2021
893b09f
change
Niharikadutta Mar 27, 2021
d6acb43
Update docs/release-notes/1.1/release-1.1.md
imback82 Mar 27, 2021
fb8ce2f
Update docs/release-notes/1.1/release-1.1.md
imback82 Mar 27, 2021
af0f631
Update docs/release-notes/1.1/release-1.1.md
imback82 Mar 27, 2021
8924ae2
Merge branch 'main' into nidutta/release-1.0.1
imback82 Mar 27, 2021
2 changes: 1 addition & 1 deletion README.md
@@ -39,7 +39,7 @@
<tbody align="center">
<tr>
<td >2.3</td>
-<td rowspan=3><a href="https://github.com/dotnet/spark/releases/tag/v1.0.0">v1.0.0</a></td>
+<td rowspan=3><a href="https://github.com/dotnet/spark/releases/tag/v1.1.0">v1.1.0</a></td>
</tr>
<tr>
<td>2.4*</td>
2 changes: 1 addition & 1 deletion benchmark/scala/pom.xml
@@ -3,7 +3,7 @@
<modelVersion>4.0.0</modelVersion>
<groupId>com.microsoft.spark</groupId>
<artifactId>microsoft-spark-benchmark</artifactId>
-<version>1.0.0</version>
+<version>1.1.0</version>
<inceptionYear>2019</inceptionYear>
<properties>
<encoding>UTF-8</encoding>
208 changes: 208 additions & 0 deletions docs/release-notes/1.1/release-1.1.md
@@ -0,0 +1,208 @@
# .NET for Apache Spark 1.1 Release Notes

### New Features/Improvements

* Support for Arrow 2.0 and GroupedMapUdf in Spark 3.0.0 ([#711](https://github.com/dotnet/spark/pull/711))
* Use pattern matching in arrow test utils to improve readability ([#725](https://github.com/dotnet/spark/pull/725))
* Check whether file is found before trying to dereference it ([#759](https://github.com/dotnet/spark/pull/759))
* Ml/feature hasher has only internal constructors ([#761](https://github.com/dotnet/spark/pull/761))
* Support for stop words removers ([#726](https://github.com/dotnet/spark/pull/726))
* Support for adding NGram functionality ([#734](https://github.com/dotnet/spark/pull/734))
* Add support for SQLTransformer ML feature ([#781](https://github.com/dotnet/spark/pull/781))
* Add skeletal support for FileSystem extension ([#787](https://github.com/dotnet/spark/pull/787))
* Use (processId, threadId) as the key to maintain the threadpool executor instead of only threadId ([#793](https://github.com/dotnet/spark/pull/793))
* Support for Hyperspace 0.4.0 ([#815](https://github.com/dotnet/spark/pull/815))
* Support for Delta Lake 0.8.0 ([#823](https://github.com/dotnet/spark/pull/823))
* Add support for Spark 3.0.2 ([#833](https://github.com/dotnet/spark/pull/833))
* Migrating master to main branch ([#847](https://github.com/dotnet/spark/pull/847), [#849](https://github.com/dotnet/spark/pull/849))
* Add DOTNET_WORKER_<ver>_DIR environment variable ([#861](https://github.com/dotnet/spark/pull/861))
* Add spark.dotnet.ignoreSparkPatchVersionCheck conf to ignore patch version in DotnetRunner ([#862](https://github.com/dotnet/spark/pull/862))
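
The change to the threadpool executor key in #793 can be illustrated with a minimal sketch. This is not the library's implementation: `ExecutorRegistry` and its methods are hypothetical names, and the motivation shown (thread IDs from different processes may coincide, so a key of threadId alone could make two callers share one executor) is an assumption inferred from the item's wording.

```python
import threading
from concurrent.futures import ThreadPoolExecutor
from typing import Dict, Tuple


class ExecutorRegistry:
    """Hypothetical registry that maps a caller identity to a dedicated executor."""

    def __init__(self) -> None:
        self._lock = threading.Lock()
        # Keyed by (process_id, thread_id) rather than thread_id alone.
        self._executors: Dict[Tuple[int, int], ThreadPoolExecutor] = {}

    def get(self, process_id: int, thread_id: int) -> ThreadPoolExecutor:
        """Return the executor for this caller, creating it on first use."""
        key = (process_id, thread_id)
        with self._lock:
            executor = self._executors.get(key)
            if executor is None:
                executor = ThreadPoolExecutor(max_workers=1)
                self._executors[key] = executor
            return executor

    def shutdown(self) -> None:
        """Stop every executor and forget all keys."""
        with self._lock:
            for executor in self._executors.values():
                executor.shutdown(wait=True)
            self._executors.clear()
```

With this keying, `get(100, 1)` and `get(200, 1)` return different executors even though the thread ID is the same, while repeated calls with the same pair return the same one.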

### Bug Fixes

* Fix signer information mismatch issue ([#752](https://github.com/dotnet/spark/pull/752))
* Fix package-worker.ps1 to handle output path with ":" ([#742](https://github.com/dotnet/spark/pull/742))
* Fixes for TimestampType and DateType conversion ([#765](https://github.com/dotnet/spark/pull/765))
* Fix for using Broadcast variables in Databricks ([#766](https://github.com/dotnet/spark/pull/766))
* Fix macOS Catalina Permissions ([#784](https://github.com/dotnet/spark/pull/784))
* Fix for memory leak in JVMObjectTracker ([#801](https://github.com/dotnet/spark/pull/801))

### Infrastructure / Documentation / Etc.

* Fix flaky CallbackTests.TestCallbackHandlers Test ([#745](https://github.com/dotnet/spark/pull/745))
* Run E2E tests on Linux in build pipeline and add Backward/Forward E2E tests ([#737](https://github.com/dotnet/spark/pull/737))
* Add comments and cleanup azure pipeline ([#764](https://github.com/dotnet/spark/pull/764))

Review comment (Contributor): Let's remove this one.

* Update dotnet-interactive deprecated feed ([#807](https://github.com/dotnet/spark/pull/807), [#808](https://github.com/dotnet/spark/pull/808))
* Remove unnecessary RestoreSources ([#812](https://github.com/dotnet/spark/pull/812))

### Breaking Changes

* None

### Known Issues

* Broadcast variables do not work with [dotnet-interactive](https://github.com/dotnet/interactive) ([#561](https://github.com/dotnet/spark/pull/561))
* UDFs defined using class objects with closures do not work with [dotnet-interactive](https://github.com/dotnet/interactive) ([#619](https://github.com/dotnet/spark/pull/619))
* In [dotnet-interactive](https://github.com/dotnet/interactive), blocking Spark methods that require external threads to unblock them do not work; e.g., `StreamingQuery.AwaitTermination` requires `StreamingQuery.Stop` to unblock ([#736](https://github.com/dotnet/spark/pull/736))
* UDFs don't work in Linux with Spark 2.3.0 ([#753](https://github.com/dotnet/spark/issues/753))

### Compatibility

#### Backward compatibility

The following table describes the oldest version of the worker that the current version is compatible with, along with new features that are incompatible with the worker.

<table>
<thead>
<tr>
<th>Oldest compatible Microsoft.Spark.Worker version</th>
<th>Incompatible features</th>
</tr>
</thead>
<tbody align="center">
<tr>
<td>v1.0.0</td>
<td>GroupedMap in Spark 3.0 is not compatible with Worker 1.0 <a href="https://github.com/dotnet/spark/pull/654">(#654)</a></td>
</tr>
</tbody>
</table>

#### Forward compatibility

The following table describes the oldest .NET for Apache Spark release that the current worker is compatible with.

<table>
<thead>
<tr>
<th>Oldest compatible .NET for Apache Spark release version</th>
</tr>
</thead>
<tbody align="center">
<tr>
<td>v1.0.0</td>
</tr>
</tbody>
</table>

### Supported Spark Versions

The following table lists the supported Spark versions and the microsoft-spark JAR to use with each:

<table>
<thead>
<tr>
<th>Spark Version</th>
<th>microsoft-spark JAR</th>
</tr>
</thead>
<tbody align="center">
<tr>
<td>2.3.*</td>
<td>microsoft-spark-2-3_2.11-1.1.0.jar</td>
</tr>
<tr>
<td>2.4.0</td>
<td rowspan=7>microsoft-spark-2-4_2.11-1.1.0.jar</td>
</tr>
<tr>
<td>2.4.1</td>
</tr>
<tr>
<td>2.4.3</td>
</tr>
<tr>
<td>2.4.4</td>
</tr>
<tr>
<td>2.4.5</td>
</tr>
<tr>
<td>2.4.6</td>
</tr>
<tr>
<td>2.4.7</td>
</tr>
<tr>
<td>2.4.2</td>
<td><a href="https://github.com/dotnet/spark/issues/60">Not supported</a></td>
</tr>
<tr>
<td>3.0.0</td>
<td rowspan=3>microsoft-spark-3-0_2.12-1.1.0.jar</td>
</tr>
<tr>
<td>3.0.1</td>
</tr>
<tr>
<td>3.0.2</td>
</tr>
</tbody>
</table>
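
The table above can be read as a lookup from a Spark version to a JAR file name. The sketch below encodes only the rows listed in the table; the function name `jar_for` and the data layout are hypothetical and are not part of .NET for Apache Spark.

```python
import re

JAR_VERSION = "1.1.0"

# (major, minor) -> (JAR flavor, Scala version, supported patch numbers).
# None means any patch number, matching the "2.3.*" row of the table.
JARS = {
    (2, 3): ("2-3", "2.11", None),
    (2, 4): ("2-4", "2.11", {0, 1, 3, 4, 5, 6, 7}),  # 2.4.2 is not supported
    (3, 0): ("3-0", "2.12", {0, 1, 2}),
}


def jar_for(spark_version: str) -> str:
    """Return the microsoft-spark JAR name listed for a Spark version."""
    match = re.fullmatch(r"(\d+)\.(\d+)\.(\d+)", spark_version)
    if match is None:
        raise ValueError(f"Not a major.minor.patch version: {spark_version!r}")
    major, minor, patch = (int(part) for part in match.groups())
    entry = JARS.get((major, minor))
    if entry is None:
        raise ValueError(f"Spark {spark_version} is not listed as supported")
    flavor, scala, patches = entry
    if patches is not None and patch not in patches:
        raise ValueError(f"Spark {spark_version} is not listed as supported")
    return f"microsoft-spark-{flavor}_{scala}-{JAR_VERSION}.jar"
```

For example, `jar_for("2.4.5")` yields the 2.4 JAR name from the table, while `jar_for("2.4.2")` raises `ValueError` because that version is marked as not supported.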

### Supported Delta Versions

The following table lists the supported Delta versions and the Microsoft.Spark.Extensions.Delta version to use with each:

<table>
<thead>
<tr>
<th>Delta Version</th>
<th>Microsoft.Spark.Extensions.Delta</th>
</tr>
</thead>
<tbody align="center">
<tr>
<td>0.1.0</td>
<td rowspan=9>1.1.0</td>
</tr>
<tr>
<td>0.2.0</td>
</tr>
<tr>
<td>0.3.0</td>
</tr>
<tr>
<td>0.4.0</td>
</tr>
<tr>
<td>0.5.0</td>
</tr>
<tr>
<td>0.6.0</td>
</tr>
<tr>
<td>0.6.1</td>
</tr>
<tr>
<td>0.7.0</td>
</tr>
<tr>
<td>0.8.0</td>
</tr>
</tbody>
</table>

### Supported Hyperspace Versions

The following table lists the supported Hyperspace versions and the Microsoft.Spark.Extensions.Hyperspace version to use with each:

<table>
<thead>
<tr>
<th>Hyperspace Version</th>
<th>Microsoft.Spark.Extensions.Hyperspace</th>
</tr>
</thead>
<tbody align="center">
<tr>
<td>0.1.0</td>
<td rowspan=3>1.1.0</td>
</tr>
<tr>
<td>0.2.0</td>
</tr>
<tr>
<td>0.4.0</td>
</tr>
</tbody>
</table>
2 changes: 1 addition & 1 deletion eng/Versions.props
@@ -1,7 +1,7 @@
<?xml version="1.0" encoding="utf-8"?>
<Project ToolsVersion="4.0" xmlns="http://schemas.microsoft.com/developer/msbuild/2003">
<PropertyGroup>
-<VersionPrefix>1.0.0</VersionPrefix>
+<VersionPrefix>1.1.0</VersionPrefix>
<PreReleaseVersionLabel>prerelease</PreReleaseVersionLabel>
</PropertyGroup>
</Project>
2 changes: 1 addition & 1 deletion src/scala/pom.xml
@@ -7,7 +7,7 @@
<version>${microsoft-spark.version}</version>
<properties>
<encoding>UTF-8</encoding>
-<microsoft-spark.version>1.0.0</microsoft-spark.version>
+<microsoft-spark.version>1.1.0</microsoft-spark.version>
</properties>

<modules>