Changes for page Announcements
Last modified by hbpadmin on 2025/01/09 12:12
Summary
- Page properties (2 modified, 0 added, 0 removed)
Details
- Page properties
- Author
@@ -1,1 +1,1 @@
1 -XWiki.mmorgan1
1 +XWiki.hbpadmin
- Content
@@ -1,9 +1,43 @@
1 1 (% class="wikigeneratedid" %)
2 +=== **Unable to save files to Drive** ===
3 +
4 +The file system that the Drive runs on is currently full. Unfortunately, our detection of the file system being nearly full did not work. As such, users cannot upload files or save any changes to their files in the Drive. Due to another, unrelated issue, it is not possible for us to simply expand the file system. For this reason, we are currently in the process of moving the Drive data to a bigger volume. This is causing the delay. We will update this page as soon as we are finished with the move.
5 +
6 +=== **No uploads allowed to Data-proxy (Fixed)** ===
7 +
8 +At the moment uploads are not permitted due to the data-proxy exceeding its quota allowance. We are working to solve this as soon as possible.
9 +\\**Fixed: **Quota was increased, files can now be uploaded again.
10 +
11 +=== **Collaboratory Drive maintenance (2022-08-19) (Completed)** ===
12 +
13 +The Drive was meant to be taken down for routine maintenance to increase the space available for Drive storage this afternoon. That operation has had to be rescheduled due to technical issues on the storage infrastructure.
14 +
15 +=== **Intermittent issues with the Bucket (data-proxy) (Solved)** ===
16 +
17 +As reported by the main banner, there had been intermittent issues with the Bucket occasionally going down for a short amount of time. This has been resolved by the maintenance performed at CSCS on August 10th. If you encounter any further issues related to the Bucket, please open a ticket to support.
18 +
19 +=== **Storage and Cloud service maintenance (2022-08-10) (Completed)** ===
20 +
21 +This maintenance was shifted from August 3 to August 10 due to an incident on another server managed by the ETHZ central IT services.
22 +
23 +A maintenance operation at CSCS requires that some HBP/EBRAINS services be stopped Wednesday August 3 morning. The services affected are those using NFS storage volumes on the Castor cloud service. EBRAINS service providers that migrate their VMs to CEPH storage ahead of that date can keep their services running during the maintenance.
24 +
25 +**__Timeline__**: all times CEST
26 +
27 +* **08:00**: Service providers shut down services running on OpenStack or OpenShift at CSCS
28 +* **08:30**: Maintenance start by CSCS team
29 +* **12:00**: Planned maintenance end by CSCS team. Service providers check that services have come back online.(% style="color:#95a5a6" %) (% style="color:#1abc9c" %)Check this page for updates
30 +* (% style="color:#1abc9c" %)15:20: Maintenance ended at 15:20 CEST.
31 +
32 +The storage back-end used by HBP/EBRAINS services has been causing some issues which have had repercussions on access to the object storage and OpenStack cloud service and thereby on HBP/EBRAINS services which run on this infrastructure. The issue has been identified and CSCS is ready to deploy a patch on the storage back-end. This will require that services running on OpenStack at CSCS be stopped for the duration of the maintenance.
33 +
34 +There is never a good time for maintenance. We’re heading into a few weeks when more users will be on vacation, and some of the service providers may also be away. Hopefully this will impact as few people as possible. We apologize in advance for any inconvenience the downtime may cause.
35 +
2 2 === **Infrastructure issues at CSCS (2022-08-01)** ===
3 3
4 -The infrastructure at CSCS on which EBRAINS services run has failed over the weekend. August 1 is a bank holiday in Switzerland where CSCS is located so odds are slim that the infrastructure can be recovered before office hours on Tuesday August 2.
38 +The infrastructure at CSCS on which EBRAINS services run has failed over the weekend. August 1 was a bank holiday in Switzerland, where CSCS is located. The situation was recovered before 10:00 CEST on Tuesday August 2.
5 5
6 -The services affected are all those running on the OpenShift service at CSCS including:
40 +The services affected were all those running on the OpenShift service at CSCS including:
7 7
8 8 * Collaboratory Lab at CSCS (please choose the JSC site when starting the Lab),
9 9 * image service,
@@ -14,20 +14,6 @@
14 14
15 15 We apologize for the inconvenience.
16 16
17 -=== //**Storage and Cloud service maintenance (2022-08-03)**// ===
18 -
19 -//A maintenance operation at CSCS requires that HBP/EBRAINS services be stopped Wednesday August 3 morning.(% style="color:#1abc9c" %) Pending final confirmation on Tuesday August 2.//
20 -
21 -//**__Timeline__**: all times CEST//
22 -
23 -* //**08:00**: Service providers shut down services running on OpenStack or OpenShift at CSCS//
24 -* //**08:30**: Maintenance start by CSCS team//
25 -* //**12:00**: Planned maintenance end by CSCS team. Service providers check that services have come back online. (% style="color:#1abc9c" %)Check this page for updates//
26 -
27 -//The storage back-end used by HBP/EBRAINS services has been causing some issues which have had repercussions on access to the object storage and OpenStack cloud service and thereby on HBP/EBRAINS services which run on this infrastructure. The issue has been identified and CSCS is ready to deploy a patch on the storage back-end. This will require that services running on OpenStack at CSCS be stopped for the duration of the maintenance.//
28 -
29 -//There is never a good time for maintenance. We’re heading into a few weeks when more users will be on vacation, and some of the service providers may also be away. Hopefully this will impact as few people as possible. We apologize in advance for any inconvenience the downtime may cause.//
30 -
31 31 === **Infrastructure issues (2022-07-13)** ===
32 32
33 33 Several services on our EBRAINS **OpenShift **server running at CSCS were detected as having issues.