The AGIS System and Changing SchedConfig Parameters at AGLT2 Why AGIS Information about changeover to AGIS came in this Email of 2/28/2013 Ladies and gentleme...
Michigan/AGLT2 SuperComputing 2015 Network Demonstrations This year the University of Michigan and AGLT2 are again participating in SuperComputing 2015. The venue...
The ATLAS LIVE monitor is a NEC P521 AVT purchased near the beginning of April 2011. It is located in the hallway just outside West Hall 348 (the Michigan ATLAS ...
This document describes how user with grid certificate can access the files stored in the AGLT2 dCache system! All files in dCache need to be copied to a local fi...
IO with EOS command lines User can access (list and read) files in CERN EOS from non lxplus nodes, without being authenticated. In order to get full permission(wr...
Replaced Disks That Show "Foreign" Status Such a simple thing, but such a pain. We've all seen this, replace a failed disk in a RAID array with a salvaged disk, ...
Download openafs kernel source rpm and install on build system. The SRPM from https://linat05.grid.umich.edu/pub/SLC/4x/custom/SRPMS/openafs 1.4.6 1.1_AGLT2.src....
AFS Tape Backups with Amanda Amanda Commands For operations with amanda, you should be the amanda user on bambi: "su amanda". The exception is "amrecover". Her...
Controlling the ATLAS Queues, and the pilot rate Much of the basic command structure is documented in this document. There is also a newer document about setting...
Auto Test Programs over AGLT2 Cluster Related PNFS mount point test Purpose Make sure every computer node has "/pnfs/aglt2.org" mounted , and every gridftp door ...
Bi weekly AGLT2 site meeting notes Thursday, June 26, 2008 See SecurityPlanning Security Notes Need to check syslog ng configuration for all hosts and base on mo...
Details of common settings for the CMC on a Dell Blade Chassis are shown via screen shots in the attached MS Word document. Note that power supplies on the chassi...
Configuration of the UM CERN Computing Cluster in BAT 188 In November 2014, the UM CERN Computing Cluster was upgraded to SLC6. Some old hardware was retired, new...
Grid Certificate Distribution at AGLT2 The certificates in /etc/grid security/certificates are used by the OSG authentication stack. It is a regularly updated, st...
Transition from CFEngine v2 to v3, and Build dCache Pool Servers Introduction As documented elsewhere in this Wiki, cfengine2 is currently (Oct 2012) in use to c...
MSU 2008 May BNX2 DKMS Ganglia Found that the existing bnx2 network driver was the cause of the large spikes in the ganglia network plots. It intermittently pu...
Cluster Control This is the main page for information about the Cluster Monitoring and Control tools. At this time, the state of two conditions is maintained an m...
Manuals Cobbler manual: http://www.cobblerd.org/manuals/ For information on the Cheetah template language used in kickstart templates: http://www.cheetahtemplate....
Setting up Condor CE Condor CE is a replacement for globus on our gatekeepers. Condor G can still be used to submit jobs to the gatekeeper, but then the JobRoute...
Job Queing at Michigan State NOTE: THIS PAGE IS PRETTY MUCH OUT OF DATE The queing system at Michigan State has not yet been established. Job Queing at the Unive...
Condor Batch System This is the main page for administrative info about the Condor batch system(s) in use at AGLT2. User info is at CondorUser. A description of t...
Planning Condor Configuration Updates for AGLT2 Now that AGLT2 is running on an SL6.4 OS we can plan on implementing some new features in Condor that will take ad...
If a line starts with $ it is a command to be run as a normal user, if it starts with # it is a command to be run as root. * $ cd /net/data07/tests/stress_test/ ...
Upgrading dCache at AGLT2 from 2.10.55 1 to 2.13.23 1 We are upgrading to the next golden release of dCache on February 23, 2016. We have setup CFEngine to have t...
(Re)Configuring the log4j.properties in dCache to output to Syslog At AGLT2 we have setup a central loghost running syslog ng which also has a php syslog ng web i...
This page is OBSOLETE All site services now run at BNL. AGLT2 DQ2/DDM Verification and Debugging One of the central tasks for our Tier2 is to support a DQ2 ser...
Just starting this out... Tom, May 21 Data Storage Locations What locations are in use and how they are used. We have a number of storage locations for AGLT2: ...
Cacti Setup for Dell Nodes The Dell PE1950 and PE2950 nodes have a large number of fans and temperature probes which are not exposed via SNMP. The presents a pro...
AGLT2/Dell.OmreportOmconfig There is a Dell ROCKS Roll, do we want that? AGLT2/Dell.DellOrderStatusMSU check status of a Dell order for MSU AGLT2/Dell.DellService...
IDRAC web interace IDRAC web interface can provide the virtual console and the log files of the system. How to access the idrac web interface URL: https://idrac ...
Dell Poweredge x950 Hardware Notes Information about the Dell 1950 and 2950 nodes. Dell Docs BIOS The Fall '07 order arrived with BIOS v1.5.1. This was release...
Installing DQ2 References * https://twiki.cern.ch/twiki/bin/view/Atlas/PandaDataService Procedure Shawn created a host cert from doegrids.org for umfs02.grid...
Setting up Two OSG Gatekeepers for a Single Condor Cluster We need to load balance access to our Condor cluster because of a possible time out issue we are seeing...
Extending LVM Disks on VMware VMs We sometimes have partitions fill during operations and when those partitions are on VMs and using LVM we can easily extend them...
Dear Atlas US grid participant: REMINDER The next USATLAS Facilities and Operations phone meeting will occur Wed 1200 1330 CDT This is a reminder about a c...
FTS Channel Management Instructions The FTS channels for AGLT2 and be managed using the glite software. You will need an ATLAS VOMS production role (voms proxy i...
Getting a Grid Certificate for ATLAS Use For getting a new certificate or renew a certificate,you can use the CERN CA to request the grid certificate: https://ca....
Testing Glusterfs For testing purposes only we used the Redhat Storage Appliance demo which has gluster tools pre installed. Docs are here: http://docs.redhat.com...
Installing Google Chrome on SL7 for use with VMWare * Create the google chrome repo on umt3int05 google chrome name=google chrome baseurl=http://dl.google.co...
Setup of GRAM Auditing for AGLT2 (OSG 0.8.1) The current OSG installation (0.8.1) has Globus 4.0.5 which supports a new "auditing" feature. You can request that ...
HS06 Measurements Performed at AGLT2 We have made a variety of measurements at AGLT2 during September of 2009 in preparation for the upcoming purchase cycle. We p...
AGLT2 IP Addresses Information on IP addresses for the Tier2 For detail list of IPs at MSU see ask Tom for msu ips.ods or see configs/msu/network/msu ips.csv For ...
Implementing Network QoS at AGLT2 Recently we have seen periods where our LANs have been congested and packets are dropped. This has resulted in some of the monit...
Installing a Main line Kernel on Scientific Linux 6.4 or CentOS 7.2 64 bit To install a main line kernel kernel is as simple as putting in place the correct elrep...
Installation of OSG 0.6.0 on gate01.aglt2.org The installation procedure for OSG 0.6.0 on gate01.aglt2.org is below. It was installed on April 2nd, 2007. Please...
LFC SQL Queries Below are some potentially useful SQL queries to check the status of the LFC. These are my test queries and I don't guarantee they are correct ...
Resizing LVM Partitions Some CERN systems were built with little space in /, with the bulk of the space in /home. However, this means HTCondor, that wants at lea...
MultiCore Condor Set UP Introduction AGLT2 implements a mix of static and dynamic job slots for MultiCore jobs. At the time of this writing, we use 10 static sl...
Installation and Configuration of Dell MD3460 Storage Basic Hardware This page refers specifically to hardware purchased in August 2016 using RBD 2016 funds. A s...
See also: * MSUDZeroOsgSE about the storage element * MSUDZeroOsgStartup Restarting the system * MSUDZeroOsgTests Testing the OSG site * MSUDZeroOsgJo...
MSU Hardware Catalog This page lists hardware at MSU. Subpages provide more details and link to hardware documentation. Rack View * WesternSciRack2005 The ra...
This page is obsolete Hardware maintenance is now logged at http://glpi.aglt2.org/ MSU Hardware Repairs Until we have a better system, I'm recording hardware rep...
for big three phase PDUs The rearmost PDU is 1. In these racks the rearmost PDU is inverted (its cord comes out the top). * place label like "MAC 00:00:00:00...
MSU Tier 2 Administration MSU's computing resources make up approximately half of AGLT2. These machines are jointly administrated by MSU and UM. This page will br...
User Info for MSU Tier3 Regulations Your usage of the cluster must conform with MSU's acceptible use statement http://www.msu.edu/au/ Privacy The cluster is a m...
Nov 2008 T2 Hardware Things that can happen whenever: * configure PDUs power strips * install power cords to PDUs * label needed network cables * get ...
MSU Tripp Lite UPS The storage racks at MSU have Tripp Lite SU6000RT3UHV UPSs. These are 6KVA models. One of the two PDU (power strips) in the rack is fed from ...
pe2950 Utility Node Install Have a pe2950 with 2x 250GB drive and 4x 750 GB drives. Want to set it up to support a variety of cluster services including running ...
* RebuildComputeNode Rebuilding a ROCKS compute node * RespondToDownNode What to do with a down node * ControlledShutdown How to bring the cluster down nice...
Hardware Transition Planning from head01 (old R610) to head01 temp (new R630) We purchased a new Dell R630 to act as replacement hardware for our existing head01 ...
Local AGLT2 Monitors There are many monitors we've implemented. These include both AGLT2 and general USATLAS pages. Summaries * AGL Compute Summary page of Ph...
MSU OSG OSG site information and policy. Currently the MSU OSG site is 100% allocated to "SAMGrid" processing for the DZero Experiment. An SRM/dCache v2.2 SE is l...
Installation of NDT on ndt.aglt2.org See also Patrick McGuigan's page at NDTInstallation. Installation overview (more details below) 1. Applied the web100 ker...
Some Addresses At U M 198.32.43.193 an interface on Nile All UM Networks and purposes:http://www.itcom.itd.umich.edu/backbone/umnet/Tool to list all known IP ass...
Network Issues at AGLT2 This page is intended to capture the network related issues at AGLT2 Network Issues after UltraLight Router at Starlight (R04CHI) was Ret...
Planning for the production network. NetworkHardwareInfo Near term To Do List Here is a list of network related items that need doing as of February 4, 2011: ...
Network Testing and Debugging for AGLT2 During the last year we have seen many indications that all is not right with our network connections to BNL (and perhaps ...
Network Tuning and Testing On September 18, 2007 Dimitri Katramatos, Kunal Shroff and Shawn McKee tried to test and tune the following machines at BNL and Michiga...
Evaluation and testing of Nexsan SATABeast with B60E expansion Unpacking and Installation See photos and some comments here: https://picasaweb.google.com/ben.mee...
Numpy and Scipy at AGLT2 The numpy and scipy software packages are in common use at AGLT2, but, the installed versions are somewhat old, having to do with the dea...
* InstallUpgradeOSG Modified April 5, 2011, for OSG 1.2.19 install, B.Ball * UpdateOSGOnGatekeeper How to update the OSG and condor ce on gatekeepers ...
The content below was copied from the OSG install Twiki page on June 5, 2006. This was done to allow us to use this Twiki to record install details for our OSG i...
OpenAFS and Kerberos on Windows Software prerequisites Kerberos for windows. The current release of OpenAFS 1.7.4 recommends the Heimdal Kerberos implementation. ...
Setting up Oracle on Linux The following documents the installation and setup of Oracle at the University of Michigan for use by the ATLAS Muon Calibration and Al...
Installing Updated Muon Calibration Schema New schema was made available in early February 2008. Since the changes were significant I totally removed the origin...
Some info on Oracle setup at AGLT2 * Oracle Installation on linux for the ATLAS Muon Calibration/Alignment centers. * Oracle MuonDB updated (new) schema Feb...
Oracle Upgrade from 10.2.0.2 to 10.2.0.3 Prior to installing the Rome muon calibration DB for replication we needed to update our Oracle installation. I received...
Installing pCache and LSM at AGLT2 We are interested in setting up both a Local Site Mover (LSM) and pCache on our worker nodes. The goals are: * Reduce the I...
Useful PNFS/Chimera SQL Queries NOTE: This page assumes you are running Chimera/PNFS rather than the older PNFS from dCache 1.8.x or earlier. First query: Fix PN...
The PanDA Auto Exclusion process for ANALY_AGLT2 Introduction Procedures here were documented by D. van der Ster in this talk. To see this you will need a CERN ...
Install Postgresql on CentOS/RHEL/SL with Replication for Esmond This Wiki topic covers installing Postgresql with replication to support the Esmond DB. You will ...
Upgrading Postgresql on CentOS/RHEL/SL with Hot standby Systems This Wiki topic covers upgrading our existing PostgreSQL version 9.3.11 on Scientific Linux 6.7 64...
Upgrading Postgresql from 9.5 to 10.5 We want to go to the most recent Postgresql for use by dCache, at least on head01.aglt2.org. Currently Postgresql is version...
Postgresql on ZFS AGLT2 has been running Postgresql on top of ZFS on our head01.aglt2.org (dCache headnode) for more than 1 year. Recently we came across an inter...
The instructions use the c6 1 24 1 (Dell C6420) as an example Switch ports Available ports Look for all the switches for available ports The c6420 nodes need 2 ...
Overview See these URLs for an overview: https://twiki.cern.ch/twiki/bin/view/Atlas/PandaRun https://twiki.cern.ch/twiki/bin/view/Atlas/PandaTools Setup procedur...
Index of other pages ForemanPuppetInitialSetup unorganized notes from initial setup. Mostly you won't need these. HOWTO: Build new host with foreman Mostly se...
Raritan Dominion MSU MSU has a Dominion KX132. This is a 32 port model. In 2008, the Dominion KX series has been replaced in Raritan's line with the Dominion KX ...
Recovering from a Lost Pool When we lose a pool we need to do a number of things to recover. Once we determine we have really lost the pool we will need to find t...
* SwitchAccess including how to find where a node is on the network * NodeConsoleAccess Including via KVM and IPMI/DRAC * NodePowerControl including PDUs an...
Reworking AGLT2's Logging Setup In upgrading atgrid we have an opportunity to migrate from syslog ng and php syslog ng to something new. The ELK stack (Elasticse...
ROCKS This is the main local page for the ROCKS cluster software. Subpages: * BuildingRocksRolls * RocksAglReleases Notes on configs used in production ...
Setup and Running ATLAS Software (from Ed Diehl email) I have found in the past that the validation scripts have errors themselves, or there are other obscure pro...
Security Monitoring: Setup and Configuration * Snort setup information and configuration * Syslog ng setup for AGLT2 * SEC setup and configuration usin...
See http://www.sensatronics.com/index.php/industrial monitors/model e4.html Need to connect using serial port to make IP configuration. It has a web server and ...
Tier2 Services at UM Services for Tier2 job submission and remote monitoring are distributed across several physical machines at UM. Below is a breadown of what ...
Athena can be tricky to set up and run under your user account. These are some minimal directions to follow. The ATLAS Computing Workbook is chock full of helpfu...
In order to setup your GRID Certificate, you need to have already completed the initial steps of requesting the certificate, registering for membership in the ATL...
Installing gssklog/gssklogd on our cluster We have user home spaces (including grid "group" accounts) in our AFS cell (atlas.umich.edu). Currently any user tryi...
Setting Up OMD on AGLT2 Systems Monitoring for AGLT2 has used lots of different software: Ganglia, Syslog ng, Cacti, Nagios, Shinken, Rancid, Monit/MMonit, OpenMa...
login to any of the interactive machine(unt3int01 05), run the following commands #localSetupATLAS or run export ATLAS_LOCAL_ROOT_BASE=/cvmfs/atlas.cern.ch/repo...
Setting up SSH Keys for AGLT2 SSH is able to use a variety of methods for authenticating users. Each method has security strengths and weaknesses. The normal user...
Notes on Building ShawnGenerator Below are the series of steps I used to create my "ShawnGenerator". This generator is based upon source code from Loek Hooft va...
This page keeps a "shopping list" of needed equipment and parts as well as a reference for suppliers. Add items with a date. When items are purchased, please ma...
Site Blacklisting via DQ2 Commands If you want to exclude your site from DQ2 you need to use the dq2 set location status command. The specific command is: dq2 set...
Using Slony to Replicate dCache Postgresql DBs We have been running postgresql 9.0.x on our dCacheheadnodes for almost two years. Currently we have have the follo...
Running ATLAS software on SLC43 x86_64 We had installed the mars01.cern.ch mars05.cern.ch systems with SLC43 and an experimental kernel. Though we could install ...
ATLAS Software Tutorials Introduction This is a listing of tutorials for using the ATLAS Software. When possible, tutorials specific to usage on the G...
Solaris Installation notes A lot of this was compiled and figured out via extensive reading of this forum. There are a number of confusions and misleading direct...
Squid rpm Installation or Update Follow directions at this OSG Twiki page for installing. More directions are available at General CERN Twiki page. Summary step...
Subversion Subversion is a software revision control system designed to be an improvement on CVS. It generally replicates the features of CVS. References * T...
Installation, Setup, and Usage reference for Sun hardware and Solaris SolarisOnPE2950 Installation of Solaris 10 (08/07) on umfs11 SunX4540ConfigFreeBSD In...
Hardware notes here...little to say about setup. As the manual also says, hit appropriate keys during boot and configure the management/IPMI card using an exter...
Info about actually using the 4540 is at SunX4540ConfigSolaris Migrating pre installed X4540 Solaris 10 to boot from flash drive NOTE: There's probably no need ...
System Install Checklist (UM Systems) Attached to this document is a tarball containing reasonable examples, many of which can probably be used with no modificati...
Installation Details for VDT v1.6.0 We are trying the VDT v1.6.0 installation on gate01.grid.umich.edu following instructions at the VDT 1.6.0 release note page. ...
AGLT2 Account and Resource Policy In order to meet the requirements placed upon our Tier2 we are implementing a GUMS/VOMS/PRIMA configuration (so called "Full Pri...
This document helps the UM Tier3 users to diagnose their condor job problems. Submitting Machines Tier 3 users can submit their condor jobs from the following ma...
Trouble Atlas Atlas Analysis Job mishandled OSG APP Paul, Bob, OSG_APP should be "/atlas/data08/OSG/APP". "atlas_app/atlas_rel" are subdirectory created when i...
Setups taken from bl 13 1 when it was set up as an interactive machine root@bl 13 1 network scripts # cat ifcfg eth0 (This NIC is NOT trunked) DEVICE=eth0 HWAD...
Michigan Computers Overview The Michigan computer cluster consists of several interactive machines, and 2 condor batch queue clusters. Here is the current list ...
Installing and Using X2Go for UM T3 Users This page describes how to get the X2Go client software, install it on Windows (explicitly, it will likely also go on a ...
Tier3 for Users For information on using ATLAS software please see this section of our index page: WebHome#AGLT2_User_Information Information here includes how to...
UMATLAS yum repository NOTE: As of January, 2018, sysprov02, an SL7 VM, has replaced sysprov01, and sysprov01 has been shut down. All refs to sysprov01 below have...
Update Kerberos on our Servers The kerberos servers were installed long ago when DES was the primary encryption. We need to change to using newer more secure algo...
Updating LFC for AGLT2 The LFC host for AGLT2 is lfc.aglt2.org. This is a VMware VM (SL5.2/x86_64). As of September 13, 2009 the LFC software was installed in /...
On December 23, 2005 I "upgraded" umfs01.grid.umich.edu from the i686 (32 bit) version of Scientific Linux 4.1 to the x86_64 (64 bit) version of Scientific Linux ...
Upgrade Planning for AGLT2 SL5 Systems Introduction We need to upgrade our remaining SL5 systems to SL6 soon. We should use this page to track which systems stil...
Useful Links This page will link to many pages useful for day to day administration Monitoring * Ganglia Monitoring * PerfSonar (latency) (bandwidth) * ...
Using CVMFS at AGLT2 General Information CVMFS is a new method of distributing ATLAS software that relies on using central repositories of software on servers lo...
Setting up Geant4 A 64 bit gcc 4.1.2 Geant4 installation for SL5 is located at /afs/atlas.umich.edu/opt/geant4. All available data packages are included. For a li...
Using "Monit" for Monitoring and Repairing AGLT2 Services NOTE: THIS PAGE IS NOW MOSTLY OBSOLETE, WITH MONIT INSTALLED VIA CFENGINE The monit application monitors...
Using OMD and GLPI for AGLT2 We have some nice tools installed to monitor our systems and software (OMD/Check_MK) and track the resolution of problems (GLPI). It ...
VMWare Setup and Updates This page should keep track of VMware related setup/updates and information. Update to vSphere 5.1 This section will document the detail...
Video Conferencing Help Asking for help or suggestions Email: aglt2 umich #64;umich.edu 348 West Hall Howto Guides Set outputs for each screen On the "HDMI ...
Virtuozzo Information and HowTo We have been testing Virtuozzo on our new virtualization hardware. Virtuozzo runs multiple "servers" on a host system, sharing t...
WLCG Accounting for Tier 2 Sites This page contains some plots showing WLCG Tier 2 accounting results for Tier 2's worldwide. Currently the plots are only availa...
AGLT2 Web Preferences The following settings are web preferences of the AGLT2 web. These preferences overwrite the site level preferences in . and , and c...
Installing and Using X2Go We are dropping the Remote Desktop machine aglt2rd, and replacing it with a Linux machine set, starting with bridge um at the UM site, a...
Introduction xCAT is a cluster management tool originally developed at IBM and now Open Source. xCAT v1 was rewritten with much of the same functionality but a n...
yum cron Configuration in SL7 Un modified yum cron ALWAYS sends emails upon completion. This is an overwhelming flood given the number of systems we have. We th...
Using ZFS on Linux for AGLT2 AFS Fileservers Recently ZFS on Linux became available. ZFS has lots of nice features including Copy On Write (COW), data integrity v...